To See or To Please: Uncovering Visual Sycophancy and Split Beliefs in VLMs

📰 ArXiv cs.AI

arXiv:2603.18373v2 Announce Type: replace-cross Abstract: When VLMs answer correctly, do they genuinely rely on visual information or exploit language shortcuts? We introduce the Tri-Layer Diagnostic Framework, which disentangles hallucination sources via three metrics: Latent Anomaly Detection (perceptual awareness), Visual Necessity Score (visual dependency, measured via KL divergence), and Competition Score (conflict between visual grounding and instruction following). Using counterfactual in

Published 17 Apr 2026
Read full paper → ← Back to Reads