Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries
📰 ArXiv cs.AI
Large language models exhibit categorical perception in hidden states when processing Arabic numerals, with enhanced discriminability at digit-count boundaries
Action Steps
- Analyze LLM hidden-state representations using representational similarity analysis (RSA)
- Identify geometric warping at category boundaries, such as digit-count boundaries (e.g., 99 → 100)
- Fit CP-additive models to quantify how strongly categorical perception shapes LLM representations
- Use the findings to inform model design and fine-tuning decisions, such as adjusting architecture or training data
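The first two steps above can be sketched as a small RSA-style check: compare representational dissimilarity for adjacent numerals within a digit-count category versus across the boundary. The embeddings below are synthetic stand-ins (in real use you would extract hidden states from a model layer); the construction, dimensionality, and noise level are assumptions for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in for LLM hidden states of numerals 90..109.
# Real usage: extract one hidden-state vector per numeral from a model layer.
numerals = list(range(90, 110))
dim = 64

# Synthetic embeddings: a shared digit-count component plus per-numeral noise,
# mimicking a representation warped at the 99 -> 100 boundary.
centers = {2: rng.normal(size=dim), 3: rng.normal(size=dim)}
states = np.stack(
    [centers[len(str(n))] + 0.3 * rng.normal(size=dim) for n in numerals]
)

def cosine_dist(a, b):
    """Cosine dissimilarity between two representation vectors."""
    return 1.0 - a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

# RSA-style dissimilarity for adjacent numeral pairs, split by whether the
# pair crosses a digit-count boundary.
within, across = [], []
for i in range(len(numerals) - 1):
    d = cosine_dist(states[i], states[i + 1])
    if len(str(numerals[i])) == len(str(numerals[i + 1])):
        within.append(d)
    else:
        across.append(d)

print(f"mean within-category distance: {np.mean(within):.3f}")
print(f"mean cross-boundary distance:  {np.mean(across):.3f}")
# Categorical perception predicts the cross-boundary distance exceeds the
# within-category distance (enhanced discriminability at the boundary).
```

A larger cross-boundary gap than within-category gap is the signature of categorical perception the paper's title describes; a CP-additive model would then quantify that boundary effect while controlling for numerical distance.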
Who Needs to Know This
ML researchers and AI engineers: understanding how LLMs represent categorical information can inform model design and fine-tuning decisions
Key Insight
💡 LLMs exhibit enhanced discriminability at category boundaries, similar to human perceptual psychology
Share This
🤖 LLMs exhibit categorical perception in hidden states when processing Arabic numerals! #LLMs #CategoricalPerception
DeepCamp AI