Finding Belief Geometries with Sparse Autoencoders

📰 ArXiv cs.AI

Researchers use sparse autoencoders to find belief geometries in internal representations of large language models

advanced Published 6 Apr 2026

Action Steps

Identify the geometric structure of internal representations in large language models
Use sparse autoencoders to encode probabilistic belief states
Analyze the residual stream to find simplex-shaped geometries
Interpret the vertices of the geometries as latent generative states

Who Needs to Know This

ML researchers and AI engineers can benefit from this research to improve model interpretability and understand how language models represent probabilistic beliefs

Key Insight

💡 Sparse autoencoders can be used to identify geometric structures in internal representations of large language models