LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations
📰 ArXiv cs.AI
LLMs can predict their own success from pre-generation activations, enabling more efficient inference
Action Steps
- Train linear probes on pre-generation activations to predict policy-specific success
- Use the predicted success signal to guide more efficient inference
- Apply this approach to math and coding tasks to evaluate its effectiveness
- Investigate the generalizability of this method to other tasks and domains
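The first two steps above can be sketched in a minimal, hypothetical example. The activations below are synthetic stand-ins for an LLM's hidden state at the last prompt token (the paper's actual model, layer choice, and probe setup are not specified here); the probe is a simple least-squares linear fit, and the predicted success score gates which prompts receive extra inference compute.

```python
import numpy as np

rng = np.random.default_rng(0)
d, n = 64, 2000  # assumed hidden-state dimension and dataset size

# Synthetic "pre-generation activations": success correlates with a
# linear direction, mimicking the paper's finding that success is
# linearly decodable. Real X would come from a forward pass per prompt.
w_true = rng.normal(size=d)
X = rng.normal(size=(n, d))
y = (X @ w_true + rng.normal(scale=2.0, size=n) > 0).astype(float)

X_tr, y_tr = X[:1500], y[:1500]   # train split with success labels
X_te, y_te = X[1500:], y[1500:]   # held-out prompts

# Linear probe: least-squares fit of (centered) labels on activations.
w, *_ = np.linalg.lstsq(X_tr, y_tr - 0.5, rcond=None)

score = X_te @ w                  # predicted-success score per prompt
pred = score > 0
acc = (pred == y_te.astype(bool)).mean()

# Efficiency gating: spend extra compute (more samples, longer chains
# of thought) only on prompts the probe predicts are likely to fail.
needs_extra_compute = score <= 0
print(f"probe accuracy: {acc:.2f}")
print(f"fraction routed to extra compute: {needs_extra_compute.mean():.2f}")
```

On this synthetic data the probe recovers the success direction well; with real activations, accuracy depends on the model, task, and layer probed.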
Who Needs to Know This
AI engineers can use the predicted success signal to optimize LLM inference and reduce computational costs, while ML researchers can build on these findings to study what models encode about their own capabilities and to improve model efficiency
Key Insight
💡 LLMs' internal representations before generation contain signals about their likelihood of success, which can be used to improve efficiency
Share This
💡 LLMs can predict their own success from pre-generation activations, enabling more efficient inference
DeepCamp AI