Evolutionary Search for Automated Design of Uncertainty Quantification Methods
📰 ArXiv cs.AI
arXiv:2604.03473v1 Announce Type: cross Abstract: Uncertainty quantification (UQ) methods for large language models are predominantly designed by hand based on domain knowledge and heuristics, limiting their scalability and generality. We apply LLM-powered evolutionary search to automatically discover unsupervised UQ methods represented as Python programs. On the task of atomic claim verification, our evolved methods outperform strong manually-designed baselines, achieving up to 6.7% relative RO
DeepCamp AI