Progressive Autonomy as Preference Learning: A Formalization of Trust Calibration for Agentic Tool Use

📰 ArXiv cs.AI

Learn how to formalize trust calibration for agentic tool use as a preference-learning problem to improve autonomous decision-making

advanced Published 20 May 2026

Action Steps

Formalize trust calibration as a preference-learning problem using Gaussian processes
Maintain a posterior over a latent human risk-tolerance function using probit likelihood on binary feedback
Escalate to human approval when the approval outcome is most uncertain
Implement a policy gateway to manage autonomous decision-making
Test and refine the trust calibration model using real-world scenarios

Who Needs to Know This

AI researchers and engineers working on autonomous systems can benefit from this formalization to develop more reliable and trustworthy agentic tools

Key Insight

💡 Trust calibration for agentic tool use can be formalized as a preference-learning problem using Gaussian processes and probit likelihood