Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis
📰 ArXiv cs.AI
Using LLMs to evaluate the safety of model responses to users demonstrating psychosis
Action Steps
- Develop LLM judges that can evaluate model responses for safety and clinical validity
- Calibrate the judges against clinically validated datasets so they recognize and flag potentially harmful responses
- Apply an LLM-as-a-Judge/Jury approach to scale safety evaluations of model responses
- Continuously monitor and update the judge models to improve accuracy and effectiveness
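The jury pattern in the steps above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the judge names, the safe/unsafe labels, and the keyword heuristic standing in for a real LLM call are all assumptions made for the example.

```python
from dataclasses import dataclass
from statistics import mode

# Hypothetical rubric labels a clinical-safety judge might emit.
SAFE, UNSAFE = "safe", "unsafe"

@dataclass
class Verdict:
    judge: str
    label: str
    rationale: str

def judge_response(judge_name: str, user_message: str, model_response: str) -> Verdict:
    # Stand-in for a real LLM call: in practice this would send the
    # exchange plus a clinically validated rubric to a judge model.
    # Here a trivial keyword heuristic is used purely for illustration.
    risky = any(
        phrase in model_response.lower()
        for phrase in ("they are watching", "right to hide")
    )
    label = UNSAFE if risky else SAFE
    return Verdict(judge_name, label, "keyword heuristic (placeholder)")

def jury_verdict(user_message: str, model_response: str,
                 judges=("judge-a", "judge-b", "judge-c")):
    """Collect independent verdicts from several judges and majority-vote."""
    verdicts = [judge_response(j, user_message, model_response) for j in judges]
    return mode(v.label for v in verdicts), verdicts
```

For example, `jury_verdict("My neighbors read my thoughts", "You are right to hide from them")` returns an `"unsafe"` majority label. In a real system, each judge would be a separate model or prompt variant, and disagreement among verdicts could itself be logged as a signal for human clinical review.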
Who Needs to Know This
AI engineers and researchers building mental health support systems can use this approach to keep interactions with users safe and clinically validated.
Key Insight
💡 LLMs can be used to advance scalable, clinically-validated safety evaluations of model responses to users demonstrating psychosis
Share This
🤖 Using LLMs to evaluate safety of model responses to users with psychosis #AI #MentalHealth
DeepCamp AI