CascadeDebate: Multi-Agent Deliberation for Cost-Aware LLM Cascades
📰 ArXiv cs.AI
arXiv:2604.12262v1 Announce Type: cross Abstract: Cascaded LLM systems coordinate models of varying sizes with human experts to balance accuracy, cost, and abstention under uncertainty. However, single-model tiers at each stage often struggle with ambiguous queries, triggering premature escalations to costlier models or experts due to under-confidence and inefficient compute scaling. CascadeDebate addresses this gap by inserting multi-agent deliberation directly at each tier's escalation boundar
DeepCamp AI