Build Agents That Run for Hours (Without Losing the Plot) — Ash Prabaker & Andrew Wilson, Anthropic
Why self-evaluation is a trap and adversarial evaluator agents work better; why context compaction doesn't cure coherence drift but structured handoffs do; how to decompose work into testable sprint contracts; how to grade subjective output with rubrics an LLM can actually apply; and how to read traces as your primary debugging loop. Plus the question nobody asks: which parts of your harness should you delete when the next model drops?
Speaker info:
- Ash Prabaker | https://www.linkedin.com/in/ash-prabaker/
- Andrew Wilson | https://www.linkedin.com/in/anddwilson/