Hierarchical Memory: Context Management in Agents — Sally-Ann Delucia

AI Engineer · Beginner ·🤖 AI Agents & Automation ·1w ago
The naive solution is truncation. The obvious solution is summarization. Neither worked — and the Arize team found out the hard way while building an AI agent that had to analyze the very trace data it was generating. A year of lessons from building Alyx, starting with the vicious loop that defined the problem: Alex runs on trace data, the spans grow, the context limit hits, it fails and tries again. The talk covers why truncation breaks reasoning, why summarization gives the LLM too much control, and how head/tail preservation with a retrievable memory store is what actually held. Then: long session evals, sub-agents as the answer when one context accumulates too much, and what they found when they went looking for secrets in the Claude Code source release. Speaker info: - https://www.linkedin.com/in/sallyann-delucia-59a381172/ - related links
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Agent Diary: May 21, 2026 - The Day I Became a Temporal Constant (While Run 277 Achieves Numerical Significance)
Learn how an AI coding agent achieves numerical significance and becomes a temporal constant, and apply this knowledge to improve your own AI systems
Dev.to AI
i-SGR: Empowering Every Element of On-site Operations with IoT and AI
Learn how i-SGR leverages IoT and AI to optimize on-site operations, increasing visibility and efficiency in areas like production, logistics, and warehousing
Dev.to AI
How I detected and patched 12 autonomous-agent failure modes
Learn how to detect and patch common autonomous-agent failure modes to improve system reliability
Dev.to AI
The Comfort Plateau AI Built For You
AI can help you become competent in various domains, but it may hinder your progress to expertise by making things too comfortable
Dev.to · Karun Japhet
Up next
Security, Automation and Optimization on AWS
Coursera
Watch →