AI Agent Memory: Chat History & Semantic Caching | Lino Tadros | Azure Cosmos DB Conf 2026

Microsoft Developer · Advanced ·🤖 AI Agents & Automation ·1d ago
AI agents are only as intelligent as their ability to remember. Without persistent memory, every conversation starts from scratch — costing tokens, increasing latency, and delivering disconnected experiences. Lino Tadros (Microsoft Regional Director, Founder & Principal Architect at The Training Boss) shows how to use Azure Cosmos DB as the unified memory layer for agentic AI applications. You'll build two essential containers from scratch: • Chat History — preserves multi-turn conversations across sessions and devices • Semantic Cache — uses vector search to return prior LLM completions for semantically similar prompts, avoiding redundant API calls See the data modeling decisions that matter: partition key strategy for multi-tenant workloads, document design that balances write distribution with query locality, vector indexes tuned for fast similarity search, and TTL policies for automatic lifecycle management. Walk away with working code patterns you can apply immediately. 👤 Connect with Lino Tadros 📝 Distinguished executive leader and renowned technical expert in AI, Machine Learning, and IoT. Leads cross-functional architectural teams to award-winning performance by developing strategic roadmaps and powering enterprise-wide projects. Serves as board member and advisor for multiple corporations delivering strategic guidance on product line developments and business solutions. Industry influencer and mastermind of strategic programs and innovations leading modernization efforts to alter the global IT landscape as Microsoft Regional Director. Partnered with Microsoft to consult major corporations on Azure integrations; trained over 1,000 global employees and architects across US, Canada, Europe, Middle East, and Australia. Invited into elite Microsoft Regional Director program as Top 1% of global SMEs—maintains direct line of communication to Microsoft executive leaders and Office of the President for Technical and Business Influence. Piloted multi-million
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

🔮 Hermes Agent 🤖 — Deep Dive & Build-Your-Own Guide 📘
Learn how to build a self-improving agent like Hermes by understanding its core principles and architecture
Dev.to AI
Insurance Renewal Lift with Behavior-Driven AI Workflows
Boost insurance renewals with AI-driven workflows that address behavior, not just reminders
Medium · AI
I Used Felo to Build an AI Agent That Automatically Researches and Ranks Every PM Tool So I Don’t…
Learn how to automate research and ranking of PM tools using Felo and AI agents, saving time and increasing productivity
Medium · AI
The 2026 Agentic Era with Gemini Agent Platform: Surviving Cascading Failures and Runaway Cloud Bills.
Learn how to survive cascading failures and runaway cloud bills with the Gemini Agent Platform in the 2026 Agentic Era
Dev.to · Fayaz
Up next
Codex + GPT-5.5 = SUPER APP! Build and Do ANYTHING!
WorldofAI
Watch →