AgenticCache: Cache-Driven Asynchronous Planning for Embodied AI Agents

📰 ArXiv cs.AI

arXiv:2604.24039v1 Announce Type: cross Abstract: Embodied AI agents increasingly rely on large language models (LLMs) for planning, yet per-step LLM calls impose severe latency and cost. In this paper, we show that embodied tasks exhibit strong plan locality, where the next plan is largely predictable from the current one. Building on this, we introduce AgenticCache, a planning framework that reuses cached plans to avoid per-step LLM calls. In AgenticCache, each agent queries a runtime cache of

Published 28 Apr 2026

Read full paper → ← Back to Reads