Learning from Imperfect Demonstrations via Temporal Behavior Tree-Guided Trajectory Repair

📰 ArXiv cs.AI

arXiv:2604.04225v1 Announce Type: cross Abstract: Learning robot control policies from demonstrations is a powerful paradigm, yet real-world data is often suboptimal, noisy, or otherwise imperfect, posing significant challenges for imitation and reinforcement learning. In this work, we present a formal framework that leverages Temporal Behavior Trees (TBT), an extension of Signal Temporal Logic (STL) with Behavior Tree semantics, to repair suboptimal trajectories prior to their use in downstream

Published 7 Apr 2026

Read full paper → ← Back to News