The Mirror Design Pattern: Strict Data Geometry over Model Scale for Prompt Injection Detection

📰 ArXiv cs.AI

arXiv:2603.11875v2 Announce Type: replace-cross Abstract: Prompt injection defenses are often framed as semantic understanding problems and delegated to increasingly large neural detectors. For the first screening layer, however, the requirements are different: the detector runs on every request and therefore must be fast, deterministic, non-promptable, and auditable. We introduce Mirror, a data-curation design pattern that organizes prompt injection corpora into matched positive and negative ce

Published 16 Apr 2026
Read full paper → ← Back to Reads