Researchers Unveil CaMeL: New Defense Against Prompt Injection Attacks on AI Agents, Solving 67% of Security Tasks

Defeating Prompt Injections by Design

View PDF Abstract:Large Language Models (LLMs) are increasingly deployed in agentic systems that interact with an external environment. However, LLM agents are vulnerable to prompt injection attacks when handling untrusted data. In this paper we propose CaMeL, a robust defense that creates a protective system layer around the LLM, securing it even when underlying models may be susceptible to attacks. To operate, CaMeL explicitly extracts the control and data flows from the (trusted) query; there...