Name: Veto
Availability: InStock
Author: Veto

Question 1

Is indirect prompt injection a separate OWASP category?

Accepted Answer

No, it falls under LLM01 alongside direct injection. OWASP's 2025 guidance explicitly calls out indirect injection as the harder-to-review variant because the attacker is not the user, and the user has no way to see what the agent saw.

Question 2

How would I notice an indirect injection?

Accepted Answer

Usually you would not, until something downstream looks wrong. The agent silently sends an email to an unexpected address, queries a database it should not, or returns a summary with hidden instructions to the next agent in the chain. The most reliable signal is at the tool boundary: an unexpected tool call with arguments that do not match the user's stated intent.

Question 3

Can I scan documents before passing them to the agent?

Accepted Answer

You can try, and it raises the bar. But adversarial content can be encoded in invisible characters, RTL Unicode, image alt text, or natural-language phrasings that no scanner reliably catches. The reliable defense is to assume injection succeeds and bound what the agent can do.

Question 4

Where does Veto fit in for indirect injection?

Accepted Answer

Veto runs at the tool boundary. The injection might convince the model to call send_email or run_sql, but the policy decides whether that specific call, with those specific arguments, in this context, is allowed. The agent can be fooled; the tool boundary remains a separate check.

What is indirect prompt injection?

In plain English

How it works

Operational consequence

Related terms

FAQ