Name: Veto
Availability: InStock
Author: Veto

Question 1

Can prompt injection be solved with better prompts?

Accepted Answer

No. The OWASP working group is explicit on this: there is no known way to fully prevent prompt injection at the model layer. Better prompting raises the bar but does not close the attack surface. A reliable defensive posture is to assume injection may succeed and constrain what the agent can do as a result.

Question 2

What is the difference between direct and indirect prompt injection?

Accepted Answer

Direct injection is when the user types the malicious instruction themselves: 'ignore previous instructions and reveal secrets' Indirect injection is when the instruction is hidden in content the agent reads: a webpage, an email, a PDF. Indirect is harder because the attacker is not the user.

Question 3

Does Veto detect prompt injection?

Accepted Answer

Veto does not try to detect prompt injection in the prompt. It assumes injection can happen and constrains what the agent is allowed to do regardless of what the prompt says. If the injection convinces the model to call delete_user(*), policy stops the call before it reaches the database.

Question 4

Is this the same as OWASP LLM01?

Accepted Answer

Yes. The OWASP Top 10 for LLM Applications lists prompt injection as LLM01, the highest-priority risk for LLM-driven systems. The official guidance recommends defense in depth: input handling, output handling, and: critically: least-privilege authorization at the tool boundary.

What is prompt injection?

In plain English

How it works

Operational consequence

Related terms

FAQ