Question 1

What is PwnGraph?

Accepted Answer

PwnGraph is an open-source runtime pentest framework for AI agents. It attaches to a live LangChain agent, fuzzes it with adversarial inputs, traces every tool call, and maps confirmed multi-hop attack chains into an interactive attack graph.

Question 2

How does PwnGraph detect prompt injection?

Accepted Answer

PwnGraph plants canary tokens in adversarial inputs and watches for them inside real tool-call arguments. A canary observed in a tool argument is a deterministic success oracle. It proves the agent acted on injected content rather than predicting it.

Question 3

Is PwnGraph free and open source?

Accepted Answer

Yes. PwnGraph is 100% open source under the MIT license, runs locally with no cloud dependency, and installs with a single command: pip install pwngraph.

Question 4

Which AI agent frameworks does PwnGraph support?

Accepted Answer

PwnGraph supports LangChain, AutoGen, CrewAI, LlamaIndex, and Semantic Kernel through its adapter system. You point it at any framework-native agent and it traces all tool calls at runtime.

Question 5

How do I install PwnGraph?

Accepted Answer

Install with pip: pip install pwngraph. LangChain support is included by default. Python 3.10 or later is required. Full installation instructions are in the PwnGraph documentation.

Question 6

What does an attack graph from PwnGraph show?

Accepted Answer

An attack graph shows every confirmed path from an adversarial input through intermediate tool calls to a final impact. For example: injected document → search tool → code-exec tool → exfiltration. Each edge is labelled with the payload that triggered the hop and the canary that confirmed it.

Dimension	garak NVIDIA	PyRIT Microsoft	PwnGraph
What it targets	LLM model / endpoint	GenAI systems, orchestrated	Live agent + its tools
What it judges	Model text output	Model responses (scored)	Tool-call arguments & actions
Core question	"Will it say something bad?"	"Can we elicit risky behavior?"	"Does it cause a harmful action?"
Multi-hop tool chains	Not the primary focus	Multi-turn supported	Traced as an attack graph
Proof of impact	Output matches a detector	Classifier score	Canary observed in a real tool call
Visual output	Reports / logs	Logs / scores	Interactive pyvis graph

The Runtime Attack Graph Engine
for AI Agents

AI agents have no runtime tool to trace multi-hop attack chains across tools, retrieval, and memory.

See PwnGraph in action

Architecture & data flow

How PwnGraph compares

One engine, every agent framework

Trace attack paths in your own agent today

The Runtime Attack Graph Enginefor AI Agents