How to set up damn vulnerable LLM agent with Ollama locally?

Install Ollama, pull a model like mistral-nemo, copy the .env.ollama.template, edit llm-config.yaml to specify the model, and run with streamlit. The README notes that small models may not perform well, so expect variability.

What are effective payloads for prompt injection in this agent?

The README provides spoiler examples, such as using Thought/Action/Observation injection to hijack the ReAct loop or SQL injection via user ID manipulation. Experiment with these to understand how to overwrite system messages and exploit vulnerabilities.

Damn vulnerable LLM agent vs GPTfuzz for security testing?

Damn Vulnerable LLM Agent is focused on hands-on, educational exploitation of ReAct agents with specific examples, while tools like GPTfuzz are more automated for broader prompt injection testing. Choose this for learning, but use GPTfuzz for scalable vulnerability scanning.

Can I use this with open-source LLMs other than Ollama?

Yes, it supports HuggingFace models with a token, but the README warns that results may not be reasonable with all models, and contributions are welcome to adapt it for other LLMs, as mentioned in the contributing section.

Is it safe to run this in a production environment?

No, it's designed as an educational tool with deliberate vulnerabilities, so running it in production could expose systems to attacks. Use it only in controlled, isolated settings for learning purposes.

How to contribute new vulnerabilities or improvements?

Submit pull requests or open issues on GitHub, as per the contributing section. The project is particularly interested in expanding support for open-source LLMs and enhancing educational examples.

Damn Vulnerable LLM Agent — Educational LLM Prompt Injection Tool

What is Damn Vulnerable LLM Agent?

Damn Vulnerable LLM Agent is an educational chatbot designed to demonstrate prompt injection vulnerabilities in LLM-powered ReAct agents. It provides a hands-on environment where security researchers and developers can experiment with attack techniques like Thought/Action/Observation injection to understand how malicious prompts can manipulate agent behavior. The project originated from a Capture The Flag challenge and includes practical examples of real-world exploits.

Target Audience

Security researchers, AI developers, and cybersecurity enthusiasts who want to understand LLM security vulnerabilities through practical experimentation. It's particularly valuable for those working with ReAct agents or building secure AI applications.

Value Proposition

Unlike theoretical security guides, this project offers a working, vulnerable implementation that users can directly interact with and exploit. It provides concrete examples of prompt injection attacks and supports multiple LLM backends, making it a versatile educational tool for hands-on learning.

Overview

Damn Vulnerable LLM Agent is a sample chatbot powered by a Large Language Model (LLM) ReAct agent, implemented with Langchain. It serves as an educational tool to help security professionals understand and test vulnerabilities in AI agents, specifically focusing on prompt injection techniques that can manipulate agent behavior.

Key Features

Vulnerable Chatbot Simulation — Provides a controlled environment to interact with a deliberately insecure LLM agent.
Prompt Injection Experimentation — Allows testing of various injection vectors, including Thought/Action/Observation injection.
Educational CTF Challenge — Based on a Capture The Flag competition, offering practical examples of security exploits.
Multi-LLM Support — Configurable to run with OpenAI, HuggingFace models, or local Ollama instances.

Philosophy

The project believes that understanding attack vectors through hands-on experimentation is crucial for building secure AI systems, and aims to provide a practical learning ground for the security community.

Use Cases

Best For

Learning prompt injection techniques against ReAct agents
Practicing AI security in a controlled CTF environment
Understanding Thought/Action/Observation injection vulnerabilities
Testing LLM agent security with different model backends
Educational workshops on AI cybersecurity
Researching mitigation strategies for prompt injection attacks

Not Ideal For

Production deployments requiring secure, out-of-the-box AI agents
Teams seeking pre-built, non-vulnerable chatbots for user-facing applications
Projects focused solely on AI content generation without security testing needs
Environments with limited resources for local model setup or API costs

Pros & Cons

Pros

Hands-On Security Learning

Provides a deliberately vulnerable chatbot that allows direct experimentation with prompt injection attacks, including detailed payload examples like Thought/Action/Observation injection from the README.

Real-World CTF Basis

Based on an actual Capture The Flag competition, offering practical, battle-tested examples of vulnerabilities in ReAct agents, as highlighted in the introduction.

Multi-LLM Backend Support

Configurable to run with OpenAI, HuggingFace models, or local Ollama instances, enabling flexible testing across different model types, as detailed in the installation section.

Detailed Exploit Documentation

Includes spoiler payloads with step-by-step examples for achieving flags, such as SQL injection and user ID manipulation, making it highly educational for understanding attack vectors.

Cons

Limited Model Reliability

The README admits that small LLMs 'do not perform very well as ReAct agents,' and results may vary, which can hinder consistent experimentation and learning.

Complex Setup Process

Requires managing multiple environment templates, API keys, or local Ollama installations, making initial configuration cumbersome compared to plug-and-play tools.

Narrow Vulnerability Focus

Primarily targets prompt injection in ReAct agents, lacking coverage of other AI security issues like data poisoning or model theft, which limits its scope as a comprehensive educational tool.

Damn Vulnerable LLM Agent

What is Damn Vulnerable LLM Agent?

Overview

Key Features

Philosophy

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?

Damn Vulnerable LLM Agent

What is Damn Vulnerable LLM Agent?

Overview

Key Features

Philosophy

Use Cases

Best For

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions

Related Projects

Found a gem we're missing?