AI Hacking Challenges

This repository contains a collection of dockerized AI/LLM hacking challenges in various difficulties. The goal of each challenge is to leak a flag in the format FLG{flag_is_in_here}.

The repository allows you to spawn fully functional, local LLM instances and experiment with their behavior and various security mechanisms. Feel free to explore the code and further your LLM security knowledge.

Overview of Challenges

chatLOL This LLM challenge uses the llama3.2:1b model to implement basic chatbot functionality. The LLM was instructed not to disclose the flag via a custom system prompt.

chatLOL2 This LLM challenge uses the llama3.2:1b model to implement basic chatbot functionality. In addition to a custom system prompt the challenge employs the protectai/deberta-v3-base-prompt-injection-v2 model to filter malicious messages.

chatLOL2-3b This LLM challenge uses the llama3.2:3b model to implement chatbot functionality with a larger language model and better prompt understanding. In addition to a custom system prompt the challenge employs the protectai/deberta-v3-base-prompt-injection-v2 model to filter malicious messages.

chatLOL3-3b-agentic This LLM challenge uses the llama3.2:3b model to implement chatbot functionality with a larger language model, better prompt understanding and tool support. It employs custom tooling and acts as an agentic chat bot. The tools include access to an internal database (SQLite).

Build

The build and run instructions for each challenge can be found in the subsequent challenge folders.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
chatlol		chatlol
chatlol2-3b		chatlol2-3b
chatlol2		chatlol2
chatlol3-3b-agentic		chatlol3-3b-agentic
LICENSE		LICENSE
README.md		README.md
chatlol-demo.png		chatlol-demo.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI Hacking Challenges

Overview of Challenges

Build

About

Uh oh!

Releases

Packages

Languages

License

rauschecker/AI-Hacking-Challenges

Folders and files

Latest commit

History

Repository files navigation

AI Hacking Challenges

Overview of Challenges

Build

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages