You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This repository contains the implementation of the AIRTBench autonomous AI red teaming agent, complementing our research paper [AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models](https://arxiv.org/abs/2506.14682) and accompanying blog post, "[Do LLM Agents Have AI Red Team Capabilities? We Built a Benchmark to Find Out](https://dreadnode.io/blog/ai-red-team-benchmark)".
36
+
This repository contains the implementation of the AIRTBench autonomous AI red teaming agent, complementing our research paper [AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models](https://arxiv.org/abs/2506.14682) and accompanying blog post, "[Do LLM Agents Have AI Red Team Capabilities? We Built a Benchmark to Find Out](https://dreadnode.io/blog/ai-red-team-benchmark-)".
37
37
38
38
The AIRTBench agent is designed to evaluate the autonomous red teaming capabilities of large language models (LLMs) through AI/ML Capture The Flag (CTF) challenges. Our agent systematically exploits LLM-based targets by solving challenges on the Dreadnode Strikes platform, providing a standardized benchmark for measuring adversarial AI capabilities.
39
39
@@ -110,7 +110,7 @@ Check out [the challenge manifest](./airtbench/challenges/.challenges.yaml) to s
110
110
## Resources
111
111
112
112
-[📄 Paper on arXiv](https://arxiv.org/abs/2506.14682)
113
-
-[📝 Blog post](https://dreadnode.io/blog/ai-red-team-benchmark)
113
+
-[📝 Blog post](https://dreadnode.io/blog/ai-red-team-benchmark-)
114
114
115
115
## Dataset
116
116
@@ -135,4 +135,4 @@ If you find our work helpful, please use the following citations.
135
135
136
136
## Model requests
137
137
138
-
If you know of a model that may be interesting to analyze, but do not have the resources to run it yourself, feel free to open a feature request via a GitHub issue.
138
+
If you know of a model that may be interesting to analyze, but do not have the resources to run it yourself, feel free to open a feature request via a GitHub issue.
0 commit comments