This repository contains benchmark results for Trident Arena, compared against flagship AI models from Anthropic and OpenAI.
The files in audit-reports/ are professional audit reports that serve as the reference for which critical/high-severity issues exist in each benchmark.
We scanned each benchmark with Trident Arena, evaluated the same benchmarks with GPT-5.2xhigh and Opus 4.6, and then compared all outputs against the audits.
Each cell is shown as $x/y$, where:

- $y$ is the number of critical/high-severity issues confirmed by a professional audit
- $x$ is how many of those issues were identified by the corresponding system (Trident Arena / GPT-5.2xhigh / Opus 4.6)
| Protocol | Trident Arena | GPT-5.2xhigh | Opus 4.6 |
|---|---|---|---|
| Axelar | 5/7 | 0/7 | 0/7 |
| Bert Staking | 1/2 | 1/2 | 1/2 |
| Dexalot | 4/5 | 2/5 | 2/5 |
| Metadao | 3/3 | 1/3 | 1/3 |
| Pump Science | 1/2 | 0/2 | 1/2 |
| Watt | 7/11 | 6/11 | 6/11 |
| **Total** | 21/30 | 10/30 | 11/30 |
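As a sanity check, the totals and overall recall in the table above can be recomputed with a short script. The (found, confirmed) pairs below are hand-copied from the table; the script is illustrative and not part of the repository's tooling:

```python
# Per-protocol (x, y) cells for each system, in table order:
# Axelar, Bert Staking, Dexalot, Metadao, Pump Science, Watt.
results = {
    "Trident Arena": [(5, 7), (1, 2), (4, 5), (3, 3), (1, 2), (7, 11)],
    "GPT-5.2xhigh":  [(0, 7), (1, 2), (2, 5), (1, 3), (0, 2), (6, 11)],
    "Opus 4.6":      [(0, 7), (1, 2), (2, 5), (1, 3), (1, 2), (6, 11)],
}

for system, cells in results.items():
    found = sum(x for x, _ in cells)        # issues the system identified
    confirmed = sum(y for _, y in cells)    # audit-confirmed issues
    print(f"{system}: {found}/{confirmed} ({found / confirmed:.0%})")
# → Trident Arena: 21/30 (70%)
# → GPT-5.2xhigh: 10/30 (33%)
# → Opus 4.6: 11/30 (37%)
```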
| Protocol | Report |
|---|---|
| axelar.network | audit-reports/axelar.pdf |
| bert.global | audit-reports/bert-staking.pdf |
| dexalot.com | audit-reports/dexalot.pdf |
| metadao.fi | audit-reports/metadao.pdf |
| pump.science | audit-reports/pump-science.html |
| watt.si | audit-reports/watt.pdf |