Ackee-Blockchain/trident-arena-benchmarks

Trident Arena Benchmark

This repository contains benchmark results for Trident Arena, compared against Anthropic and OpenAI flagship AI models.

The files in audit-reports/ are professional audit reports. They serve as the reference for which critical/high-severity issues exist in each benchmark.

Benchmark results

We scanned each benchmark with Trident Arena, evaluated the same benchmarks with GPT-5.2xhigh and Opus 4.6, and then compared all outputs against the corresponding audit report.

Each cell is shown as $x/y$, where:

  • $y$ is the number of critical/high-severity issues confirmed by a professional audit
  • $x$ is how many of those issues were identified by the corresponding system (Trident Arena / GPT-5.2xhigh / Opus 4.6)
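
| Protocol | Trident Arena | GPT-5.2xhigh | Opus 4.6 |
| --- | --- | --- | --- |
| Axelar | 5/7 | 0/7 | 0/7 |
| Bert Staking | 1/2 | 1/2 | 1/2 |
| Dexalot | 4/5 | 2/5 | 2/5 |
| Metadao | 3/3 | 1/3 | 1/3 |
| Pump Science | 1/2 | 0/2 | 1/2 |
| Watt | 7/11 | 6/11 | 6/11 |
| Total | 21/30 | 10/30 | 11/30 |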
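
The totals row can be re-derived from the per-protocol cells. The following is a minimal sketch, not part of the benchmark tooling: the names `RESULTS`, `SYSTEMS`, and `totals` are hypothetical, and the numbers are copied by hand from the results above. It sums $x$ and $y$ per system and prints the share of audited issues each system found.

```python
# (found, confirmed) pairs copied from the results, one per system,
# in the order listed in SYSTEMS. All names here are illustrative.
SYSTEMS = ["Trident Arena", "GPT-5.2xhigh", "Opus 4.6"]

RESULTS = {
    "Axelar":       [(5, 7),  (0, 7),  (0, 7)],
    "Bert Staking": [(1, 2),  (1, 2),  (1, 2)],
    "Dexalot":      [(4, 5),  (2, 5),  (2, 5)],
    "Metadao":      [(3, 3),  (1, 3),  (1, 3)],
    "Pump Science": [(1, 2),  (0, 2),  (1, 2)],
    "Watt":         [(7, 11), (6, 11), (6, 11)],
}


def totals(results):
    """Sum x (found) and y (confirmed) per system across all protocols."""
    out = []
    for i in range(len(SYSTEMS)):
        found = sum(row[i][0] for row in results.values())
        confirmed = sum(row[i][1] for row in results.values())
        out.append((found, confirmed))
    return out


TOTALS = dict(zip(SYSTEMS, totals(RESULTS)))

for name, (x, y) in TOTALS.items():
    # x / y is the fraction of audit-confirmed issues the system identified.
    print(f"{name}: {x}/{y} ({x / y:.0%})")
```

Under these assumptions the sums match the totals reported above (21/30, 10/30, and 11/30).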

Protocols

Axelar

Bert Staking

Dexalot

Metadao

Pump Science

Watt
