Skip to content

Conversation

@oscarzhou511
Copy link

Hi @liamdugan,
I've fixed all of the permission issues on Github and have uploaded a full prediction.json file with no missing tests. It would be great if you could approve an evaluation. Thank you so much for your help with this!

Oscar :)

@github-actions
Copy link

Eval run succeeded! Link to run: link

Here are the results of the submission(s):

Veredict Labs AI Detector

Release date: 2025-11-17

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 93.68 and a TPR of 89.84% at FPR=5% and 73.14% at FPR=1%.
Without adversarial attacks, it achieved AUROC of 96.28 and a TPR of 93.80% at FPR=5% and 84.70% at FPR=1%.

If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID!

@liamdugan
Copy link
Owner

@oscarzhou511 would you like me to merge this result?

@oscarzhou511
Copy link
Author

@liamdugan not just yet...I'm working on improving the method! Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants