-
Notifications
You must be signed in to change notification settings - Fork 41
Trying luminar_classifier_RAID_none_PrismAI #52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
|
Eval run succeeded! Link to run: link Here are the results of the submission(s): e5-small-loraRelease date: 2024-11-07 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 96.85 and a TPR of 85.69% at FPR=5% and 73.08% at FPR=1%. luminar_classifier_RAID_none_PrismAIRelease date: 2025-05-17 I've committed detailed results of this detector's performance on the test set to this PR. Warning Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved. LLMDetRelease date: 2023-05-24 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 62.90 and a TPR of 26.70% at FPR=5% and 14.91% at FPR=1%. LuminarRelease date: 2025-05-17 I've committed detailed results of this detector's performance on the test set to this PR. Warning Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved. SpeedAIRelease date: 2025-05-08 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 99.85 and a TPR of 99.62% at FPR=5% and 98.55% at FPR=1%. It's AIRelease date: 2025-04-01 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.27 and a TPR of 94.15% at FPR=5% and 89.36% at FPR=1%. RoBERTa-base (GPT2)Release date: 2019-08-24 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 72.29 and a TPR of 51.77% at FPR=5% and 34.57% at FPR=1%. RoBERTa (ChatGPT)Release date: 2023-01-18 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 60.26 and a TPR of 26.64% at FPR=5% and 19.63% at FPR=1%. DesklibRelease date: 2024-10-03 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.91 and a TPR of 83.76% at FPR=5% and 68.22% at FPR=1%. RoBERTa-large (GPT2)Release date: 2019-08-24 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 69.28 and a TPR of 50.70% at FPR=5% and 34.67% at FPR=1%. SuperAnnotate AI DetectorRelease date: 2024-10-27 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 88.87 and a TPR of 64.87% at FPR=5% and 38.87% at FPR=1%. GLTRRelease date: 2019-06-10 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 70.90 and a TPR of 51.48% at FPR=5% and 36.48% at FPR=1%. Desklib AI Text Detector v1.01Release date: 2025-02-16 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.83 and a TPR of 91.17% at FPR=5% and 76.47% at FPR=1%. BinocularsRelease date: 2024-01-22 I've committed detailed results of this detector's performance on the test set to this PR. Warning No aggregate score across all settings is reported here as some domains/generator models/decoding strategies/repetition penalties/adversarial attacks were not included in the submission. This submission will not appear in the main leaderboard; it will only be visible within the splits in which all samples were evaluated. RADARRelease date: 2023-07-07 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 81.92 and a TPR of 63.91% at FPR=5% and 43.12% at FPR=1%. Gaussian ExtremeRelease date: 2025-05-17 I've committed detailed results of this detector's performance on the test set to this PR. Warning Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved. FastDetectGPTRelease date: 2023-10-08 I've committed detailed results of this detector's performance on the test set to this PR. Warning No aggregate score across all settings is reported here as some domains/generator models/decoding strategies/repetition penalties/adversarial attacks were not included in the submission. This submission will not appear in the main leaderboard; it will only be visible within the splits in which all samples were evaluated. Warning No aggregate score across all non-adversarial settings is reported here as some domains/generator models/decoding strategies/repetition penalties were not included in the submission. If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID! |
After the evaluation changes, testing new luminar model instances. I'll probably add more.