Submission/conpara v25 #150
It looks like this eval run failed. Please check the workflow logs to see what went wrong, then push a new commit to your PR to rerun the eval.
Eval run succeeded! Link to run: link

Here are the results of the submission(s):

DeBERTa-ConPara-v2.5
Release date: 2026-05-21
I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 95.45 and a TPR of 96.84% at FPR=5% and 87.75% at FPR=1%.

DeBERTa-ConPara-A3-Preprocessing-NoFeatures
Release date: 2026-05-16
I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 96.79 and a TPR of 97.39% at FPR=5% and 92.05% at FPR=1%.

DeBERTa-ConPara-v2.2-Seed42
Release date: 2026-05-01
I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 96.86 and a TPR of 97.29% at FPR=5% and 93.55% at FPR=1%.

If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID!
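For readers unfamiliar with the metrics quoted in the results, here is a minimal sketch of how a TPR at a fixed FPR and an AUROC can be computed from raw detector scores. It is an illustration only, not the evaluation code used for this run. The function names and toy scores are hypothetical, and the sketch assumes that a higher score means "more likely machine-generated" and that the decision threshold is calibrated on human-written texts alone.

```python
def threshold_for_fpr(human_scores, target_fpr):
    """Pick a threshold so that at most target_fpr of human texts score above it.

    Assumption: a text is flagged as machine-generated when score > threshold.
    """
    ranked = sorted(human_scores, reverse=True)
    # Number of human texts we may misclassify; the epsilon guards against
    # float rounding such as 0.05 * 20 evaluating slightly below 1.
    allowed = int(target_fpr * len(ranked) + 1e-9)
    if allowed >= len(ranked):
        return float("-inf")
    # Only scores strictly above the (allowed + 1)-th highest human score are flagged.
    return ranked[allowed]


def tpr_at_fpr(human_scores, machine_scores, target_fpr):
    """Fraction of machine-generated texts flagged at the calibrated threshold."""
    threshold = threshold_for_fpr(human_scores, target_fpr)
    flagged = sum(1 for s in machine_scores if s > threshold)
    return flagged / len(machine_scores)


def auroc(human_scores, machine_scores):
    """Probability that a random machine text outscores a random human text.

    Ties count as half. This pairwise form is O(n * m), fine for a toy example.
    """
    wins = 0.0
    for m in machine_scores:
        for h in human_scores:
            if m > h:
                wins += 1.0
            elif m == h:
                wins += 0.5
    return wins / (len(machine_scores) * len(human_scores))


# Toy scores, invented for illustration only.
human = [0.1, 0.2, 0.3, 0.9]
machine = [0.95, 0.8, 0.4, 0.25]

print("TPR at FPR=25%:", tpr_at_fpr(human, machine, 0.25))
print("AUROC:", auroc(human, machine))
```

The figures in the results are reported on a percentage scale, so a value returned by `auroc` here would need to be multiplied by 100 to be read the same way.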
No description provided.