Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

It's AI detector leaderboard submission #8

Merged
merged 3 commits into from
Sep 4, 2024
Merged

Conversation

sergak0
Copy link
Contributor

@sergak0 sergak0 commented Sep 4, 2024

No description provided.

Copy link

github-actions bot commented Sep 4, 2024

Eval run succeeded! Link to run: link

Here are the results of the submission(s):

It's AI

Release date: 2024-09-04

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an accuracy of 87.25%.
Without adversarial attacks, it achieved an accuracy of 91.93%.
If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID!

@liamdugan
Copy link
Owner

Congrats on the new SOTA! Merged.

@liamdugan liamdugan merged commit c2990d2 into liamdugan:main Sep 4, 2024
@sergak0
Copy link
Contributor Author

sergak0 commented Sep 4, 2024

Thanks! By the way, I've seen that in the paper you've also scored commercial ai-detectors, but I haven't found them in the leaderboard - is there a way to compare our solution with theirs?

@liamdugan
Copy link
Owner

Yes. The reason why they aren't included on the leaderboard is because we were only able to test a small portion of our test set on the commercial detectors due to budget constraints. We're currently working with these companies to get more credits so that we can add them to the leaderboard as well. We hope to add them soon.

@sergak0
Copy link
Contributor Author

sergak0 commented Sep 4, 2024

Glad to hear that you are going to add them too. Anyway great work with RAID dataset - really impressed with the amount and diversity of data that you've collected and first ai-detection leaderboard that you've made.

@liamdugan
Copy link
Owner

Thank you! And thanks for submitting :)

@AIApprentice101
Copy link

AIApprentice101 commented Sep 4, 2024

Thank you @liamdugan for the prompt update of the leaderboard. It is great to have a new SOTA method. Congrats @sergak0

I notice on your github page (https://github.com/It-s-AI/llm-detection) that there're two baseline models, the perplexity-logistic regression model, and the finetuned Deberta model. Is the Deberta model the SOTA submission here? Or is it the result from the subnet 32?

Btw, I love your idea of using Bittensor subset for the AI detection task.

@sergak0
Copy link
Contributor Author

sergak0 commented Sep 5, 2024

@AIApprentice101 yeah, it's the result from subnet 32.

Btw, I love your idea of using Bittensor subnet for the AI detection task.

Thanks, hope to make our detector even better in the future.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants