Add new benchmark: Basque bench #2153

Merged: 9 commits, Oct 4, 2024

zxcvuser
Contributor

BasqueBench is a benchmark for tasks in Basque that cover several evaluation areas. The datasets consist of professional translations of relevant English datasets and newly created datasets in Basque. The README.md contains detailed information on all the tasks included in the benchmark.

@CLAassistant

CLAassistant commented Jul 30, 2024

CLA assistant check
All committers have signed the CLA.

@baberabb
Contributor

Thanks very much for this PR. I identified a few small issues, noted below. Could you also run

pre-commit run --all-files

to fix the linting issues?

lm_eval/tasks/basque_bench/flores_eu/_flores_common_yaml (outdated)
lm_eval/tasks/basque_bench/xcopa_eu.yaml (outdated)
lm_eval/tasks/basque_bench/mgsm_direct_eu.yaml (outdated)
lm_eval/tasks/basque_bench/mgsm_cot_native_eu.yaml (outdated)
@zxcvuser
Contributor Author

These are the changes made:

  • Added the benchmark info to lm_eval/tasks/README.md
  • Replaced "-" with "_" in the create_files script in flores_eu and added weight_by_size: false
  • Ran the linters
  • Removed the grouping in the mgsm and copa tasks (they were pointing to pre-existing benchmarks)

With these changes, everything should be fine now. Thank you!
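
For readers following along, the flores_eu change in the list above might look roughly like the sketch below. This is not the PR's actual create_files script: the language pairs, the function names, the `bleu` metric entry, and the exact YAML layout are assumptions made for illustration. Only the "_" separator, the `_flores_common_yaml` file name, and `weight_by_size: false` come from the conversation.

```python
# Hypothetical sketch only: the language pairs, helper names, and YAML layout
# are assumptions, not the contents of the PR's create_files script.
LANG_PAIRS = [("eu", "en"), ("en", "eu"), ("eu", "es"), ("es", "eu")]


def task_name(src: str, tgt: str) -> str:
    """Join the language codes with "_" rather than "-"."""
    return f"flores_{src}_{tgt}"


def task_yaml(src: str, tgt: str) -> str:
    """Render a minimal per-pair config that pulls in the shared settings file."""
    return (
        "include: _flores_common_yaml\n"
        f"task: {task_name(src, tgt)}\n"
    )


def group_yaml(group: str, pairs) -> str:
    """Render a group config whose aggregate score is an unweighted mean."""
    lines = [f"group: {group}", "task:"]
    lines += [f"  - {task_name(s, t)}" for s, t in pairs]
    lines += [
        "aggregate_metric_list:",
        "  - metric: bleu",  # assumed metric name
        "    aggregation: mean",
        "    weight_by_size: false",  # each subtask counts equally
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(task_yaml("eu", "en"))
    print(group_yaml("flores_eu", LANG_PAIRS))
```

With `weight_by_size: false`, the group score in this sketch would be a plain average over subtasks rather than one weighted by the number of examples in each.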

Co-authored-by: Baber Abbasi <92168766+baberabb@users.noreply.github.com>
@haileyschoelkopf haileyschoelkopf merged commit c887796 into EleutherAI:main Oct 4, 2024
2 of 8 checks passed
@zxcvuser zxcvuser deleted the basque_bench branch October 9, 2024 10:05
mariagrandury pushed a commit to somosnlp/lm-evaluation-harness that referenced this pull request Oct 9, 2024
* Add basque_bench

* Add flores_eu group

* Update _flores_common_yaml

* Run linters, updated flores, mgsm, copa, and readme

* Apply suggestions from code review

Co-authored-by: Baber Abbasi <92168766+baberabb@users.noreply.github.com>

---------

Co-authored-by: Baber Abbasi <92168766+baberabb@users.noreply.github.com>
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
4 participants