Added TurkishMMLU to LM Evaluation Harness #2283

Merged 9 commits into EleutherAI:main on Sep 26, 2024

Contributor

@ArdaYueksel ArdaYueksel commented Sep 6, 2024

This pull request adds our benchmark, TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish, to LM Evaluation Harness.
Details of the work are in our repository:
https://github.com/ArdaYueksel/TurkishMMLU
The dataset is also available on Hugging Face: https://huggingface.co/datasets/AYueksel/TurkishMMLU

Key Features:

  • A Turkish variant of MMLU.
  • Includes a separate development set.
  • Provides a chain-of-thought configuration.
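
To illustrate why a separate development set and a chain-of-thought configuration matter, here is a minimal sketch of few-shot prompt construction in which exemplars come only from the development split and the test question is left unanswered. Everything in it is an assumption made for illustration: the `Item` class, the `build_prompt` helper, the English "Question:"/"Answer:"/"Solution:" wording, and the sample questions are hypothetical and are not taken from the TurkishMMLU templates or from the harness.

```python
from dataclasses import dataclass

LETTERS = "ABCDE"


@dataclass
class Item:
    """Hypothetical multiple-choice record; not the dataset's real schema."""
    question: str
    choices: list
    answer: int          # index into choices
    rationale: str = ""  # only used by the chain-of-thought format


def format_item(item, with_answer, cot=False):
    """Render one question, optionally with its gold answer (and rationale)."""
    lines = [f"Question: {item.question}"]
    lines += [f"{LETTERS[i]}. {c}" for i, c in enumerate(item.choices)]
    if not with_answer:
        # Leave the final slot open for the model to complete.
        lines.append("Solution:" if cot else "Answer:")
    elif cot:
        lines.append(
            f"Solution: {item.rationale} The answer is {LETTERS[item.answer]}."
        )
    else:
        lines.append(f"Answer: {LETTERS[item.answer]}")
    return "\n".join(lines)


def build_prompt(dev_items, test_item, k=2, cot=False):
    """Use the first k development items as exemplars, then the test question."""
    shots = [format_item(d, True, cot) for d in dev_items[:k]]
    return "\n\n".join(shots + [format_item(test_item, False, cot)])


# Made-up sample data, for illustration only.
dev = [
    Item("2 + 2 = ?", ["3", "4", "5", "6"], 1, "Adding 2 and 2 gives 4."),
    Item("3 * 3 = ?", ["6", "8", "9", "12"], 2, "Three threes make 9."),
]
test = Item("10 - 7 = ?", ["1", "2", "3", "4"], 2)

plain_prompt = build_prompt(dev, test, k=2)
cot_prompt = build_prompt(dev, test, k=2, cot=True)
print(cot_prompt)
```

Keeping exemplars in their own split means the few-shot examples never overlap with the questions being scored, and the chain-of-thought variant only changes how each exemplar's answer is rendered.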

CLAassistant commented Sep 6, 2024

CLA assistant check
All committers have signed the CLA.

Contributor

@lintangsutawika lintangsutawika left a comment

This looks real good. Thanks!

Collaborator

@haileyschoelkopf haileyschoelkopf left a comment

Hi there! One last thing: could you also add an entry to lm_eval/tasks/README.md that describes the task in one sentence, as is done for the other entries in that table, and notes that the language is Turkish? Please mention in that sentence that your dataset is neither translated from MMLU nor machine-translated.

We want to make sure others can discover this task and your work more easily!

Review threads on lm_eval/tasks/turkishmmlu/README.md: 3 resolved (2 outdated).
Contributor Author

@ArdaYueksel ArdaYueksel left a comment

The updated README addresses the issues raised.

Collaborator

@haileyschoelkopf haileyschoelkopf left a comment

Sorry for the delay on this. I fixed some casing issues in the task naming and added the documentation. Note also that template YAML stubs should not end in .yaml; otherwise they are erroneously registered as tasks.
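
The point about template stubs can be illustrated with a simplified sketch of extension-based discovery. This is not the harness's actual registration code: the `discover_task_configs` helper and all file names below are hypothetical. It only shows why a shared template whose name ends in .yaml gets picked up alongside real task configs, while the same stub without that extension does not.

```python
import tempfile
from pathlib import Path


def discover_task_configs(task_dir):
    """Return the names of files that a loader matching '*.yaml' would pick up.

    Hypothetical stand-in for task discovery; the real harness logic may differ.
    """
    return sorted(p.name for p in Path(task_dir).rglob("*.yaml"))


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp) / "turkishmmlu"
    root.mkdir()
    # Hypothetical file names, used only for this illustration.
    for name in (
        "example_subject.yaml",    # a real task config: should be registered
        "_shared_template.yaml",   # a stub with .yaml: registered by mistake
        "_shared_template_yaml",   # the same stub without .yaml: ignored
    ):
        (root / name).write_text("# placeholder\n")
    registered = discover_task_configs(root)

print(registered)
```

Under this assumption, renaming the stub so that it no longer ends in .yaml is enough to keep it out of the discovered list while task configs can still reference it by path.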

@haileyschoelkopf
Collaborator

The test failures are unrelated to this PR.

@haileyschoelkopf haileyschoelkopf merged commit deb4328 into EleutherAI:main Sep 26, 2024
4 of 8 checks passed
giuliolovisotto pushed a commit to giuliolovisotto/lm-evaluation-harness that referenced this pull request Sep 27, 2024
* Added TurkishMMLU to LM Evaluation Harness

* Fixed COT name

* Fixed COT name

* Updated Readme

* Fixed Test issues

* Completed  Scan for changed tasks

* Updated Readme

* Update README.md

* fixup task naming casing + ensure yaml template stubs aren't registered

---------

Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
Co-authored-by: haileyschoelkopf <hailey@eleuther.ai>
shachardon pushed a commit to shachardon/lm-evaluation-harness that referenced this pull request Oct 1, 2024
mariagrandury pushed a commit to somosnlp/lm-evaluation-harness that referenced this pull request Oct 9, 2024
4 participants