Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

BEEP, BLM-IT, TraceIT, Multi-IT and VeryfIT
#2403 opened Oct 14, 2024 by Jj-source Loading…
Support for IBM watsonx_llm
#2397 opened Oct 11, 2024 by Medokins Loading…
Fix: Turkish MMLU Regex Pattern
#2393 opened Oct 10, 2024 by ArdaYueksel Loading…
Update citation links to Zenodo and DOI to 0.4.5
#2391 opened Oct 9, 2024 by LSinev Loading…
Add new tasks to spanish_bench and fix duplicates
#2390 opened Oct 9, 2024 by zxcvuser Loading…
add Russian mmlu
#2378 opened Oct 3, 2024 by tatiana-iazykova Loading…
Add the BlueBench benchmark
#2369 opened Oct 1, 2024 by shachardon Loading…
Remove unnecessary space prefix
#2368 opened Oct 1, 2024 by eldarkurtic Loading…
MMLU Pro Plus
#2366 opened Sep 30, 2024 by asgsaeid Loading…
Add Unitxt Multimodality Support
#2364 opened Sep 29, 2024 by elronbandel Loading…
fix cost_estimate script
#2359 opened Sep 26, 2024 by baberabb Draft
Add metabench task to LM Evaluation Harness
#2357 opened Sep 26, 2024 by kozzy97 Loading…
Support pipeline parallel with OpenVINO models
#2349 opened Sep 25, 2024 by sstrehlk Loading…
Mathvista
#2321 opened Sep 18, 2024 by baberabb Draft
mmlu translated professionally by OpenAI
#2312 opened Sep 17, 2024 by giuliolovisotto Loading…
Scrolls branch
#2309 opened Sep 16, 2024 by blitzionic Loading…
add new truncation strategy
#2300 opened Sep 15, 2024 by artemorloff Draft
Gen Prefix
#2274 opened Sep 2, 2024 by baberabb Loading…
Nvidia TensorRT-LLM
#2271 opened Sep 1, 2024 by abhishekvijeev Draft
Add Yue-Benchmark and update tasks description
#2270 opened Aug 31, 2024 by cpa2001 Loading…
Ifeval: Dowload punkt_tab on rank 0
#2267 opened Aug 30, 2024 by baberabb Loading…
[Draft] llm-as-judge
#2251 opened Aug 25, 2024 by baberabb Draft
Minor features
#2249 opened Aug 25, 2024 by artemorloff Loading…
ProTip! Exclude everything labeled bug with -label:bug.