Actions: EleutherAI/lm-evaluation-harness

Showing runs from all workflows
6,294 workflow runs

Add Unitxt Multimodality Support
Unit Tests #3434: Pull request #2364 opened by elronbandel
September 29, 2024 13:36 5m 0s elronbandel:unitxt-multimodal
Unitxt Multi Modality Support
Tasks Modified #3461: Pull request #2363 synchronize by elronbandel
September 29, 2024 13:28 1m 40s elronbandel:main
Unitxt Multi Modality Support
Unit Tests #3433: Pull request #2363 synchronize by elronbandel
September 29, 2024 13:28 5m 3s elronbandel:main
Unitxt Multi Modality Support
Tasks Modified #3460: Pull request #2363 opened by elronbandel
September 29, 2024 13:23 1m 31s elronbandel:main
Unitxt Multi Modality Support
Unit Tests #3432: Pull request #2363 opened by elronbandel
September 29, 2024 13:23 4m 55s elronbandel:main
fix some bugs of mmlu (#2299)
Unit Tests #3431: Commit 5a48ca2 pushed by lintangsutawika
September 28, 2024 14:49 5m 1s main
fix some bugs of mmlu (#2299)
Tasks Modified #3459: Commit 5a48ca2 pushed by lintangsutawika
September 28, 2024 14:49 2m 36s main
fix some bugs of mmlu
Tasks Modified #3458: Pull request #2299 synchronize by eyuansu62
September 28, 2024 09:58 2m 22s baai-open-internal:mmlu_fix
fix some bugs of mmlu
Unit Tests #3430: Pull request #2299 synchronize by eyuansu62
September 28, 2024 09:58 5m 28s baai-open-internal:mmlu_fix
Add new benchmark: Portuguese bench
Unit Tests #3423: Pull request #2156 synchronize by zxcvuser
September 27, 2024 15:35 5m 52s zxcvuser:portuguese_bench
Add new benchmark: Portuguese bench
Tasks Modified #3451: Pull request #2156 synchronize by zxcvuser
September 27, 2024 15:35 4m 13s zxcvuser:portuguese_bench
Add metabench task to LM Evaluation Harness
Tasks Modified #3442: Pull request #2357 synchronize by kozzy97
September 27, 2024 07:12 2m 4s kozzy97:metabench
Add metabench task to LM Evaluation Harness
Unit Tests #3414: Pull request #2357 synchronize by kozzy97
September 27, 2024 07:12 5m 9s kozzy97:metabench
openai: better error messages; fix greedy matching (#2327)
Unit Tests #3413: Commit 1bc6c93 pushed by haileyschoelkopf
September 26, 2024 19:58 6m 9s main
openai: better error messages; fix greedy matching (#2327)
Tasks Modified #3441: Commit 1bc6c93 pushed by haileyschoelkopf
September 26, 2024 19:58 13s main