Actions: EleutherAI/lm-evaluation-harness

Showing runs from all workflows
6,292 workflow runs

Add new benchmark: Basque bench
Tasks Modified #3487: Pull request #2153 synchronize by baberabb
October 3, 2024 12:29 2m 42s zxcvuser:basque_bench
Add new benchmark: Galician bench (#2155)
Tasks Modified #3486: Commit 0e76386 pushed by baberabb
October 3, 2024 12:27 4m 14s main
Add new benchmark: Galician bench (#2155)
Unit Tests #3458: Commit 0e76386 pushed by baberabb
October 3, 2024 12:27 5m 35s main
Add new benchmark: Galician bench
Unit Tests #3457: Pull request #2155 synchronize by baberabb
October 3, 2024 12:20 5m 53s zxcvuser:galician_bench
Add new benchmark: Galician bench
Tasks Modified #3485: Pull request #2155 synchronize by baberabb
October 3, 2024 12:20 4m 33s zxcvuser:galician_bench
Add new benchmark: Spanish bench (#2157)
Tasks Modified #3484: Commit ea17b98 pushed by baberabb
October 3, 2024 12:18 1m 30s main
Add new benchmark: Spanish bench (#2157)
Unit Tests #3456: Commit ea17b98 pushed by baberabb
October 3, 2024 12:18 5m 12s main
Add new benchmark: Spanish bench
Unit Tests #3455: Pull request #2157 synchronize by baberabb
October 3, 2024 12:10 5m 15s zxcvuser:spanish_bench
Add new benchmark: Spanish bench
Tasks Modified #3483: Pull request #2157 synchronize by baberabb
October 3, 2024 12:10 1m 29s zxcvuser:spanish_bench
add Russian mmlu
Tasks Modified #3482: Pull request #2378 opened by tatiana-iazykova
October 3, 2024 07:34 2m 36s tatiana-iazykova:main
add Russian mmlu
Unit Tests #3454: Pull request #2378 opened by tatiana-iazykova
October 3, 2024 07:34 5m 56s tatiana-iazykova:main
LingOly - Fixing scoring bugs for smaller models
Tasks Modified #3481: Pull request #2376 opened by am-bean
October 2, 2024 16:11 1m 31s am-bean:main
LingOly - Fixing scoring bugs for smaller models
Unit Tests #3453: Pull request #2376 opened by am-bean
October 2, 2024 16:11 5m 4s am-bean:main
Add the BlueBench benchmark
Tasks Modified #3479: Pull request #2369 synchronize by shachardon
October 2, 2024 08:11 Action required shachardon:bluebench_pr
Add the BlueBench benchmark
Unit Tests #3451: Pull request #2369 synchronize by shachardon
October 2, 2024 08:11 Action required shachardon:bluebench_pr
[API] tokenizer: add trust-remote-code
Unit Tests #3450: Pull request #2372 opened by baberabb
October 1, 2024 21:00 10m 9s api_trust
[API] tokenizer: add trust-remote-code
Tasks Modified #3478: Pull request #2372 opened by baberabb
October 1, 2024 21:00 12s api_trust
HF: switch conditional checks to self.backend from AUTO_MODEL_CLASS
Tasks Modified #3477: Pull request #2353 synchronize by baberabb
October 1, 2024 19:43 15s automodel
HF: switch conditional checks to self.backend from AUTO_MODEL_CLASS
Unit Tests #3449: Pull request #2353 synchronize by baberabb
October 1, 2024 19:43 6m 34s automodel
HF: switch conditional checks to self.backend from AUTO_MODEL_CLASS
Unit Tests #3448: Pull request #2353 synchronize by baberabb
October 1, 2024 19:37 4m 58s automodel
HF: switch conditional checks to self.backend from AUTO_MODEL_CLASS
Tasks Modified #3476: Pull request #2353 synchronize by baberabb
October 1, 2024 19:37 13s automodel
Add the BlueBench benchmark
Unit Tests #3447: Pull request #2369 synchronize by shachardon
October 1, 2024 13:49 Action required shachardon:bluebench_pr
Add the BlueBench benchmark
Tasks Modified #3475: Pull request #2369 synchronize by shachardon
October 1, 2024 13:49 Action required shachardon:bluebench_pr
Add the BlueBench benchmark
Tasks Modified #3474: Pull request #2369 opened by shachardon
October 1, 2024 13:18 Action required shachardon:bluebench_pr
Add the BlueBench benchmark
Unit Tests #3446: Pull request #2369 opened by shachardon
October 1, 2024 13:18 -1s shachardon:bluebench_pr