Actions: EleutherAI/lm-evaluation-harness

Showing runs from all workflows
3,595 workflow runs

Add humaneval_infilling task
Tasks Modified #5370: Pull request #3299 opened by its-alpesh
September 14, 2025 18:13 Action required its-alpesh:humaneval_infilling
Add humaneval_infilling task
Unit Tests #5342: Pull request #3299 opened by its-alpesh
September 14, 2025 18:13 Action required its-alpesh:humaneval_infilling
Add AIME to task description
Tasks Modified #5369: Pull request #3296 opened by jannalulu
September 12, 2025 16:57 1m 36s jannalulu:aime-register
Add AIME to task description
Unit Tests #5341: Pull request #3296 opened by jannalulu
September 12, 2025 16:57 5m 21s jannalulu:aime-register
Add BabiLong
Tasks Modified #5368: Pull request #3287 synchronize by jannalulu
September 12, 2025 16:45 2m 40s jannalulu:babilong
Add BabiLong
Unit Tests #5340: Pull request #3287 synchronize by jannalulu
September 12, 2025 16:45 5m 8s jannalulu:babilong
add quote to type hints (#3292)
Unit Tests #5339: Commit 0c134ee pushed by baberabb
September 12, 2025 09:16 5m 32s main
add quote to type hints (#3292)
Tasks Modified #5367: Commit 0c134ee pushed by baberabb
September 12, 2025 09:16 15s main
Fix lambada_multilingual_stablelm
Tasks Modified #5366: Pull request #3294 synchronize by jmichaelov
September 11, 2025 16:37 1m 45s jmichaelov:patch-4
Fix lambada_multilingual_stablelm
Unit Tests #5338: Pull request #3294 synchronize by jmichaelov
September 11, 2025 16:37 3m 29s jmichaelov:patch-4
Fix lambada_multilingual_stablelm
Tasks Modified #5365: Pull request #3294 opened by jmichaelov
September 11, 2025 15:53 1m 41s jmichaelov:patch-4
Fix lambada_multilingual_stablelm
Unit Tests #5337: Pull request #3294 opened by jmichaelov
September 11, 2025 15:53 3m 15s jmichaelov:patch-4
Add long-context evaluation benchmarks (LongBench v2, Babilong, InfiniteBench, Phonebook)
Tasks Modified #5363: Pull request #3256 synchronize by Mariani-code
September 10, 2025 20:41 Action required Mariani-code:main
Add long-context evaluation benchmarks (LongBench v2, Babilong, InfiniteBench, Phonebook)
Unit Tests #5335: Pull request #3256 synchronize by Mariani-code
September 10, 2025 20:41 Action required Mariani-code:main
[feature] add support for Moore Threads GPU family
Tasks Modified #5362: Pull request #3290 synchronize by houchen-li
September 10, 2025 05:48 Action required houchen-li:main
[feature] add support for Moore Threads GPU family
Unit Tests #5334: Pull request #3290 synchronize by houchen-li
September 10, 2025 05:48 Action required houchen-li:main
[feature] add support for Moore Threads GPU family
Tasks Modified #5361: Pull request #3290 synchronize by houchen-li
September 10, 2025 05:46 Action required houchen-li:main
[feature] add support for Moore Threads GPU family
Unit Tests #5333: Pull request #3290 synchronize by houchen-li
September 10, 2025 05:46 Action required houchen-li:main
[feature] add support for Moore Threads GPU family
Tasks Modified #5360: Pull request #3290 synchronize by houchen-li
September 10, 2025 05:43 Action required houchen-li:main
[feature] add support for Moore Threads GPU family
Unit Tests #5332: Pull request #3290 synchronize by houchen-li
September 10, 2025 05:43 Action required houchen-li:main
[feature] add support for Moore Threads GPU family
Tasks Modified #5359: Pull request #3290 opened by houchen-li
September 10, 2025 05:41 Action required houchen-li:main
[feature] add support for Moore Threads GPU family
Unit Tests #5331: Pull request #3290 opened by houchen-li
September 10, 2025 05:41 Action required houchen-li:main