Skip to content

Pull requests: allenai/open-instruct

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

first iteration
#812 opened Jul 22, 2025 by jacob-morrison Draft
perf penalty
#805 opened Jul 21, 2025 by saurabh111233212 Loading…
[WIP] replay buffer
#776 opened Jul 11, 2025 by mnoukhov Draft
Saurbahs/diff filtering
#774 opened Jul 11, 2025 by saurabh111233212 Draft
next olmo and rl from base
#767 opened Jul 9, 2025 by mnoukhov Loading…
4 tasks done
[WIP] add long-form rl-rag reward
#729 opened Jun 19, 2025 by RulinShao Loading…
[WIP] better single gpu performance
#725 opened Jun 16, 2025 by mnoukhov Draft
Add adaptive majority voting for GRPO training
#684 opened May 22, 2025 by AfraAmini Loading…
ProTip! Follow long discussions with comments:>50.