-
Notifications
You must be signed in to change notification settings - Fork 3.6k
Pull requests: verl-project/verl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[doc] chore: Bug fixes for the qwen3-235b model in 256k scenarios
#5908
opened Apr 8, 2026 by
autbuster
Loading…
8 tasks
[ckpt] feat: add WPI (Weight Propagation Interface) checkpoint engine for zero-copy cross-node weight transfer
#5907
opened Apr 7, 2026 by
yangspirit
Loading…
8 tasks done
[chat_template]: handle system message in apply_chat_template fall…
#5903
opened Apr 7, 2026 by
khazic
Loading…
6 of 8 tasks
[bugfix] Fix modeling_qwen2_5_vl missing attribute 'Qwen2RMSNorm'
#5901
opened Apr 7, 2026 by
ZhuYajun-AI
Loading…
8 tasks
[veomni] fix: improve VeOmniEngine and add flash attention kwargs support
#5900
opened Apr 7, 2026 by
deerlu
Loading…
8 tasks
[trainer] fix: return NaN for empty tensors in compute_data_metrics
#5899
opened Apr 7, 2026 by
Jackie2049
•
Draft
5 tasks done
feat(opd): support SGLang as teacher server for online policy distillation
#5897
opened Apr 7, 2026 by
nathon-lee
Loading…
[tool, perf] feat: add reward timing metrics in agent loop
#5896
opened Apr 7, 2026 by
guillemgt
Loading…
6 of 7 tasks
[megatron] fix: MTP loss deadlock when using context parallelism
#5895
opened Apr 7, 2026 by
xhx1022
Loading…
fix: flatten multi-component position_ids to 1D for nested tensor compatibility
#5886
opened Apr 6, 2026 by
yifannnwu
Loading…
1 of 2 tasks
[megatron] fix: dynamic context parallel batch splitting and loss normalization
#5869
opened Apr 2, 2026 by
Kite0011
Loading…
8 tasks
[sglang] Adapting the use of _launch_subprocesses to the latest SGLang branch
#5868
opened Apr 2, 2026 by
xiazhahe
Loading…
8 tasks
feature: Enhance Ray subprocess error handling system
#5855
opened Apr 2, 2026 by
abeiabeiqq
Loading…
[megatron] feat: enable Megatron FSDP for SFT training
#5854
opened Apr 1, 2026 by
yxs
Loading…
4 tasks done
[megatron] fix: always patch actor postprocess on unfused path for MTP models
#5845
opened Apr 1, 2026 by
AkiRusProd
Loading…
[doc] feat: add Claude Code skills for add-dataset, add-reward, add-trainer
#5844
opened Apr 1, 2026 by
khazic
Loading…
3 tasks done
[doc, misc] chore: add Claude Code skills and CLAUDE.md for AI-assisted development
#5843
opened Apr 1, 2026 by
khazic
Loading…
4 tasks done
[trainer] feat: add group reward std and gradient SNR metrics to compute_data_metrics
#5842
opened Apr 1, 2026 by
KLGR123
Loading…
3 tasks done
[rollout] chore: bump up trtllm image version to 1.3.0rc10
#5841
opened Apr 1, 2026 by
Superjomn
Loading…
6 of 8 tasks
[perf, fsdp, trainer] feat: Skip training for zero-advantage responses to speed up RL.
#5838
opened Apr 1, 2026 by
sheilaliuxl
Loading…
7 tasks done
Previous Next
ProTip!
Adding no:label will show everything without a label.