[worker] fix: Fix missing rollout_log_probs argument in policy loss functions
#1347
e2e_one_step_off_policy.yml
on: pull_request
e2e_one_step_off_policy_fsdp2: 2m 55s
e2e_one_step_off_policy_megatron: 3m 11s