[fix] Fix the pretty trainer logging#270

Merged
tyler-griggs merged 4 commits into main from tgriggs/log_fix_again
Sep 11, 2025

Conversation

@tyler-griggs
Member

Re-implement the logging fix of #250 that was reverted in #261

The issue was that using the worker_process_setup_hook to set logging behavior interfered with vLLM using Ray as its tensor-parallel backend and raised an error. vLLM apparently needs this hook to be unset.

Moved the logging configuration into the RayPPOTrainer `__init__`.
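As a rough illustration of the kind of helper being moved (a minimal stdlib-only sketch; the function name comes from the review below, but its exact format string and behavior in SkyRL are assumptions):

```python
import logging
import sys

def configure_ray_worker_logging(level=logging.INFO):
    """Sketch: route the process's root logger to stdout with a compact,
    readable format. Assumed signature; the real SkyRL helper may differ."""
    handler = logging.StreamHandler(sys.stdout)
    handler.setFormatter(
        logging.Formatter("[%(levelname)s %(asctime)s] %(name)s: %(message)s")
    )
    root = logging.getLogger()
    root.handlers.clear()  # avoid duplicated log lines if called twice
    root.addHandler(handler)
    root.setLevel(level)
```

Because this only touches the calling process's root logger, it must run inside each process whose output should be pretty-printed.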

@tyler-griggs tyler-griggs changed the title from "[fix]" to "[fix] Fix the pretty trainer logging" on Sep 9, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request aims to fix logging for Ray workers by moving the configuration logic out of the problematic worker_process_setup_hook. A new function, configure_ray_worker_logging, has been added to handle log formatting and routing.

However, there is a critical issue with the current implementation. The logging configuration function is called from RayPPOTrainer.__init__, which executes on the driver process. This means the logging for the actual Ray workers (the remote actors) will not be configured, and the fix will not have the intended effect. I've provided a critical review comment to address this by moving the function call to the worker initialization logic, which is essential for this fix to work as described.
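The driver-versus-worker distinction raised here can be demonstrated without Ray: with the `spawn` start method, a child process gets a fresh interpreter, so logging handlers attached in the parent (the "driver") are absent in the child (a stand-in for a remote Ray actor). A minimal sketch:

```python
import logging
import multiprocessing as mp

def report_handler_count(queue):
    # Runs in the child: a freshly spawned process starts with an
    # unconfigured root logger, so driver-side setup is invisible here.
    queue.put(len(logging.getLogger().handlers))

def demo():
    # "Driver-side" configuration: attach a handler in this process only.
    logging.getLogger().addHandler(logging.StreamHandler())
    ctx = mp.get_context("spawn")
    queue = ctx.Queue()
    proc = ctx.Process(target=report_handler_count, args=(queue,))
    proc.start()
    child_count = queue.get()
    proc.join()
    return len(logging.getLogger().handlers), child_count

if __name__ == "__main__":
    parent_count, child_count = demo()
    print(f"parent handlers: {parent_count}, child handlers: {child_count}")
```

The parent reports at least one handler while the child reports zero, which is why configuration done in `RayPPOTrainer.__init__` on the driver would not, by itself, reformat worker-side logs.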

Member

@SumanthRH SumanthRH left a comment

LGTM, let's just verify with the GSM8K example

@tyler-griggs
Member Author

Yes, done. I also ran the test suite that failed previously.

@tyler-griggs tyler-griggs merged commit fd8275f into main Sep 11, 2025
3 checks passed
@tyler-griggs tyler-griggs deleted the tgriggs/log_fix_again branch September 12, 2025 20:19
dzorlu referenced this pull request in fleet-ai/SkyRL Feb 4, 2026