
get chat_end_index after env init#184

Merged
tyler-griggs merged 1 commit into NovaSky-AI:main from DataDog:etnbrd/fix-chat_end_index
Aug 22, 2025

Conversation

Contributor

@etnbrd etnbrd commented Aug 22, 2025

This PR moves the calculation of `chat_end_index` to after `chat_history` has been modified by the `env.init` function, so that it reflects the correct length.

In our use case, we inject the system prompt in addition to the user message in `env.init`, so the length of the chat history changes before and after the call.
As a result, `chat_end_index` is 1 instead of 2. We think this causes bugs further down the line, leading to this assertion error:

```
AssertionError: Response ids and loss masks must have the same length, for sample 0 got 27 and 50
```

Resolves #183
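To illustrate the ordering bug, here is a minimal standalone sketch (the `Env` class and variable names are illustrative, not SkyRL's actual API): capturing the index before `env.init` runs yields a stale length when `init` prepends a system prompt.

```python
# Illustrative sketch of the ordering issue fixed by this PR.
# Names (Env, chat_end_index) are hypothetical stand-ins, not SkyRL's real API.

class Env:
    def init(self, chat_history):
        # env.init may prepend a system prompt, changing the history length.
        system = {"role": "system", "content": "You are a helpful agent."}
        return [system] + chat_history

chat_history = [{"role": "user", "content": "solve the task"}]
env = Env()

# Before the fix: index captured BEFORE env.init -> stale value of 1.
stale_end_index = len(chat_history)

chat_history = env.init(chat_history)

# After the fix: index captured AFTER env.init -> correct value of 2.
chat_end_index = len(chat_history)

print(stale_end_index, chat_end_index)  # 1 2
```

A loss mask sized with the stale index would then disagree with the tokenized response length, which is consistent with the assertion error quoted above.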


@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request correctly moves the calculation of chat_end_index to be after the call to env.init. As the description points out, env.init can modify the chat_history by adding messages like a system prompt. By calculating chat_end_index after this modification, you ensure it accurately reflects the length of the initial chat history, which should resolve the assertion error mentioned. The change is logical and well-contained.

@tyler-griggs
Member

Great catch, thanks

@tyler-griggs tyler-griggs merged commit 9e5db05 into NovaSky-AI:main Aug 22, 2025
3 checks passed
dzorlu referenced this pull request in fleet-ai/SkyRL Feb 4, 2026

Development

Successfully merging this pull request may close these issues.

When env.init changes the prompt length, the chat_end_index is wrong
