Fix errors when use verl to train GLM4.1v model #39199

kaln27 · 2025-07-03T13:54:24Z

Support glm4v load from AutoModelForVision2Seq
Set glm4v model's _checkpoint_conversion_mapping attribute from None to empty dict {}

What does this PR do?

When using verl to train the GLM4.1v model with GRPO, several small errors occur.
Here is how this PR fixes them:

Support loading glm4v with AutoModelForVision2Seq.
verl treats _checkpoint_conversion_mapping as a dict, but for glm4v it is currently None, which aborts the program. I also found that almost every model that doesn't need checkpoint conversion sets it to an empty dict.
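
To illustrate the second point, here is a minimal sketch of why None breaks a caller that expects a dict while an empty dict is harmless. The two classes and the `rename_keys` helper are hypothetical stand-ins, not the actual transformers or verl code, and the prefix-based renaming is a simplifying assumption about how such a mapping might be consumed.

```python
# Hypothetical stand-ins; these are NOT the real transformers or verl classes.
class ModelWithNoneMapping:
    _checkpoint_conversion_mapping = None  # state before this PR (for glm4v)


class ModelWithEmptyMapping:
    _checkpoint_conversion_mapping = {}  # state after this PR


def rename_keys(model_cls, state_dict):
    """Rename checkpoint keys the way a dict-expecting caller might (assumed behaviour)."""
    mapping = model_cls._checkpoint_conversion_mapping
    renamed = {}
    for key, value in state_dict.items():
        new_key = key
        # Raises AttributeError when mapping is None, because None has no .items()
        for old_prefix, new_prefix in mapping.items():
            if new_key.startswith(old_prefix):
                new_key = new_prefix + new_key[len(old_prefix):]
        renamed[new_key] = value
    return renamed


weights = {"model.layer.weight": 1.0}

# An empty mapping is a no-op: the keys come back unchanged.
print(rename_keys(ModelWithEmptyMapping, weights))

# A None mapping fails as soon as the caller iterates over it.
try:
    rename_keys(ModelWithNoneMapping, weights)
except AttributeError as err:
    print("fails with None:", err)
```

With an empty dict the caller needs no special casing, which is presumably why other models that need no conversion use `{}` as well.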

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

kaln27 · 2025-07-03T14:17:12Z

@ArthurZucker
Hi Arthur, would you mind reviewing this PR?
Thank you!

src/transformers/models/auto/modeling_auto.py

src/transformers/models/glm4v/modeling_glm4v.py

zucchini-nlp · 2025-07-08T04:52:19Z

Let's address the mapping comment and rebase main, before we can merge

* Support glm4v load from AutoModelForVision2Seq
* Set glm4v model _checkpoint_conversion_mapping attr from None to {}

github-actions · 2025-07-08T09:22:58Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: glm4v

HuggingFaceDocBuilderDev · 2025-07-08T09:39:58Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

* Fix errors when use verl to train GLM4.1v model
* Support glm4v load from AutoModelForVision2Seq
* Set glm4v model _checkpoint_conversion_mapping attr from None to {}
* Update modeling_auto.py

zucchini-nlp reviewed Jul 7, 2025

src/transformers/models/auto/modeling_auto.py (outdated, resolved)

src/transformers/models/glm4v/modeling_glm4v.py (resolved)

kaln27 force-pushed the main branch from 34b30b3 to 1de713c Compare July 8, 2025 05:36

kaln27 and others added 2 commits July 8, 2025 17:21

Fix errors when use verl to train GLM4.1v model

2a3f55b

* Support glm4v load from AutoModelForVision2Seq
* Set glm4v model _checkpoint_conversion_mapping attr from None to {}

Update modeling_auto.py

f06fa5c

kaln27 force-pushed the main branch from bd2f2b6 to f06fa5c Compare July 8, 2025 09:21

zucchini-nlp approved these changes Jul 8, 2025

zucchini-nlp enabled auto-merge (squash) July 8, 2025 09:27

zucchini-nlp merged commit d370bc6 into huggingface:main Jul 8, 2025
19 checks passed

zucchini-nlp added the "for patch" label (tag issues / labels that should be included in the next patch) Jul 9, 2025

kaln27 mentioned this pull request Jul 11, 2025

[misc] fix: Use AutoModelForImageTextToText instead of AutoModelForVision2Seq volcengine/verl#2475

Open
