Update GLM-4.1V MMRope implementation #41182
Conversation
cc @ArthurZucker for rope
```python
        cache_position: Optional[torch.LongTensor] = None,
        **kwargs,
    ) -> tuple[torch.FloatTensor, Optional[tuple[torch.FloatTensor, torch.FloatTensor]]]:
        """
```
Let's use @auto_docstring instead.
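For readers following along, a minimal sketch of what this suggestion points at, assuming the @auto_docstring decorator exported from transformers.utils (its exact requirements vary by transformers version); the class name SketchDecoderLayer is hypothetical:

```python
# Hedged sketch, not the PR's code: @auto_docstring generates the
# docstring for standard, consistently named arguments, so the
# hand-written docstring block can be dropped.
from typing import Optional

import torch
from torch import nn

from transformers.utils import auto_docstring


class SketchDecoderLayer(nn.Module):  # hypothetical stand-in class
    @auto_docstring
    def forward(
        self,
        hidden_states: torch.Tensor,
        attention_mask: Optional[torch.Tensor] = None,
        cache_position: Optional[torch.LongTensor] = None,
        **kwargs,
    ) -> tuple[torch.FloatTensor, Optional[tuple[torch.FloatTensor, torch.FloatTensor]]]:
        # No manual docstring is needed for the standard arguments above.
        raise NotImplementedError
```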
```python
        outputs = (hidden_states,)

        if output_attentions:
            outputs += (self_attn_weights,)

        return outputs
```
And keep @check_model_inputs below so we don't have to return these explicitly.
But this code inherits from Glm4MoeDecoderLayer, and this class doesn't seem to have @check_model_inputs. Where should it be added below?
```python
    def forward(
        self,
        hidden_states: torch.Tensor,
        position_embeddings: tuple[torch.Tensor, torch.Tensor],
        attention_mask: Optional[torch.Tensor] = None,
```
I guess the overwriting is because of @check_model_inputs; let's bring the decorator back and remove the unnecessary changes.
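For context on this thread, a hedged sketch of the pattern being requested, assuming the check_model_inputs decorator from transformers.utils.generic (the real decorator expects a full PreTrainedModel with a config; the Sketch* names here are hypothetical): with the decorator on the model-level forward, per-layer attentions and hidden states are collected via hooks, so a decoder layer returns only hidden_states instead of assembling an outputs tuple.

```python
# Illustrative sketch only, under the assumptions stated above.
import torch
from torch import nn

from transformers.utils.generic import check_model_inputs


class SketchLayer(nn.Module):
    def __init__(self, hidden_size: int = 32):
        super().__init__()
        self.proj = nn.Linear(hidden_size, hidden_size)

    def forward(self, hidden_states: torch.Tensor, **kwargs) -> torch.Tensor:
        # Return only hidden_states; attention weights would be captured
        # by the decorator's hooks rather than returned explicitly.
        return self.proj(hidden_states)


class SketchModel(nn.Module):
    def __init__(self, num_layers: int = 2):
        super().__init__()
        self.layers = nn.ModuleList(SketchLayer() for _ in range(num_layers))

    # With @check_model_inputs here, output_attentions / output_hidden_states
    # are handled at the model level, not inside each layer.
    @check_model_inputs
    def forward(self, hidden_states: torch.Tensor, **kwargs) -> torch.Tensor:
        for layer in self.layers:
            hidden_states = layer(hidden_states)
        return hidden_states
```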
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
[For maintainers] Suggested jobs to run (before merge): run-slow: glm4v, glm4v_moe
zucchini-nlp left a comment:
Some tests are failing, can you take a look?
src/transformers/models/glm4v_moe/convert_glm4v_moe_mgt_weights_to_hf.py (outdated, resolved)
Now CI passes.
zucchini-nlp left a comment:
Thanks for iterating, merging
Commits:

* update for 4D mask
* update
* Update modular_glm4v.py
* 1
* Revert "1" (this reverts commit d13a763)
* update as glm4v logtic
* update
* 1
* update
* Create convert_glm4v_moe_mgt_weights_to_hf.py
* update
* update
What does this PR do?
This PR enables the GLM-4.1V model to support mrope processing with 4D attention-mask inputs, using the same input format as Qwen3 in verl.
verl PR: volcengine/verl#3291
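For illustration, a shape-level sketch (not code from this PR) of the inputs this targets, under the usual conventions for mrope-style models: position ids carry one row per rope section (temporal, height, width), and the attention mask is a 4D additive float mask broadcast across heads, which lets packed or padded batches block arbitrary positions.

```python
import torch

batch, seq_len = 2, 16

# mrope position ids: shape (3, batch, seq_len), one row each for the
# temporal, height, and width index streams (identical for pure text).
position_ids = torch.arange(seq_len).expand(3, batch, seq_len).clone()

# 4D additive mask: shape (batch, 1, q_len, kv_len); 0.0 where attention
# is allowed, a large negative value where it is blocked.
mask = torch.full((batch, 1, seq_len, seq_len), torch.finfo(torch.float32).min)
mask = torch.triu(mask, diagonal=1)  # causal: block strictly-future positions

model_inputs = {"position_ids": position_ids, "attention_mask": mask}
print(position_ids.shape, mask.shape)  # torch.Size([3, 2, 16]) torch.Size([2, 1, 16, 16])
```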