GLM-4.1V Model support #38431

zRzRzRzRzRzRzR · 2025-05-28T09:47:24Z

This PR aims to support the use of the GLM-4-0414 model for training video understanding and image understanding models GLM-4.1V
This PR has completed the refactoring of the related modules. Due to the overlap of F definitions (torch and torchvision), image_processors and videos_processors have not been placed under modular management @zucchini-nlp review sugguest.
This PR is for code review. @ArthurZucker

…into glm-4v-0414

Cyrilvallez

Alright, I fixed the remaining parts, and confirmed that the model is still working as expected on real checkpoints - merging now! Thanks for the work!! 🤗🚀

ydshieh · 2025-06-26T16:53:27Z

@zRzRzRzRzRzRzR Thank you for adding this model. This model's tests is quite slow as you can see in the following list, and causes the job that has this model tests running in 12 minutes instead of other jobs (~4 minutes)

https://app.circleci.com/pipelines/github/huggingface/transformers/135728/workflows/913523da-80bc-4021-9e85-d5ab8653d204/jobs/1800021/timing

Would you be up to make is faster? Usually it means to tweak Glm4vVisionText2TextModelTester. It's good to check what the config created there and see if they are some values would cause the created model being big.

58.03s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_initialization
23.31s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_model_outputs_equivalence
20.84s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_attention_outputs
18.42s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_resize_tokens_embeddings
17.87s call     tests/models/bark/test_modeling_bark.py::BarkModelIntegrationTests::test_model_can_generate
16.71s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_can_use_safetensors
16.34s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_load_save_without_tied_weights
15.13s call     tests/models/clvp/test_modeling_clvp.py::ClvpModelForConditionalGenerationTest::test_batching_equivalence
14.97s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_attn_implementation_composite_models
12.92s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_save_load
12.83s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_hidden_states_output
12.66s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_feed_forward_chunking
12.60s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_can_init_all_missing_weights
10.00s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_eager_matches_sdpa_inference_03_fp16_pad_left_no_attn_mask
9.79s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_model_weights_reload_no_missing_tied_weights
9.54s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_eager_matches_sdpa_inference_18_bf16_pad_left_no_attn_mask_sdpa_kernels
9.12s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_eager_matches_sdpa_inference_04_fp16_pad_right_sdpa_kernels
9.03s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_eager_matches_sdpa_inference_00_fp16_pad_left_sdpa_kernels
9.01s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_batching_equivalence
8.94s call     tests/models/glm4v/test_modeling_glm4v.py::Glm4vModelTest::test_eager_matches_sdpa_inference_01_fp16_pad_left

zRzRzRzRzRzRzR added 24 commits May 30, 2025 23:29

20250508 Model Architecture

62b5927

Update modeling_glm4v.py

474f210

Update modeling_glm4v.py

bb38cee

Update modeling_glm4v.py

9b2c1b7

update 1447

7d46066

0526

e4b59b6

update

ca00ad0

format

420ee1a

problem

5c74b30

update

567fdc6

update with only image embed diff

61cc381

Final

b91ef52

upload

1596b55

update

085cd04

1

071b4f1

upload with ruff

434bb53

update

5e43cd6

update

7a52852

work

d0f2ba2

1

2037a44

1

351bd80

update with new note

cd41b4d

2

8006d59

Update convert_glm4v_mgt_weights_to_hf.py

7e670eb

zRzRzRzRzRzRzR force-pushed the glm-4v-0414 branch from 5f515ac to 7e670eb Compare May 30, 2025 15:30

zRzRzRzRzRzRzR added 5 commits May 30, 2025 23:30

Update tokenization_auto.py

42b0d92

Merge branch 'huggingface:main' into glm-4v-0414

23fd4cf

update with new format

b2ecd4e

Merge branch 'glm-4v-0414' of github.com:zRzRzRzRzRzRzR/transformers …

17288d6

…into glm-4v-0414

remove rmsnrom

206d5bd

zRzRzRzRzRzRzR and others added 27 commits June 20, 2025 10:57

revert processing

cdf5043

update preprocesor

be87241

changed

e339725

1

a05466c

update

6d118fb

update

f229bc2

6

a3102d6

update

6567882

update

b8f5af5

Merge branch 'main' into glm-4v-0414

a845d4f

update

ce2f117

Delete tmp.txt

efce1a5

Merge branch 'huggingface:main' into glm-4v-0414

ac0d1f2

config

539f2f7

Update video_processing_glm4v.py

1244eb0

Merge branch 'main' into glm-4v-0414

2a956eb

apply modular correctly

148bc1f

move functions

71b9ae8

fix order

cbf9cc7

update the longest_edge

8e7d244

style

c3ea3f2

Merge branch 'main' into glm-4v-0414

c212f1f

simplify a lot

f63a6a6

fix random order of classes

4ab7874

skip integration tests

39f6375

correctly fix the tests

9873cf2

fix TP plan

03f8a97

Cyrilvallez approved these changes Jun 25, 2025

View reviewed changes

Cyrilvallez merged commit af98702 into huggingface:main Jun 25, 2025
18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GLM-4.1V Model support #38431

GLM-4.1V Model support #38431

Uh oh!

zRzRzRzRzRzRzR commented May 28, 2025 •

edited

Loading

Uh oh!

Cyrilvallez left a comment

Uh oh!

Uh oh!

ydshieh commented Jun 26, 2025

Uh oh!

Uh oh!

GLM-4.1V Model support #38431

GLM-4.1V Model support #38431

Uh oh!

Conversation

zRzRzRzRzRzRzR commented May 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Cyrilvallez left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ydshieh commented Jun 26, 2025

Uh oh!

Uh oh!

zRzRzRzRzRzRzR commented May 28, 2025 •

edited

Loading