Transformers 4.53 support, SmolLM3 and Fix old Transformers support by IlyasMoutawwakil · Pull Request #2319 · huggingface/optimum

IlyasMoutawwakil · 2025-07-10T12:08:13Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Who can review?

IlyasMoutawwakil · 2025-07-10T12:09:30Z

+        if is_transformers_version(">=", "4.53"):
+            from transformers.integrations.executorch import sdpa_mask_without_vmap
+            from transformers.masking_utils import AttentionMaskInterface
+
+            AttentionMaskInterface.register("sdpa", sdpa_mask_without_vmap)


using the patching spec here doesn't work because the original object is referenced in a dictionary in the AttentionMaskInterface

HuggingFaceDocBuilderDev · 2025-07-10T12:23:57Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

echarlaix

Great work @IlyasMoutawwakil thanks a lot !

echarlaix · 2025-07-15T08:40:06Z

+    def __init_subclass__(cls, **kwargs):
+        super().__init_subclass__(**kwargs)
+        logger.warning(
+            "The `ExportConfig` class is deprecated and will be removed in a future version. "


cc @JingyaHuang https://github.com/huggingface/optimum-neuron/blob/4fd7d27fce7f2421e51f9fcabf81a752d32c144f/optimum/exporters/neuron/base.py#L43

bil-ash · 2025-07-15T14:01:32Z

Would be nice if gemma3n(multimodal) support is also added along with this PR.

IlyasMoutawwakil · 2025-07-15T14:24:51Z

Hi @bil-ash ! unfortunately it won't be that simple because optimum's onnx exporter doesn't support multimodal decoders yet.
Would love to review a PR, ideally taking inspiration from optimum's openvino exporter.
I will be adding SmolLM3 however.

…timum into transformers-4.53

DWarez · 2025-07-18T08:41:30Z

hi, is there an eta for the merging of this PR? I'd like to use the sdpa_mask_without_vmap function (thanks a lot for that). That patching allows the export of gemma3-text, which has been required quite a bit in the issues 👀

attention vmap patch as in here huggingface#2319

…d broadcastable

…r normalized config in ort modeling

same as huggingface/optimum#2319

test

d28dee8

IlyasMoutawwakil commented Jul 10, 2025

View reviewed changes

IlyasMoutawwakil added 2 commits July 10, 2025 20:37

fix seq2seq patched sdpa

37dac0b

patch qwen3_moe, out_attentions, and eager_mask

19b3dc2

IlyasMoutawwakil marked this pull request as ready for review July 10, 2025 20:44

IlyasMoutawwakil added 2 commits July 10, 2025 23:08

fix

ce3809f

use optimum model

bd8dec6

IlyasMoutawwakil commented Jul 11, 2025

View reviewed changes

Comment thread tests/exporters/tflite/test_export_cli.py

IlyasMoutawwakil requested a review from echarlaix July 11, 2025 08:05

editable subpackages

5593686

echarlaix approved these changes Jul 15, 2025

View reviewed changes

IlyasMoutawwakil commented Jul 15, 2025

View reviewed changes

Comment thread optimum/exporters/onnx/model_patcher.py Outdated

Apply suggestions from code review

b10c340

IlyasMoutawwakil added 2 commits July 15, 2025 16:25

smollm3 support

d3bd103

Merge branch 'transformers-4.53' of https://github.com/huggingface/op…

f80c1c5

…timum into transformers-4.53

IlyasMoutawwakil changed the title ~~Transformers 4.53 support~~ Transformers 4.53 support and SmolLM3 Jul 15, 2025

IlyasMoutawwakil mentioned this pull request Jul 17, 2025

Support transformers 4.53 and SmolLM3 huggingface/optimum-onnx#22

Merged

deprecate tensorflow onnx export and add smollm3 to export tests

6b33750

DWarez added a commit to DWarez/optimum that referenced this pull request Jul 18, 2025

fix: export gemma3-text is now working thanks to

ed5abe1

attention vmap patch as in here huggingface#2319

echarlaix reviewed Jul 18, 2025

View reviewed changes

Comment thread optimum/exporters/onnx/__main__.py

echarlaix reviewed Jul 18, 2025

View reviewed changes

Comment thread optimum/exporters/onnx/model_patcher.py

DWarez mentioned this pull request Jul 20, 2025

Runtime Error when exporting ByteDance-Seed/Seed-X-7B #2323

Closed

4 tasks

IlyasMoutawwakil added 4 commits July 21, 2025 12:15

write a more general sdpa_mask without vmap that's also vectorized an…

85521dc

…d broadcastable

better and more generic sdpa_mask_without_vmap implementation

dd6d0c1

style and fix

3dd85c1

fix

f7b6ebd

IlyasMoutawwakil added 2 commits July 21, 2025 14:38

patch find_packed_sequence_indices as it's untraceable

c5e0165

fix

3a4ea0d

IlyasMoutawwakil requested a review from echarlaix July 21, 2025 14:17

IlyasMoutawwakil added the onnxruntime-slow label Jul 21, 2025

IlyasMoutawwakil added 13 commits July 21, 2025 22:56

fix

74f064d

revert tests removal until refactor

5976851

fix temporary hub repo import

6c602ce

fix

b43fead

fix external data tests on windows

9953b91

update phi and phi3 min version

310bcd1

condition modernbert optimization test

459316d

get back old (pre 4.44) bloom modeling support and remove the need fo…

4fc5972

…r normalized config in ort modeling

fix test was using hardcoded architecture

4134e46

unparallelize test that uses remote code

2a6ef9c

support older versions of mpt and phi (4.36)

8623353

remove parallelism from slow tests

a3ad9df

fix vision to text pipelines test

77dd30f

IlyasMoutawwakil changed the title ~~Transformers 4.53 support and SmolLM3~~ Transformers 4.53 support, SmolLM3 and Fix old Transformers support Jul 22, 2025

IlyasMoutawwakil added 2 commits July 23, 2025 10:31

more specific version handling for find_packed_sequence_indices

d41f0ea

fix

7790092

IlyasMoutawwakil merged commit 53f39a6 into main Jul 23, 2025
46 of 49 checks passed

IlyasMoutawwakil deleted the transformers-4.53 branch July 23, 2025 10:33

IlyasMoutawwakil added a commit to huggingface/optimum-onnx that referenced this pull request Jul 23, 2025

Support transformers 4.53 and SmolLM3 (#22)

ff096ac

same as huggingface/optimum#2319

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transformers 4.53 support, SmolLM3 and Fix old Transformers support#2319

Transformers 4.53 support, SmolLM3 and Fix old Transformers support#2319
IlyasMoutawwakil merged 31 commits into
mainfrom
transformers-4.53

IlyasMoutawwakil commented Jul 10, 2025

Uh oh!

IlyasMoutawwakil Jul 10, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jul 10, 2025

Uh oh!

Uh oh!

echarlaix left a comment

Uh oh!

echarlaix Jul 15, 2025

Uh oh!

Uh oh!

Uh oh!

bil-ash commented Jul 15, 2025

Uh oh!

IlyasMoutawwakil commented Jul 15, 2025

Uh oh!

DWarez commented Jul 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

IlyasMoutawwakil commented Jul 10, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

IlyasMoutawwakil Jul 10, 2025

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Jul 10, 2025

Uh oh!

Uh oh!

echarlaix left a comment

Choose a reason for hiding this comment

Uh oh!

echarlaix Jul 15, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

bil-ash commented Jul 15, 2025

Uh oh!

IlyasMoutawwakil commented Jul 15, 2025

Uh oh!

DWarez commented Jul 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

DWarez commented Jul 18, 2025 •

edited

Loading