Propagate library_name parameter in from_pretrained to export by tomaarsen · Pull Request #2328 · huggingface/optimum

tomaarsen · 2025-07-25T10:32:41Z

Resolves #2327

Hello!

Pull Request overview

Allow library_name in ORTModel....from_pretrained("...", library_name=...)

Details

As described in #2327, this will allow exporting and loading models using a specific library name rather than relying on Optimum's automatic inferring of the library. For Sentence Transformers, this allows me to add ONNX exporting to SparseEncoder models with:

from pprint import pprint

import torch
from sentence_transformers import SparseEncoder
from optimum.onnxruntime import ORTModelForMaskedLM

# 1. Load a pretrained SparseEncoder model
model = SparseEncoder("naver/splade-cocondenser-ensembledistil")

# Very hackishly override the model to use ORTModelForMaskedLM
class ORTModelForMaskedLMModule(ORTModelForMaskedLM, torch.nn.Module):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        torch.nn.Module.__init__(self)

model[0].auto_model = ORTModelForMaskedLMModule.from_pretrained("naver/splade-cocondenser-ensembledistil", library_name="transformers")

# Encode some texts into sparse embeddings
embeddings = model.encode(["I'm travelling to the grocery store to buy some milk."])

# Decode the embeddings again, the beauty of sparse embeddings
decoded = model.decode(embeddings, top_k=5)
pprint(decoded)
"""
[[('milk', 2.486112356185913),
  ('grocery', 1.7925951480865479),
  ('dairy', 1.6981983184814453),
  ('traveling', 1.5185797214508057),
  ('buy', 1.3063507080078125)]]
"""

As Sentence Transformers only exports models in the "transformers" way, I can add this internally in Sentence Transformers so the eventual usage by the end user is simply model = SparseEncoder("naver/splade-cocondenser-ensembledistil", backend="onnx"). Note also that I can export with OpenVINO without library_name just fine, only ONNX has this issue.

Please let me know if you'd rather go in a different direction here.

Tom Aarsen

Required to avoid automatic inferring of the library_name

HuggingFaceDocBuilderDev · 2025-07-25T10:48:08Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

echarlaix

Thanks for the PR @tomaarsen ! Also would be nice to add a test for this (to ensure SparseEncoder export is not broken in the future)

echarlaix · 2025-07-25T16:31:31Z

        use_io_binding: Optional[bool] = None,
        # other arguments
        model_save_dir: Optional[Union[str, Path, TemporaryDirectory]] = None,
+        library_name: Optional[str] = None,


would prefer to have it as a class attribute instead which can be set to None by default (will be inferred in this case) and for ORTModelForMaskedLM can be set to transformers directly as it should never be set to anything else no ?

like done in optimum-intel https://github.com/huggingface/optimum-intel/blob/d93fd59aebd24ac887deb1eaab7ff6af41b09946/optimum/intel/openvino/modeling_base.py#L647

tomaarsen · 2025-07-25T16:40:08Z

Will address your comments on Monday (most likely), thank you!

Tom Aarsen

Under modeling_diffusion it looks like ORTModel isn't used

tomaarsen · 2025-07-28T10:56:34Z

I've used the class attribute approach that you proposed, and also added a test using https://huggingface.co/sparse-encoder-testing/splade-bert-tiny-nq. I'm also using this model in my own tests, so it'll stay. It's a good example of a model that would be automatically detected as Sentence Transformers, but we might want to load with ...ModelForMaskedLM.

Tom Aarsen

echarlaix

LGTM, thanks @tomaarsen

echarlaix · 2025-07-29T12:19:03Z

Also the optimum onnx / ort integration is moved in https://github.com/huggingface/optimum-onnx @tomaarsen. I'll take care of opening a PR to add these PR changes there as well

tomaarsen · 2025-07-29T14:09:51Z

Oh, good to know! Thank you.

from huggingface/optimum#2328 --------- Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>

Propagate library_name parameter in from_pretrained to export

5af93d2

Required to avoid automatic inferring of the library_name

echarlaix reviewed Jul 25, 2025

View reviewed changes

tomaarsen added 2 commits July 28, 2025 12:54

Use class attribute for ORTModel instead

6d46211

Under modeling_diffusion it looks like ORTModel isn't used

Add test case

fb3c850

tomaarsen requested a review from echarlaix July 28, 2025 10:59

echarlaix approved these changes Jul 29, 2025

View reviewed changes

Comment thread optimum/onnxruntime/modeling_ort.py Outdated

Update optimum/onnxruntime/modeling_ort.py

c7370ab

echarlaix merged commit 689c0b5 into huggingface:main Jul 29, 2025
40 of 42 checks passed

echarlaix mentioned this pull request Aug 1, 2025

Propagate library_name parameter in from_pretrained to export huggingface/optimum-onnx#29

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Propagate library_name parameter in from_pretrained to export#2328

Propagate library_name parameter in from_pretrained to export#2328
echarlaix merged 4 commits into
huggingface:mainfrom
tomaarsen:feat/propagate_library_name

tomaarsen commented Jul 25, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jul 25, 2025

Uh oh!

echarlaix left a comment

Uh oh!

echarlaix Jul 25, 2025

Uh oh!

tomaarsen commented Jul 25, 2025

Uh oh!

tomaarsen commented Jul 28, 2025

Uh oh!

echarlaix left a comment

Uh oh!

Uh oh!

Uh oh!

echarlaix commented Jul 29, 2025

Uh oh!

tomaarsen commented Jul 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

tomaarsen commented Jul 25, 2025

Pull Request overview

Details

Uh oh!

HuggingFaceDocBuilderDev commented Jul 25, 2025

Uh oh!

echarlaix left a comment

Choose a reason for hiding this comment

Uh oh!

echarlaix Jul 25, 2025

Choose a reason for hiding this comment

Uh oh!

tomaarsen commented Jul 25, 2025

Uh oh!

tomaarsen commented Jul 28, 2025

Uh oh!

echarlaix left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

echarlaix commented Jul 29, 2025

Uh oh!

tomaarsen commented Jul 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants