Fixes for ORT 1.13.1 by regisss · Pull Request #430 · huggingface/optimum

regisss · 2022-10-24T18:38:22Z

What does this PR do?

This PR fixes a couple of errors that appeared with ORT 1.13.1:

disable_shape_inference is added to OptimizationConfig since it has been added to FusionOptions in ORT (see Add --disable_shape_inference option to optimizer.py microsoft/onnxruntime#12215),
replace input_qType by activation_qType in quantization.py since this argument name has changed in ORT (see Splitting quantize_tensor and quantize_input microsoft/onnxruntime#12873).

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2022-10-24T18:56:25Z

The documentation is not available anymore as the PR was closed or merged.

echarlaix

Thanks for taking care of this @regisss !

echarlaix · 2022-10-25T07:56:56Z

            mode=quantization_config.mode,
            weight_qType=quantization_config.weights_dtype,
-            input_qType=quantization_config.activations_dtype,
+            activation_qType=quantization_config.activations_dtype,


In order for us to support onnxruntime version lower than 1.13.1, could you adapt the quantizer_factory behavior depending on which onnxruntime version is used ?

You're absolutely right, I just added it!

echarlaix

Thanks a lot @regisss

ghost · 2022-10-26T07:26:55Z

Thank you very much @regisss!

Especially since opimization is currently not working for me because of this missing field:

File "/srv/./app/steps/transformers/convert_transformer_to_onnx_step.py", line 43, in convert_transformer_to_onnx_step
    _ = ORTOptimizer.from_pretrained(
        │            └ <classmethod object at 0x7f9e158abf70>
        └ <class 'optimum.onnxruntime.optimization.ORTOptimizer'>

  File "/usr/local/lib/python3.9/site-packages/optimum/onnxruntime/optimization.py", line 128, in optimize
    optimization_options = FusionOptions.parse(optimization_config)
                           │             │     └ OptimizationConfig(optimization_level=1, optimize_for_gpu=False, fp16=False, optimize_with_onnxruntime_only=False, disable_ge...
                           │             └ <staticmethod object at 0x7f9e15633340>
                           └ <class 'onnxruntime.transformers.fusion_options.FusionOptions'>
  File "/usr/local/lib/python3.9/site-packages/onnxruntime/transformers/fusion_options.py", line 64, in parse
    if args.disable_shape_inference:
       └ OptimizationConfig(optimization_level=1, optimize_for_gpu=False, fp16=False, optimize_with_onnxruntime_only=False, disable_ge...

AttributeError: 'OptimizationConfig' object has no attribute 'disable_shape_inference'

Does anybody know a workaround or should I just wait for the patch? Will it be also cherry picked into 1.3.1 or only directly to 1.4.1?

Thanks!

regisss · 2022-10-26T07:40:35Z

Hi @nmaoez! If you need the fix ASAP, the best is to install Optimum from source as follows:

git clone https://github.com/huggingface/optimum.git
cd optimum/
pip install .[onnxruntime]

A patch is coming very soon but I don't know if this will be integrated to it. Maybe @echarlaix can tell more about this.

echarlaix · 2022-10-26T08:25:54Z

Hi @nmaoez @regisss, the optimum release 1.4.1 is out

ghost · 2022-10-26T08:56:23Z

@echarlaix Amazing! Thanks!

regisss added 2 commits October 24, 2022 20:15

Fix ORT 1.13.1

53b3882

Add docstring

b018d61

regisss marked this pull request as ready for review October 24, 2022 18:57

regisss requested a review from echarlaix October 24, 2022 18:58

echarlaix reviewed Oct 25, 2022

View reviewed changes

regisss added 2 commits October 25, 2022 10:12

Add management of former ORT versions

0a4a309

Correct version comparison

e655c58

echarlaix approved these changes Oct 25, 2022

View reviewed changes

echarlaix merged commit 9366f84 into huggingface:main Oct 25, 2022

regisss deleted the fixes branch October 26, 2022 07:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes for ORT 1.13.1#430

Fixes for ORT 1.13.1#430
echarlaix merged 4 commits into
huggingface:mainfrom
regisss:fixes

regisss commented Oct 24, 2022

Uh oh!

HuggingFaceDocBuilderDev commented Oct 24, 2022 •

edited

Loading

Uh oh!

echarlaix left a comment

Uh oh!

echarlaix Oct 25, 2022

Uh oh!

regisss Oct 25, 2022

Uh oh!

echarlaix left a comment

Uh oh!

ghost commented Oct 26, 2022

Uh oh!

regisss commented Oct 26, 2022

Uh oh!

echarlaix commented Oct 26, 2022

Uh oh!

ghost commented Oct 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

regisss commented Oct 24, 2022

What does this PR do?

Before submitting

Uh oh!

HuggingFaceDocBuilderDev commented Oct 24, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

echarlaix left a comment

Choose a reason for hiding this comment

Uh oh!

echarlaix Oct 25, 2022

Choose a reason for hiding this comment

Uh oh!

regisss Oct 25, 2022

Choose a reason for hiding this comment

Uh oh!

echarlaix left a comment

Choose a reason for hiding this comment

Uh oh!

ghost commented Oct 26, 2022

Uh oh!

regisss commented Oct 26, 2022

Uh oh!

echarlaix commented Oct 26, 2022

Uh oh!

ghost commented Oct 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

HuggingFaceDocBuilderDev commented Oct 24, 2022 •

edited

Loading