Improve the compatibility dealing with large ONNX proto in ORTOptimizer and ORTQuantizer by JingyaHuang · Pull Request #332 · huggingface/optimum

JingyaHuang · 2022-08-02T13:29:09Z

What does this PR do?

Add all_tensors_to_one_file option to ORTOptimizer when exporting the ONNX model.

P.S. only the ONNXModel of optimization has added the option, not quantization.

Refactoring the compatibility of ORTOptimizer and ORTQuantizer in cases of large ONNX proto(export path / load ONNX)
Update the optimization and quantization examples with the new option.

Fixes #222

Related to onnx/onnx#4394

P.S.

Currently the Optimizer exports the original ONNX to the same folder as the optimized ONNX model, ORTOptimizer would better cache the original model as theORTQuantizer, this PR will not target the issue, it could be done either in the refactorization PR ORT optimizer refactorization #294 or in a new PR.
onnx offers options: save_as_external_data, all_tensors_to_one_file, size_threshold(1024), convert_attribute(false), and the last two are not supported to be self-defined in onnxruntime yet. But in general, the first two are the most effective.
Currently onnxruntime export external files and model proto after quantization to different directory and this might need to be fixed from their side. Potential updates on ONNXModel.save_model_to_file of quantization and optimization microsoft/onnxruntime#12576

…ace/optimum into add_onnx_export_options

HuggingFaceDocBuilderDev · 2022-08-08T16:39:36Z

The documentation is not available anymore as the PR was closed or merged.

…ace/optimum into add_onnx_export_options

regisss

Thanks for this PR @JingyaHuang!!
I just have one comment which may be unrelated to this PR.

regisss

LGTM, thanks Jingya!!

JingyaHuang

Thanks for reviewing, I will put the arguments in ORTModel as suggested and make necessary changes in the examples.

echarlaix

LGTM, thanks @JingyaHuang !

JingyaHuang · 2022-11-28T18:49:52Z

Moved config(model and ORT) saving after saving the model, otherwise will get the following error as configs are saved in the folder:

RuntimeError: Output directory (results/albert_largev1_squad2) for external data is not empty.

While running the template example, I got the error message in the evaluation phase(no problem with the optimization, and models with external files are properly saved):

Traceback (most recent call last):
  File "run_qa.py", line 540, in <module>
    main()
  File "run_qa.py", line 492, in main
    metrics = compute_metrics(predictions)
  File "run_qa.py", line 460, in compute_metrics
    return metric.compute(predictions=p.predictions, references=p.label_ids)
  File "/usr/local/lib/python3.8/dist-packages/evaluate/module.py", line 444, in compute
    output = self._compute(**inputs, **compute_kwargs)
  File "/root/.cache/huggingface/modules/evaluate_modules/metrics/evaluate-metric--squad/b4e2dbca455821c7367faa26712f378254b69040ebaab90b64bdeb465e4a304d/squad.py", line 110, in _compute
    score = compute_score(dataset=dataset, predictions=pred_dict)
  File "/root/.cache/huggingface/modules/evaluate_modules/metrics/evaluate-metric--squad/b4e2dbca455821c7367faa26712f378254b69040ebaab90b64bdeb465e4a304d/compute_score.py", line 67, in compute_score
    exact_match += metric_max_over_ground_truths(exact_match_score, prediction, ground_truths)
  File "/root/.cache/huggingface/modules/evaluate_modules/metrics/evaluate-metric--squad/b4e2dbca455821c7367faa26712f378254b69040ebaab90b64bdeb465e4a304d/compute_score.py", line 52, in metric_max_over_ground_truths
    return max(scores_for_ground_truths)
ValueError: max() arg is an empty sequence

Seems like a bug with the metric, an update might need to be done with the examples. I can take it over if you don't have bandwidth @echarlaix.

JingyaHuang added 8 commits August 2, 2022 13:08

Add all_tensors_to_one_file option and update qa example

8f94a68

Put back readme

6f772e2

Add disable_shape_inference to optim config

82f7b60

Merge with main

bd45fbf

Add all_tensors_to_one_file option and update qa example

dea7277

Put back readme

ea53730

Add disable_shape_inference to optim config

3b25e14

Merge branch 'add_onnx_export_options' of https://github.com/huggingf…

d269189

…ace/optimum into add_onnx_export_options

JingyaHuang and others added 4 commits August 11, 2022 09:16

Add all_tensors_to_one_file option and update qa example

bb412bf

Put back readme

63f5441

Add disable_shape_inference to optim config

15b23d4

Merge branch 'add_onnx_export_options' of https://github.com/huggingf…

38402e6

…ace/optimum into add_onnx_export_options

JingyaHuang changed the base branch from main to add-causallm-with-pkv August 11, 2022 09:17

JingyaHuang changed the base branch from add-causallm-with-pkv to main August 11, 2022 09:17

JingyaHuang changed the title ~~Add more optional configs for exporting large ModelProto~~ Improve the compatibility dealing with large ONNX proto in ORTOptimizer and ORTQuantizer Aug 11, 2022

JingyaHuang and others added 12 commits August 11, 2022 10:21

Add all_tensors_to_one_file to quantizer

51f9c0f

Fix quantization path

fa66665

Remove all to one from quantizer

2538927

Update quantization examples(except for multiple choices)

672afd4

Add all_tensors_to_one_file option and update qa example

3362473

Put back readme

f688e88

Add disable_shape_inference to optim config

77a7c3b

Add all_tensors_to_one_file to quantizer

5ee0e21

Fix quantization path

d138a06

Remove all to one from quantizer

543f2f6

Update quantization examples(except for multiple choices)

d7e0a15

Merge branch 'add_onnx_export_options' of https://github.com/huggingf…

a0135a5

…ace/optimum into add_onnx_export_options

JingyaHuang changed the base branch from main to add-causallm-with-pkv August 12, 2022 16:24

JingyaHuang changed the base branch from add-causallm-with-pkv to main August 12, 2022 16:24

JingyaHuang added 2 commits August 12, 2022 17:33

swag example of quantization

fd7e458

Update optimization exemples

bfa6018

JingyaHuang marked this pull request as ready for review August 12, 2022 17:41

JingyaHuang requested review from echarlaix and philschmid August 12, 2022 17:42

regisss reviewed Aug 14, 2022

View reviewed changes

Comment thread optimum/onnxruntime/quantization.py Outdated

JingyaHuang added 2 commits August 22, 2022 14:48

Save path typing

aa98055

Merge branch 'main' into add_onnx_export_options

4c0d45c

JingyaHuang requested a review from regisss August 23, 2022 07:43

regisss approved these changes Aug 23, 2022

View reviewed changes

Resolved merge conflict with Optimizer refactoring

d49a411

JingyaHuang commented Aug 26, 2022

View reviewed changes

Comment thread optimum/onnxruntime/optimization.py Outdated

JingyaHuang commented Aug 26, 2022

View reviewed changes

Comment thread optimum/onnxruntime/quantization.py

echarlaix reviewed Sep 5, 2022

View reviewed changes

Comment thread examples/onnxruntime/optimization/multiple-choice/run_swag.py

Comment thread optimum/onnxruntime/optimization.py Outdated

Merge branch 'main' into add_onnx_export_options

af5c15b

JingyaHuang commented Sep 8, 2022

View reviewed changes

Comment thread optimum/onnxruntime/optimization.py Outdated

Comment thread examples/onnxruntime/optimization/multiple-choice/run_swag.py

echarlaix approved these changes Oct 10, 2022

View reviewed changes

JingyaHuang added 3 commits November 28, 2022 16:37

Merge branch 'main' into add_onnx_export_options

ac02176

Change all_tensors_to_one_file name

8d9533a

Save configs after the model

e4d24ea

JingyaHuang merged commit 0808c8c into main Nov 28, 2022

JingyaHuang deleted the add_onnx_export_options branch November 28, 2022 21:57

JingyaHuang mentioned this pull request Dec 14, 2022

Handling ONNX models with external data #586

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve the compatibility dealing with large ONNX proto in ORTOptimizer and ORTQuantizer#332

Improve the compatibility dealing with large ONNX proto in ORTOptimizer and ORTQuantizer#332
JingyaHuang merged 33 commits into
mainfrom
add_onnx_export_options

JingyaHuang commented Aug 2, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Aug 8, 2022 •

edited

Loading

Uh oh!

regisss left a comment

Uh oh!

Uh oh!

regisss left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JingyaHuang left a comment

Uh oh!

Uh oh!

Uh oh!

echarlaix left a comment •

edited

Loading

Uh oh!

JingyaHuang commented Nov 28, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

JingyaHuang commented Aug 2, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Aug 8, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

regisss left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

regisss left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JingyaHuang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

echarlaix left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

JingyaHuang commented Nov 28, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

JingyaHuang commented Aug 2, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 8, 2022 •

edited

Loading

echarlaix left a comment •

edited

Loading