Adding support for float16 conversion for GPUs by philschmid · Pull Request #273 · huggingface/optimum

philschmid · 2022-07-08T16:45:48Z

What does this PR do?

This PR adds support for converting model weights from fp32 to fp16 by adding a new optimization parameter. If the fp16 argument is set in the OptimizationConfig, the weights are converted. I also added a test to make sure the converted model does not contain any fp32 weights.
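For illustration, here is a minimal sketch of the kind of check such a test could perform. It is not the test from this PR. The helper name `fp32_initializer_names` is hypothetical, and the model is a stand-in object that only mimics the `graph.initializer` layout of an ONNX model, so the snippet runs without onnx installed. The two dtype codes are the ONNX `TensorProto` enum values for FLOAT and FLOAT16.

```python
from types import SimpleNamespace

# ONNX TensorProto.DataType enum values.
ONNX_FLOAT = 1      # float32
ONNX_FLOAT16 = 10   # float16


def fp32_initializer_names(model):
    """Return the names of graph initializers still stored as float32.

    Hypothetical helper: `model` is anything exposing `graph.initializer`,
    where each entry has `name` and `data_type` attributes.
    """
    return [
        init.name
        for init in model.graph.initializer
        if init.data_type == ONNX_FLOAT
    ]


# Stand-in for a loaded model (assumption: real code would load it with onnx).
fake_model = SimpleNamespace(
    graph=SimpleNamespace(
        initializer=[
            SimpleNamespace(name="encoder.weight", data_type=ONNX_FLOAT16),
            SimpleNamespace(name="encoder.bias", data_type=ONNX_FLOAT),
        ]
    )
)

leftover = fp32_initializer_names(fake_model)
print(leftover)
```

A test along these lines would assert that the returned list is empty for a fully converted model.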

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2022-07-08T16:48:58Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

regisss

LGTM @philschmid !!
I just left two minor comments.

Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>

echarlaix

LGTM, thanks for this addition @philschmid

mfuntowicz · 2022-07-08T18:54:53Z

+                onnx_optimized_model_output_path=optimized_model_path,
+                optimization_config=optimization_config,
+            )
+            model = onnx.load(optimized_model_path.as_posix())


Can you add a test for the relative difference in the output? I'm not a huge fan of changing the dtype for all the layers like this.

philschmid · Jul 8, 2022

Not really sure why, since we are not applying any custom logic.
What if the difference is larger? What do you expect from the test?

philschmid · Jul 10, 2022

Added the test, but I'm not sure what the benefit is.
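To make the discussion concrete, the following is a rough sketch of a relative-difference check. It is not the test added in this PR: it uses plain Python values instead of model outputs, the helper names `to_fp16` and `max_relative_difference` are hypothetical, and the tolerance of 2**-11 is an assumption based on the rounding error of half precision for normal values, not a threshold taken from the PR.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def max_relative_difference(reference, candidate, eps=1e-12):
    """Largest |ref - cand| / |ref| over paired values (eps avoids division by zero)."""
    return max(
        abs(r - c) / max(abs(r), eps)
        for r, c in zip(reference, candidate)
    )


# Stand-in "outputs": reference values and their half-precision roundings.
fp32_outputs = [0.1234567, -2.5, 3.14159, 1024.75]
fp16_outputs = [to_fp16(v) for v in fp32_outputs]

rel_diff = max_relative_difference(fp32_outputs, fp16_outputs)

# Assumed tolerance: half precision keeps 11 significant bits, so rounding a
# normal value changes it by at most a factor of 2**-11.
TOLERANCE = 2 ** -11
print(rel_diff, rel_diff <= TOLERANCE)
```

In a real model the error accumulates across layers, so the observed difference between fp32 and fp16 outputs can exceed the per-value rounding bound; a practical test would pick its tolerance empirically.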

philschmid added 2 commits July 8, 2022 16:43

adding support for float16 conversion for GPUs

4be594c

make style

638b468

philschmid requested a review from echarlaix July 8, 2022 16:45

regisss approved these changes Jul 8, 2022

Comment thread optimum/onnxruntime/configuration.py Outdated

Comment thread tests/onnxruntime/test_optimization.py

Update optimum/onnxruntime/configuration.py

0469f06

Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>

echarlaix approved these changes Jul 8, 2022

mfuntowicz reviewed Jul 8, 2022

philschmid added 3 commits July 9, 2022 20:47

add relative test

f75f733

added posix path

c188798

added from_transformers conversion for config.json

4e5e568

philschmid merged commit 65ad733 into main Jul 10, 2022

philschmid deleted the add-fp-16-optimization branch July 10, 2022 19:46

philschmid merged 6 commits into main from add-fp-16-optimization