ORTTrainer / ORTSeq2SeqTrainer refactoring - Inference with ORTModel by JingyaHuang · Pull Request #189 · huggingface/optimum

JingyaHuang · 2022-05-13T13:36:08Z

What does this PR do?

This PR replaces the bare inference sessions in ORTTrainer with subclasses of ORTModel, and enables the inference with ONNX Runtime in ORTSeq2SeqTrainer.

[Refactoring ORTTrainer]

Replace inference session in evaluation_loop_ort and prediction_loop_ort with subclasses of ORTModel.
Clean up unused in evaluation_loop_ort and prediction_loop_ort

[Refactoring ORTSeq2SeqTrainer]

Adapt OnnxConfigWithPastAndLoss on the top of DecoderOnnxConfig.
Generate dummy labels and compute of loss in _DecoderWithLMhead.
Add labels and the loss in sequence-to-sequence ort modeling.
Override evaluation_loop_ort and prediction_loop_ort to leverage ORTModelForSeq2SeqLM.

…ngface/optimum into refactoring-orttrainer-inf

HuggingFaceDocBuilderDev · 2022-08-22T18:06:37Z

The documentation is not available anymore as the PR was closed or merged.

regisss

It looks great Jingya!

echarlaix

LGTM, well done @JingyaHuang 🔥

* Override export of ORTSeq2SeqTrainer * Do not force download by default in ORTModel (#356) * Update OnnxConfigWithLoss wrapper * ORT optimizer refactorization (#294) * Refactorization of ORTOptimizer * Refactorization of ORTModel * Adapt examples according to refactorization * Adapt tests * Fix style * Remove quantizer modification * Fix style * Apply modifications from #270 for quantizer and optimizer to have same behavior * Add test for optimization of Seq2Seq models * Fix style * Add ort config saving when optimizing a model * Add ort config saving when quantizing a model * Add tests * Fix style * Adapt optimization examples * Fix readme * Remove unused parameter * Adapt quantization examples * Fix quantized model and ort config saving * Add documentation * Add model configuration saving to simplify loading of optimized model * Fix style * Fix description * Fix quantization tests * Remove opset argument which is onnx config default opset when exporting with ORTModels * Fix import (#360) * Fix export of decoders * Add flag to export only decoders * Fix ORTTrainer inference ort subclass parsing * Fix filenames when empty suffix given (#363) * fix(optimization): handle empty file suffix * fix(quantization): handle empty file suffix * use pathlibfor save_dir * run test again * Update optimum/onnxruntime/quantization.py Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com> * ReRun test that failed because of cache (network) Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com> * Override the evaluation and prediction loop in ORTSeq2SeqTrainer * Fix documentation (#369) * fix class * Update optimization.mdx * Fix label smoother device prob * Fix lm_logits and labels dimension mismatch * Clean up Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com> Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com> Co-authored-by: Pierre Snell <ierezell@gmail.com> Co-authored-by: Pierre Snell <pierre.snell@botpress.com> Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>

regisss

Great work Jingya 🔥
I just have two naive questions

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

echarlaix

LGTM 🔥

JingyaHuang added 2 commits May 13, 2022 13:19

Inference with ORTModel

68bfc28

Clean up unused imports

fff8f5d

fxmarty mentioned this pull request May 13, 2022

Remove unused argument #154

Closed

1 task

JingyaHuang requested review from echarlaix and philschmid May 13, 2022 13:40

JingyaHuang and others added 2 commits July 6, 2022 15:15

Merge with main

aae3939

Merge branch 'main' into refactoring-orttrainer-inf

265aebb

JingyaHuang removed request for echarlaix and philschmid July 6, 2022 21:32

JingyaHuang added 10 commits July 8, 2022 09:19

merge with main

0873acc

Merge with main

7d3ef97

Replace Inference session by ort model

b730e75

merge with jingya-refactoring-ort-model and main

b4fe02d

Inference with ORTModel

abc9f32

Clean up unused imports

bedf01a

Merge with main

9640d73

Replace Inference session by ort model

90e4150

merge with jingya-refactoring-ort-model and main

8e7ef16

Merge branch 'refactoring-orttrainer-inf' of https://github.com/huggi…

72bcff4

…ngface/optimum into refactoring-orttrainer-inf

JingyaHuang changed the base branch from main to jingya-refactoring-ort-model July 18, 2022 12:49

JingyaHuang changed the base branch from jingya-refactoring-ort-model to main July 18, 2022 12:50

JingyaHuang added 2 commits August 22, 2022 17:47

Update modeling for custom tasks

994c65b

Merge branch 'main' into refactoring-orttrainer-inf

5541571

JingyaHuang added 2 commits August 22, 2022 20:47

Replace in evaluation_loop

94082d3

refectoring prediction_loop

b6245b0

JingyaHuang requested review from echarlaix, michaelbenayoun and regisss August 23, 2022 07:44

regisss approved these changes Aug 23, 2022

View reviewed changes

Merge branch 'main' into refactoring-orttrainer-inf

b2ffff0

echarlaix approved these changes Sep 5, 2022

View reviewed changes

Comment thread optimum/onnxruntime/trainer.py Outdated

JingyaHuang and others added 2 commits September 7, 2022 10:22

Merge branch 'main' into refactoring-orttrainer-inf

c72938c

JingyaHuang requested review from echarlaix and regisss September 7, 2022 08:27

JingyaHuang changed the title ~~ORTTrainer refactoring - Inference with ORTModel~~ ORTTrainer / ORTSeq2SeqTrainer refactoring - Inference with ORTModel Sep 7, 2022

Fix onnx config test

5670e92

regisss reviewed Sep 7, 2022

View reviewed changes

Comment thread optimum/onnxruntime/trainer.py

Comment thread optimum/onnxruntime/trainer.py

regisss approved these changes Sep 7, 2022

View reviewed changes

Merge branch 'main' into refactoring-orttrainer-inf

05ec3ef

JingyaHuang requested a review from philschmid September 14, 2022 10:19

JingyaHuang mentioned this pull request Sep 14, 2022

Update ORT training examples & add summarization example #383

Merged

5 tasks

echarlaix requested changes Sep 19, 2022

View reviewed changes

Comment thread optimum/onnx/modeling_seq2seq.py Outdated

Comment thread optimum/onnxruntime/modeling_seq2seq.py Outdated

Comment thread optimum/onnxruntime/modeling_seq2seq.py Outdated

Comment thread optimum/onnxruntime/modeling_seq2seq.py

JingyaHuang and others added 3 commits September 19, 2022 14:29

detect labels from input names

ce0a1ce

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

Detect loss from output names

161ce7f

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

Put back decoder with past

8e6fb8d

JingyaHuang requested a review from echarlaix September 19, 2022 13:27

JingyaHuang added 2 commits September 19, 2022 13:36

Merged with main

86b4593

Put past key values to the correct place

f9eb070

echarlaix reviewed Sep 19, 2022

View reviewed changes

Comment thread optimum/onnxruntime/modeling_seq2seq.py Outdated

Comment thread optimum/onnxruntime/modeling_seq2seq.py Outdated

JingyaHuang and others added 3 commits September 19, 2022 17:16

remove if/else statement

ae19c72

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

Revert the inference code

4fea7a3

Merge branch 'main' into refactoring-orttrainer-inf

3d961ea

echarlaix approved these changes Sep 19, 2022

View reviewed changes

JingyaHuang merged commit eada157 into main Sep 19, 2022

JingyaHuang deleted the refactoring-orttrainer-inf branch September 19, 2022 16:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ORTTrainer / ORTSeq2SeqTrainer refactoring - Inference with ORTModel#189

ORTTrainer / ORTSeq2SeqTrainer refactoring - Inference with ORTModel#189
JingyaHuang merged 31 commits into
mainfrom
refactoring-orttrainer-inf

JingyaHuang commented May 13, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Aug 22, 2022 •

edited

Loading

Uh oh!

regisss left a comment

Uh oh!

echarlaix left a comment •

edited

Loading

Uh oh!

Uh oh!

regisss left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

echarlaix left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

JingyaHuang commented May 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Aug 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

regisss left a comment

Choose a reason for hiding this comment

Uh oh!

echarlaix left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

regisss left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

echarlaix left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

JingyaHuang commented May 13, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 22, 2022 •

edited

Loading

echarlaix left a comment •

edited

Loading