Add ORTModel support for custom tasks by JingyaHuang · Pull Request #303 · huggingface/optimum

JingyaHuang · 2022-07-18T07:23:54Z

What does this PR do?

ORTModelForXXX.model decided valid inputs and outputs of the model's forward method, thus the creation of inputs in the forward method can be abstract, and also the outputs. This would allow the ORTModels to be more flexible.

e.g. In ORTTrainer, the evaluation includes labels as input and loss as output. With the PR, it will enable us to replace bare inference sessions with ORTModels more easily.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2022-07-18T07:27:41Z

The documentation is not available anymore as the PR was closed or merged.

regisss

Thanks for this @JingyaHuang!!
Just one nit

philschmid

I am not sure if we should make this change for enabling something, which the classes aren't designed for.
You said

e.g. In ORTTrainer, the evaluation includes labels as input and loss as output. With the PR, it will enable us to replace bare inference sessions with ORTModels more easily.

This sounds more that we should use evaluate and pipeline or the ORTModel in the trainer with the post-processing outside of the model.

…gingface/optimum into jingya-refactoring-ort-model

philschmid · 2022-07-18T17:33:27Z

I still do not understand the purpose of this change.

As mentioned before the ORTModelForXX were introduced for inference and not for training. The idea was to be able to have API compatible Model classes, which can be used with pipelines without the need to re-write the pre - & post-processing.
Also, the idea is to add those inference model classes to other packages, e.g optimum-intel.
Additionally, we are looking into removing the copying and adding IOBindings to reduce latency in the future.

The changes you suggest:

slow down the inference
add a lot of complex dynamic code -> which we tried to exclude, that's why we have several ORTModelForXX classes rather than one
Add support for something ORTTrainier training specific.

The question I have is:

Is this change needed for the ORTTrainer? how are we currently doing it?
What is the benefit for the customer?

philschmid

can you add a test and then we should be good. Good idea! ✅ And if some use case emerges out of it we can add new task-specific model classes

JingyaHuang · 2022-07-22T08:44:31Z

I still do not understand the purpose of this change.

As mentioned before the ORTModelForXX were introduced for inference and not for training. The idea was to be able to have API compatible Model classes, which can be used with pipelines without the need to re-write the pre - & post-processing. Also, the idea is to add those inference model classes to other packages, e.g optimum-intel. Additionally, we are looking into removing the copying and adding IOBindings to reduce latency in the future.

The changes you suggest:

slow down the inference

add a lot of complex dynamic code -> which we tried to exclude, that's why we have several ORTModelForXX classes rather than one

Add support for something ORTTrainier training specific.

The question I have is:

Is this change needed for the ORTTrainer? how are we currently doing it?

What is the benefit for the customer?

Hi @philschmid, sorry for the late reply. I re-drafted the code, indeed it shouldn't be in other task-specific models as it will slow them down. The basic idea behind the PR is to leave some flexibility to users, it's like a fallback so that when they are using a more customized model they can still be able to benefit from the ORTModel foundation with a small sacrifice of speed.
ORTTrainer.train() is independent of the PR, but for the inference of ORTTrainer the evaluate() and predict(), I am using directly InferenceSession right now and I will replace it with ORTModel, for the predict it is pretty straight forward, but for the evaluate the model include loss thus I need something more customized, and things like ORTModelForCustomTasks shall be helpful.

JingyaHuang added 2 commits July 17, 2022 22:19

Update ort nightly docker

df3e17d

Refactoring ort models(except for clm)

83a1248

Refactoring clm

ac252dd

JingyaHuang requested review from echarlaix, lewtun and philschmid July 18, 2022 07:36

Merge with main

9ad147e

regisss approved these changes Jul 18, 2022

View reviewed changes

philschmid suggested changes Jul 18, 2022

View reviewed changes

Comment thread optimum/onnxruntime/modeling_ort.py Outdated

Comment thread optimum/onnxruntime/modeling_ort.py Outdated

JingyaHuang added 8 commits July 18, 2022 13:24

Change inputs name for clarity

fbea006

another try

2993d61

Update ort nightly docker

2b87fb2

Refactoring ort models(except for clm)

bf3a05e

Refactoring clm

b10dbb9

Change inputs name for clarity

7d02152

another try

23e63b9

Merge branch 'jingya-refactoring-ort-model' of https://github.com/hug…

445cbc6

…gingface/optimum into jingya-refactoring-ort-model

JingyaHuang commented Jul 18, 2022

View reviewed changes

Comment thread optimum/onnxruntime/modeling_ort.py Outdated

Revert and create a class for custom models

3f05d5f

philschmid approved these changes Jul 22, 2022

View reviewed changes

Comment thread optimum/onnxruntime/modeling_ort.py

JingyaHuang changed the title ~~Refactoring ort model inputs and outputs~~ Add ORTModel support for custom tasks Jul 22, 2022

JingyaHuang added 6 commits July 26, 2022 14:14

Add example for custom task

ce24984

Fix doc

08e2268

Fix docstring

1b8fb62

Add to import

1126fca

Merge with main

b21e41a

Fix key name

24be97b

JingyaHuang added 23 commits August 1, 2022 19:27

Revert and create a class for custom models

51d671e

Add example for custom task

c5a97d9

Fix doc

f8f09b8

Fix docstring

c68b529

Add to import

34acf9a

Fix key name

b37d76f

Refactoring ort models(except for clm)

12fa2fc

Refactoring clm

926bd43

Change inputs name for clarity

0f73601

another try

296c705

Refactoring ort models(except for clm)

fc17aff

Refactoring clm

4e13123

Change inputs name for clarity

9ec19d5

another try

abe2197

Revert and create a class for custom models

9b47b44

Add example for custom task

5387050

Fix docstring

0eeb4cf

Merge with main

7106362

Fix key name

d9c9cf6

Add test

194db9f

Fix test

5a2653a

Remove unused import

34ded61

pull from remote

6e568a7

JingyaHuang changed the base branch from main to doc-builder-habana-test August 1, 2022 20:05

JingyaHuang changed the base branch from doc-builder-habana-test to main August 1, 2022 20:06

JingyaHuang added 3 commits August 1, 2022 20:10

Fix style

011a01e

remove duplicated

21cb134

Merge with main

68e104f

JingyaHuang merged commit d3c0b75 into main Aug 3, 2022

JingyaHuang deleted the jingya-refactoring-ort-model branch August 3, 2022 09:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ORTModel support for custom tasks#303

Add ORTModel support for custom tasks#303
JingyaHuang merged 69 commits into
mainfrom
jingya-refactoring-ort-model

JingyaHuang commented Jul 18, 2022 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Jul 18, 2022 •

edited

Loading

Uh oh!

regisss left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

philschmid left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

philschmid commented Jul 18, 2022 •

edited

Loading

Uh oh!

philschmid left a comment

Uh oh!

Uh oh!

JingyaHuang commented Jul 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

JingyaHuang commented Jul 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Uh oh!

HuggingFaceDocBuilderDev commented Jul 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

regisss left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

philschmid left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

philschmid commented Jul 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

philschmid left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

JingyaHuang commented Jul 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

JingyaHuang commented Jul 18, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Jul 18, 2022 •

edited

Loading

philschmid commented Jul 18, 2022 •

edited

Loading