Remove FP16_Optimizer patch for DeepSpeed by Rohan138 · Pull Request #2213 · huggingface/optimum

Rohan138 · 2025-03-11T21:50:26Z

Deepspeed (since 2022) already includes the FusedAdam FP16_Optimizer originally from NVIDIA/apex here (and it's better maintained wrt deepspeed grad_norm, MOE training, etc): https://github.com/deepspeedai/DeepSpeed/blob/master/deepspeed/runtime/fp16/fused_optimizer.py

Currently this line gives a warning saying:

/opt/conda/envs/py_3.10/lib/python3.10/site-packages/onnxruntime/training/optim/_modifier_registry.py:56: UserWarning: Skip modifying optimizer because of optimizer name not found in the registry: accelerate.utils.deepspeed.DeepSpeedOptimizerWrapper

which is actually caused by get_full_qualified_type_name(optimizer.optimizer) = 'deepspeed.runtime.fp16.fused_optimizer.FP16_Optimizer' not being in the onnxruntime registry: https://github.com/microsoft/onnxruntime/blob/main/orttraining/orttraining/python/training/optim/_modifier_registry.py

i.e. the FP16 Optimizer from onnxruntime (https://github.com/microsoft/onnxruntime/blob/main/orttraining/orttraining/python/training/optim/fp16_optimizer.py) is not actually doing anything, just falling back to the DeepSpeed fused Adam optimizer anyway, so this line and the warning it causes are redundant.

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Who can review?

@JingyaHuang

Deepspeed already includes the same FusedAdam FP16_Optimizer originally from NVIDIA/apex here: https://github.com/deepspeedai/DeepSpeed/blob/master/deepspeed/runtime/fp16/fused_optimizer.py Currently this line gives a warning saying: ``` /opt/conda/envs/py_3.10/lib/python3.10/site-packages/onnxruntime/training/optim/_modifier_registry.py:56: UserWarning: Skip modifying optimizer because of optimizer name not found in the registry: accelerate.utils.deepspeed.DeepSpeedOptimizerWrapper ``` i.e. the FP16 Optimizer from onnxruntime (https://github.com/microsoft/onnxruntime/blob/main/orttraining/orttraining/python/training/optim/fp16_optimizer.py) is not actually wrapping the DeepSpeed fused Adam optimizer anyway, so this line is redundant.

HuggingFaceDocBuilderDev · 2025-04-02T10:59:50Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

IlyasMoutawwakil

LGTM, tests are passing !
is onnxruntime-training still useful for you ? we are discussing deprecating it since usage is so low.

IlyasMoutawwakil added the onnxruntime-training label Apr 2, 2025

IlyasMoutawwakil approved these changes Apr 2, 2025

View reviewed changes

IlyasMoutawwakil merged commit 26b5b1e into huggingface:main Apr 14, 2025

Rohan138 mentioned this pull request Jul 8, 2025

onnxruntime-training package has been deprecated pytorch/ort#199

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove FP16_Optimizer patch for DeepSpeed#2213

Remove FP16_Optimizer patch for DeepSpeed#2213
IlyasMoutawwakil merged 1 commit into
huggingface:mainfrom
Rohan138:patch-1

Rohan138 commented Mar 11, 2025 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Apr 2, 2025

Uh oh!

IlyasMoutawwakil left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Rohan138 commented Mar 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

HuggingFaceDocBuilderDev commented Apr 2, 2025

Uh oh!

IlyasMoutawwakil left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Rohan138 commented Mar 11, 2025 •

edited

Loading