PyTorch 2 Export Quantization for OpenVINO torch.compile backend #3321
Conversation
Co-authored-by: Alexander Suslov <[email protected]>
Co-authored-by: Yamini Nimmagadda <[email protected]>

🔗 Helpful Links: see artifacts and rendered test results at hud.pytorch.org/pr/pytorch/tutorials/3321
Note: Links to docs will display an error until the docs builds have been completed.
✅ No failures as of commit 0a422c2 with merge base a5632da. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
Co-authored-by: Alexander Suslov <[email protected]>
@HamidShojanazeri can someone from the partner's team take a look?
Introduction
------------

**This is an experimental feature, the quantization API is subject to change.**
Just a suggestion, but I'd put this in a note callout: .. note::
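A sketch of what that suggestion could look like in the tutorial's reStructuredText, reusing the wording from the diff (indentation per the standard admonition directive):

```rst
.. note::

   This is an experimental feature, the quantization API is subject to change.
```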
Thanks! Done
(force-pushed from 1c6bc7c to f09a85f)
Thanks, looks great!
FYI, we just copied the PyTorch PT2E quantization flow code to torchao: pytorch/ao#2048
If you have bug fixes or new features that you need in PT2E quantization, please make changes in torchao instead; we are planning to deprecate the code in pytorch/pytorch.
overall plan: https://dev-discuss.pytorch.org/t/torch-ao-quantization-migration-plan/2810
# Capture the FX Graph to be quantized
with torch.no_grad(), nncf.torch.disable_patching():
    exported_model = torch.export.export(model, example_inputs).module()
torch.export.export_for_training is the recommended API now, I think.
Edit: looks like in new PyTorch versions these two are the same; in that case we might be recommending torch.export.export since it's simpler. cc @tugsbayasgalan @gmagogsfm to confirm
OK, I checked with Tugsuu, please continue to use torch.export.export; we'll be migrating to this as well.
from torch.ao.quantization.quantize_pt2e import convert_pt2e
from torch.ao.quantization.quantize_pt2e import prepare_pt2e
For long-term support, importing from torchao.quantization.pt2e.quantize_pt2e might be better.
this requires people to install torchao nightly though: pip install --pre torchao --index-url https://download.pytorch.org/whl/nightly/cu126 # full options are cpu/cu118/cu126/cu128
but this can be done in a separate step since you might need to adapt your code to work with the torchao copy
Just a few suggestions
Co-authored-by: Svetlana Karslioglu <[email protected]>
@daniil-lyakhov there seems to be a merge conflict - can you please take a look?
Sure! Done
@HamidShojanazeri , @williamwen42, could you please take a look?
Hello there!
In this example, we would like to present a quantization pipeline that allows users to run quantized OpenVINO models optimally, without ever leaving the PyTorch ecosystem.
CC: @alexsu52, @ynimmaga, @anzr299 @AlexKoff88
cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @ZailiWang @ZhaoqiongZ @leslie-fang-intel @Xia-Weiwen @sekahler2 @CaoE @zhuhaozhe @Valentine233