Add Op(_native_batch_norm_legit_no_training and _native_batch_norm_legit) | feat(torchlib) #1116


Merged

Conversation

titaiwangms
Contributor

Fix #817

Add support for _native_batch_norm_legit_no_training and _native_batch_norm_legit, two new ATen ops that replace aten::native_batch_norm, per https://github.com/pytorch/pytorch/blob/a44f8894fa6d973693aab44a3dda079a168b05c1/torch/_decomp/decompositions.py#L1501-L1510.

Prior to this PR, because _native_batch_norm_legit_no_training and _native_batch_norm_legit were unsupported, the exporter decomposed native_batch_norm into many other nodes, which dragged down performance.

NOTE: The result-size mismatch between CUDA and CPU export is not resolved by supporting these nodes; it could be fixed somewhere else.
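For reference, a minimal sketch of the shape of the new registration: the no-training variant can delegate to the existing aten_native_batch_norm with training hard-coded to False. This is an illustration, not necessarily the exact merged code; the imports assume the torchlib module layout at the time of this PR, and the parameter defaults are assumptions:

from typing import Optional, Tuple

from onnxscript.function_libs.torch_lib.ops.core import aten_native_batch_norm
from onnxscript.function_libs.torch_lib.registration import torch_op
from onnxscript.function_libs.torch_lib.tensor_typing import TFloat


@torch_op("aten::_native_batch_norm_legit_no_training", trace_only=True)
def aten_native_batch_norm_no_training(
    input: TFloat,
    weight: Optional[TFloat] = None,
    bias: Optional[TFloat] = None,
    running_mean: Optional[TFloat] = None,
    running_var: Optional[TFloat] = None,
    momentum: float = 0.9,
    eps: float = 1e-05,
) -> Tuple[TFloat, TFloat, TFloat]:
    # Inference-only batch norm: reuse the existing native_batch_norm
    # implementation with training fixed to False.
    return aten_native_batch_norm(
        input, weight, bias, running_mean, running_var, False, momentum, eps
    )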

Tested with the code:

import onnxruntime
import torch


def repro_split():
    class Model(torch.nn.Module):
        def __init__(self):
            super().__init__()
            self.bn = torch.nn.BatchNorm2d(64)
            self.conv = torch.nn.Conv2d(64, 64, 3)

        def forward(self, x):
            x = self.bn(x)
            x = self.conv(x)
            return torch.split(x, [16, 24, 24], 1)

    model = Model().cuda().eval()
    x = torch.randn(1, 64, 32, 32).cuda()
    export_output = torch.onnx.dynamo_export(model, x)

    # Sanity-check that the exported proto loads in ONNX Runtime, then save it.
    onnxruntime.InferenceSession(export_output.model_proto.SerializeToString())
    export_output.save("coat_lite_mini.onnx")
    export_output.save_diagnostics("debug_bn.sarif")

    # Run the saved model through ONNX Runtime.
    session = onnxruntime.InferenceSession("coat_lite_mini.onnx")
    input_names = [ort_input.name for ort_input in session.get_inputs()]
    onnx_format_args = export_output.adapt_torch_inputs_to_onnx(x)
    ort_input = {k: v.cpu().numpy() for k, v in zip(input_names, onnx_format_args)}
    print(session.run(None, ort_input))


repro_split()
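To double-check that the batch norm is no longer decomposed, one can inspect the op types in the saved model (this check is my addition, not part of the original repro):

import onnx

model = onnx.load("coat_lite_mini.onnx")
print(sorted({node.op_type for node in model.graph.node}))
# With these ops supported, a BatchNormalization node should appear here
# instead of the long chain of elementwise ops from the decomposition.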

@titaiwangms added the module: torchlib label (Related to the torch/aten function lib in development) on Oct 27, 2023

codecov bot commented Oct 27, 2023

Codecov Report

Merging #1116 (6323b65) into main (70843ef) will increase coverage by 0.01%.
The diff coverage is 75.00%.

@@            Coverage Diff             @@
##             main    #1116      +/-   ##
==========================================
+ Coverage   78.44%   78.45%   +0.01%     
==========================================
  Files         118      118              
  Lines       15018    15021       +3     
  Branches     1599     1599              
==========================================
+ Hits        11781    11785       +4     
+ Misses       2870     2867       -3     
- Partials      367      369       +2     
Files                                           | Coverage Δ
onnxscript/function_libs/torch_lib/ops/core.py  | 79.74% <75.00%> (+0.05%) ⬆️

# _native_batch_norm_legit_no_training and _native_batch_norm_legit are meant to
# replace native_batch_norm within an unknown time period.
# TODO: Refactor this after native_batch_norm is deprecated.
@torch_op("aten::_native_batch_norm_legit_no_training", trace_only=True)
def aten_native_batch_norm_no_training(
Collaborator


aten__native_batch_norm_no_training
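If I read this one-line suggestion correctly, it asks for torchlib's double-underscore convention for ATen ops whose names begin with an underscore. A hypothetical before/after of the function name:

# Name as written in this PR:
def aten_native_batch_norm_no_training(*args): ...

# Suggested name: the double underscore after "aten" mirrors the leading
# underscore in "_native_batch_norm_legit_no_training".
def aten__native_batch_norm_no_training(*args): ...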

# NOTE: https://github.com/pytorch/pytorch/blob/a44f8894fa6d973693aab44a3dda079a168b05c1/torch/_decomp/decompositions.py#L1501-L1510
# _native_batch_norm_legit_no_training and _native_batch_norm_legit are meant to
# replace native_batch_norm within an unknown time period.
# TODO: Refactor this after native_batch_norm is deprecated.
Collaborator


Create an issue to track the todo?

Contributor Author


@justinchuby
Collaborator

@titaiwangms just minor follow-ups - thanks!

Labels
module: torchlib Related to the torch/aten function lib in development
Development

Successfully merging this pull request may close these issues.

[torchlib] aten batch_norm ops have different size results on "CPU" vs "CUDA"
3 participants