Skip to content

Commit 329184a

Browse files
mcr229facebook-github-bot
authored andcommitted
Update Profiling Section in XNNPACK Delegate Docs (#3237)
Summary: Pull Request resolved: #3237 Updating Profiling Section of the docs Main point is pointing the the SDK Profiling Tutorial on how to get XNNPACK profiling information Reviewed By: metascroy, cccclai Differential Revision: D56439491 fbshipit-source-id: 1d724ffae6d89e8769ea427cb37b4ec85fe3452f
1 parent 8748d57 commit 329184a

File tree

1 file changed

+1
-9
lines changed

1 file changed

+1
-9
lines changed

docs/source/native-delegates-executorch-xnnpack-delegate.md

+1-9
Original file line numberDiff line numberDiff line change
@@ -74,16 +74,8 @@ Since weight packing creates an extra copy of the weights inside XNNPACK, We fre
7474
When executing the XNNPACK subgraphs, we prepare the tensor inputs and outputs and feed them to the XNNPACK runtime graph. After executing the runtime graph, the output pointers are filled with the computed tensors.
7575

7676
#### **Profiling**
77-
We have enabled basic profiling for XNNPACK delegate that can be enabled with the following compiler flag `-DENABLE_XNNPACK_PROFILING`. After running the model it will produce basic per-op and total timings. We provide an example of the profiling below. The timings listed are the average across runs, and the units are in microseconds.
77+
We have enabled basic profiling for XNNPACK delegate that can be enabled with the following compiler flag `-DENABLE_XNNPACK_PROFILING`. With ExecuTorch's SDK integration, you can also now use the SDK tools to profile the model. You can follow the steps in [Using the ExecuTorch SDK to Profile a Model](./tutorials/sdk-integration-tutorial) on how to profile ExecuTorch models and use SDK's Inspector API to view XNNPACK's internal profiling information.
7878

79-
```
80-
Fully Connected (NC, F32) GEMM: 109.510002
81-
Total Time: 109.510002
82-
```
83-
84-
::::{note}
85-
Profiling is a work in progress, and is planned to be integrated with [SDK Tools](sdk-delegate-integration.md) and Tensorboard.
86-
::::
8779

8880
[comment]: <> (TODO: Refactor quantizer to a more official quantization doc)
8981
## Quantization

0 commit comments

Comments
 (0)