You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/source/llm/getting-started.md
+18-23
Original file line number
Diff line number
Diff line change
@@ -77,7 +77,7 @@ cd ../..
77
77
:::
78
78
::::
79
79
80
-
For more information, see [Setting Up ExecuTorch](https://pytorch.org/executorch/stable/getting-started-setup.html).
80
+
For more information, see [Setting Up ExecuTorch](../getting-started-setup.md).
81
81
82
82
83
83
## Running a Large Language Model Locally
@@ -161,7 +161,7 @@ with open("nanogpt.pte", "wb") as file:
161
161
162
162
To export, run the script with `python export_nanogpt.py` (or python3, as appropriate for your environment). It will generate a `nanogpt.pte` file in the current directory.
163
163
164
-
For more information, see [Exporting to ExecuTorch](https://pytorch.org/executorch/main/tutorials/export-to-executorch-tutorial.html) and
164
+
For more information, see [Exporting to ExecuTorch](../tutorials/export-to-executorch-tutorial) and
for the [XNNPACK Backend](../tutorial-xnnpack-delegate-lowering.md) and [CoreML
522
+
Backend](../build-run-coreml.md).
527
523
528
524
## Quantization
529
525
@@ -609,7 +605,7 @@ target_link_libraries(
609
605
xnnpack_backend) # Provides the XNNPACK CPU acceleration backend
610
606
```
611
607
612
-
For more information, see [Quantization in ExecuTorch](https://pytorch.org/executorch/stable/quantization-overview.html).
608
+
For more information, see [Quantization in ExecuTorch](../quantization-overview.md).
613
609
614
610
## Profiling and Debugging
615
611
After lowering a model by calling `to_backend()`, you may want to see what got delegated and what didn’t. ExecuTorch
@@ -687,7 +683,7 @@ Through the ExecuTorch SDK, users are able to profile model execution, giving ti
687
683
688
684
##### ETRecord generation (Optional)
689
685
690
-
An ETRecord is an artifact generated at the time of export that contains model graphs and source-level metadata linking the ExecuTorch program to the original PyTorch model. You can view all profiling events without an ETRecord, though with an ETRecord, you will also be able to link each event to the types of operators being executed, module hierarchy, and stack traces of the original PyTorch source code. For more information, see [https://pytorch.org/executorch/main/sdk-etrecord.html](https://pytorch.org/executorch/main/sdk-etrecord.html)
686
+
An ETRecord is an artifact generated at the time of export that contains model graphs and source-level metadata linking the ExecuTorch program to the original PyTorch model. You can view all profiling events without an ETRecord, though with an ETRecord, you will also be able to link each event to the types of operators being executed, module hierarchy, and stack traces of the original PyTorch source code. For more information, see [the ETRecord docs](../sdk-etrecord.md).
691
687
692
688
693
689
In your export script, after calling `to_edge()` and `to_executorch()`, call `generate_etrecord()` with the `EdgeProgramManager` from `to_edge()` and the `ExecuTorchProgramManager` from `to_executorch()`. Make sure to copy the `EdgeProgramManager`, as the call to `to_backend()` mutates the graph in-place.
@@ -709,7 +705,7 @@ Run the export script and the ETRecord will be generated as `etrecord.bin`.
709
705
710
706
##### ETDump generation
711
707
712
-
An ETDump is an artifact generated at runtime containing a trace of the model execution. For more information, see [https://pytorch.org/executorch/main/sdk-etdump.html](https://pytorch.org/executorch/main/sdk-etdump.html)
708
+
An ETDump is an artifact generated at runtime containing a trace of the model execution. For more information, see [the ETDump docs](../sdk-etdump.md).
713
709
714
710
Include the ETDump header in your code.
715
711
```cpp
@@ -779,7 +775,7 @@ This prints the performance data in a tabular format in “inspector_out.txt”,
<ahref="../_static/img/llm_manual_print_data_tabular.png"target="_blank">View in full size</a>
781
777
782
-
To learn more about the Inspector and the rich functionality it provides, see the [Inspector API Reference](https://pytorch.org/executorch/main/sdk-inspector.html).
778
+
To learn more about the Inspector and the rich functionality it provides, see the [Inspector API Reference](../sdk-inspector.md).
783
779
784
780
## Custom Kernels
785
781
With the ExecuTorch custom operator APIs, custom operator and kernel authors can easily bring in their kernel into PyTorch/ExecuTorch.
0 commit comments