We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 0fc0939 commit 86024c3Copy full SHA for 86024c3
1 file changed
docs/source/concept_guides/quantization.mdx
@@ -161,7 +161,7 @@ this works well with activations.
161
162
To effectively quantize a model to `int8`, the steps to follow are:
163
164
-1. Choose which operators to quantize. Good operators to quantize are the one dominating it terms of computation time,
+1. Choose which operators to quantize. Good operators to quantize are the one dominating in terms of computation time,
165
for instance linear projections and matrix multiplications.
166
2. Try post-training dynamic quantization, if it is fast enough stop here, otherwise continue to step 3.
167
3. Try post-training static quantization which can be faster than dynamic quantization but often with a drop in terms of
0 commit comments