Update quant overview for 021 #3845
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/3845
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure
As of commit be9b90c with merge base f184329:
NEW FAILURE - The following job has failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@Jack-Khuu has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
```
from torchao.quantization.quant_api import Int8DynActInt4WeightQuantizer

model = Int8DynActInt4WeightQuantizer(precision=torch_dtype, groupsize=group_size).quantize(model)
```
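For readers who want a concrete picture of what a `groupsize` parameter controls, here is a minimal pure-Python sketch of symmetric group-wise 4-bit weight quantization. It is an illustration under stated assumptions, not torchao's implementation: the function names `quantize_int4_groupwise` and `dequantize_int4_groupwise` are hypothetical, and the actual quantizer may use a different rounding mode, integer range, or zero-point handling.

```python
from typing import List, Tuple


def quantize_int4_groupwise(
    weights: List[float], group_size: int
) -> Tuple[List[int], List[float]]:
    """Hypothetical helper: symmetric 4-bit quantization, one scale per group."""
    if group_size <= 0 or len(weights) % group_size != 0:
        raise ValueError("len(weights) must be a positive multiple of group_size")
    qmin, qmax = -8, 7  # signed 4-bit integer range (assumed for this sketch)
    quantized: List[int] = []
    scales: List[float] = []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        max_abs = max(abs(w) for w in group)
        # Map the largest magnitude in the group onto qmax; avoid a zero scale.
        scale = max_abs / qmax if max_abs > 0 else 1.0
        scales.append(scale)
        for w in group:
            quantized.append(max(qmin, min(qmax, round(w / scale))))
    return quantized, scales


def dequantize_int4_groupwise(
    quantized: List[int], scales: List[float], group_size: int
) -> List[float]:
    """Recover approximate float weights using each value's group scale."""
    return [q * scales[i // group_size] for i, q in enumerate(quantized)]


weights = [0.7, -0.3, 0.0, 0.1, 1.4, -1.4, 0.2, 0.0]
quantized, scales = quantize_int4_groupwise(weights, group_size=4)
restored = dequantize_int4_groupwise(quantized, scales, group_size=4)
```

A smaller group size means more scales to store but a tighter fit per group, so the reconstruction error for each weight stays within half of its own group's scale.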
nit: do we want to briefly mention how it's lowered as well, so people have a more concrete picture of when the quantization happens?
left one comment
@Jack-Khuu has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@Jack-Khuu merged this pull request in f48f392.
@pytorchbot cherry-pick --onto release/0.2 -c docs
Cherry picking #3845
The cherry pick PR is at #3857
Details for Dev Infra team: raised by workflow job
Summary:
Pull Request resolved: #3845
Reviewed By: Gasoonjia
Differential Revision: D58176137
Pulled By: Jack-Khuu
fbshipit-source-id: bdaf01a8fb66ba3333c3b6d7802c3bb02b20c4a5
(cherry picked from commit f48f392)
Co-authored-by: Jack-Khuu <[email protected]>
No description provided.