4b embedding quantizer #3081

mikekgfb · 2024-04-17T05:41:52Z

Summary: 4b embedding quantizer

Differential Revision: D56229021

pytorch-bot · 2024-04-17T05:41:57Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/3081

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 74b78e7 with merge base 1f4b631 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-04-17T05:42:01Z

This pull request was exported from Phabricator. Differential Revision: D56229021

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

facebook-github-bot · 2024-04-17T06:10:40Z

This pull request was exported from Phabricator. Differential Revision: D56229021

facebook-github-bot · 2024-04-17T06:10:48Z

This pull request was exported from Phabricator. Differential Revision: D56229021

larryliu0820 · 2024-04-17T06:21:17Z

examples/models/llama2/quantize.py

+                self.weight, self.scales, None, 0, 0, indices, dtype=self.dtype
+            )
+        else:  # 4bit packed
+            return torch.ops.llama_quantized.embedding_4bit.dtype(


It seems here it should be quantized_decomposed::embedding_4bit

larryliu0820 · 2024-04-17T06:27:27Z

examples/models/llama2/quantize.py

            )


-class EmbeddingOnlyInt8QuantHandler:
-    def __init__(self, mod, *, bitwidth: int = 8, group_size: Optional[int] = None):
+class EmbeddingOnlyInt8QuantHandler(QuantHandler):


Should we rename this class? Since it's not int8 only anymore.

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

facebook-github-bot · 2024-04-17T06:38:01Z

This pull request was exported from Phabricator. Differential Revision: D56229021

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 17, 2024

facebook-github-bot added the fb-exported label Apr 17, 2024

mikekgfb requested review from larryliu0820 and manuelcandales April 17, 2024 05:43

larryliu0820 approved these changes Apr 17, 2024

View reviewed changes

facebook-github-bot pushed a commit that referenced this pull request Apr 17, 2024

4b embedding quantizer (#3081)

2f18427

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

facebook-github-bot force-pushed the export-D56229021 branch from 8699dc9 to 2f18427 Compare April 17, 2024 06:10

mikekgfb pushed a commit that referenced this pull request Apr 17, 2024

4b embedding quantizer (#3081)

b13d167

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

mikekgfb force-pushed the export-D56229021 branch from 2f18427 to b13d167 Compare April 17, 2024 06:10

larryliu0820 reviewed Apr 17, 2024

View reviewed changes

4b embedding quantizer (#3081)

74b78e7

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

facebook-github-bot force-pushed the export-D56229021 branch from b13d167 to 74b78e7 Compare April 17, 2024 06:37

facebook-github-bot pushed a commit that referenced this pull request Apr 17, 2024

4b embedding quantizer (#3081)

3351e42

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

mikekgfb pushed a commit that referenced this pull request Apr 17, 2024

4b embedding quantizer (#3081)

2ab96e9

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

larryliu0820 pushed a commit that referenced this pull request Apr 17, 2024

4b embedding quantizer (#3081)

5929360

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

larryliu0820 pushed a commit that referenced this pull request Apr 17, 2024

4b embedding quantizer (#3081)

0da7b40

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

manuelcandales approved these changes Apr 18, 2024

View reviewed changes

larryliu0820 pushed a commit that referenced this pull request Apr 18, 2024

4b embedding quantizer (#3081)

6b3b722

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

manuelcandales closed this Apr 18, 2024

manuelcandales added a commit to manuelcandales/executorch-1 that referenced this pull request Apr 18, 2024

4b embedding quantizer (pytorch#3081)

3bc8f16

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

larryliu0820 pushed a commit that referenced this pull request Apr 19, 2024

4b embedding quantizer (#3081)

f7c1459

Summary: 4b embedding quantizer Reviewed By: larryliu0820 Differential Revision: D56229021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

4b embedding quantizer #3081

4b embedding quantizer #3081

mikekgfb commented Apr 17, 2024

pytorch-bot bot commented Apr 17, 2024 •

edited

Loading

facebook-github-bot commented Apr 17, 2024

facebook-github-bot commented Apr 17, 2024

facebook-github-bot commented Apr 17, 2024

larryliu0820 Apr 17, 2024

larryliu0820 Apr 17, 2024

facebook-github-bot commented Apr 17, 2024

4b embedding quantizer #3081

4b embedding quantizer #3081

Conversation

mikekgfb commented Apr 17, 2024

pytorch-bot bot commented Apr 17, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/3081

✅ No Failures

facebook-github-bot commented Apr 17, 2024

facebook-github-bot commented Apr 17, 2024

facebook-github-bot commented Apr 17, 2024

larryliu0820 Apr 17, 2024

Choose a reason for hiding this comment

larryliu0820 Apr 17, 2024

Choose a reason for hiding this comment

facebook-github-bot commented Apr 17, 2024

pytorch-bot bot commented Apr 17, 2024 •

edited

Loading