Qualcomm AI Engine Direct - Enable per channel linear op by chunit-quic · Pull Request #2822 · pytorch/executorch

chunit-quic · 2024-04-03T01:41:14Z

- Add per-channel weight quantization for the linear op
- Bias quantization for the per-channel weight linear op is not supported yet
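
For readers unfamiliar with the term, here is a minimal sketch of per-channel weight quantization for a linear op. It is not the code from this PR or from the Qualcomm backend. The function names, the symmetric int8 scheme, and the sample values are all assumptions chosen for illustration. Each output channel (row of the weight matrix) gets its own scale, and the bias stays in floating point, mirroring the limitation noted above.

```python
# Illustrative only: symmetric per-channel int8 quantization of a linear weight.
# Not the Qualcomm backend implementation; all names here are hypothetical.

def quantize_per_channel(weight, qmin=-127, qmax=127):
    """Quantize each output channel (row) of `weight` with its own scale."""
    scales, q_weight = [], []
    for row in weight:
        max_abs = max(abs(v) for v in row)
        # An all-zero row gets scale 1.0 to avoid dividing by zero.
        scale = max_abs / qmax if max_abs > 0 else 1.0
        q_row = [max(qmin, min(qmax, round(v / scale))) for v in row]
        scales.append(scale)
        q_weight.append(q_row)
    return q_weight, scales


def linear_per_channel(x, q_weight, scales, bias):
    """Compute y[j] = scales[j] * dot(q_weight[j], x) + bias[j].

    The bias is kept in float, since it is not quantized in this sketch.
    """
    return [
        s * sum(q * xi for q, xi in zip(q_row, x)) + b
        for q_row, s, b in zip(q_weight, scales, bias)
    ]


# Two output channels with very different value ranges.
weight = [[0.5, -1.0, 0.25], [10.0, 20.0, -40.0]]
bias = [0.1, -0.2]

q_weight, scales = quantize_per_channel(weight)
y = linear_per_channel([1.0, 2.0, 3.0], q_weight, scales, bias)
```

With a single per-tensor scale, the first row above would share the scale dictated by the second row's much larger values and lose most of its precision. A separate scale per row avoids that.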

pytorch-bot · 2024-04-03T01:41:17Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2822

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 2efac57 with merge base 9fd1a0e:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-04-04T07:00:08Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

cccclai · 2024-04-04T22:18:11Z

Hey, thank you for submitting. Do you mind rebasing? It seems CI was broken, and it's now fixed on the main branch.

haowhsu-quic · 2024-04-09T11:16:53Z

Hi @cccclai, since @chunit-quic is on PTO, I helped rebase this onto the latest mainline.
This PR is important for the incoming LLAMA enablement PR. Could you kindly help merge it? Thank you!

facebook-github-bot · 2024-04-10T18:04:27Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-04-10T19:31:08Z

@cccclai merged this pull request in 554cd27.

facebook-github-bot added the CLA Signed label Apr 3, 2024

chunit-quic changed the title ~~[Qualcomm AI Engine Direct - Enable per channel linear op]~~ Qualcomm AI Engine Direct - Enable per channel linear op Apr 3, 2024

kirklandsign approved these changes Apr 4, 2024

cccclai added the partner: qualcomm label Apr 4, 2024

[Qualcomm AI Engine Direct - Enable per channel linear op]

2efac57

- Add per-channel weight quantization for the linear op
- Bias quantization for the per-channel weight linear op is not supported yet

haowhsu-quic force-pushed the enable_per_channel_linear branch from e3c8e0c to 2efac57 April 9, 2024 11:13

facebook-github-bot closed this in 554cd27 Apr 10, 2024

facebook-github-bot added the Merged label Apr 10, 2024

mergennachin mentioned this pull request Apr 26, 2024

disclaimer #3376

Closed

chunit-quic wants to merge 1 commit into pytorch:main from CodeLinaro:enable_per_channel_linear
