[ET-VK] Enable int8 tiled compute shader to be used with buffer tensors #10302

Merged

facebook-github-bot merged 5 commits into gh/SS-JIA/214/base from gh/SS-JIA/214/head

Apr 23, 2025

Contributor

SS-JIA commented Apr 18, 2025 (edited)

Stack from ghstack (oldest at bottom):

Context

As title. Allow the optimized int8 tiled compute shader to be usable for buffer-backed tensors as well.

Changes

* Generate buffer variants for the int8 linear tiled shader.
* Force the scales tensor to always be a buffer, which reduces the number of shader variants that need to be generated.
* Generate an additional variant that computes only 1 output row.
* Do not require the number of output rows to be an exact multiple of 4 or 6 to use the tiled implementation.
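
To illustrate the last two items, here is a minimal Python sketch of how a dispatcher might choose a tile height and cover a row count that is not a multiple of it. It is not taken from this PR: the function names (`ceil_div`, `pick_tile_rows`, `tile_row_ranges`) and the "least wasted rows" heuristic are assumptions made for illustration only. The only facts drawn from the description are that tile heights of 4 and 6 exist, that a 1-row variant was added, and that exact divisibility is no longer required.

```python
def ceil_div(a: int, b: int) -> int:
    """Integer ceiling division for non-negative a and positive b."""
    return (a + b - 1) // b


def pick_tile_rows(out_rows: int) -> int:
    """Hypothetical heuristic for how many output rows one invocation computes."""
    if out_rows == 1:
        return 1  # single-row variant
    # Assumption: prefer the tile height that wastes fewer rows in the
    # final, partially filled tile; ties go to the taller tile.
    waste6 = ceil_div(out_rows, 6) * 6 - out_rows
    waste4 = ceil_div(out_rows, 4) * 4 - out_rows
    return 6 if waste6 <= waste4 else 4


def tile_row_ranges(out_rows: int, tile_rows: int) -> list[tuple[int, int]]:
    """Half-open [start, end) row range per tile; the last tile is clamped."""
    return [
        (start, min(start + tile_rows, out_rows))
        for start in range(0, out_rows, tile_rows)
    ]


if __name__ == "__main__":
    for rows in (1, 5, 7, 8, 12):
        tile = pick_tile_rows(rows)
        print(rows, tile, tile_row_ranges(rows, tile))
```

The clamp in `tile_row_ranges` plays the role of a bounds check in the shader: the final tile simply processes fewer rows instead of forcing a fallback to a non-tiled implementation.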

Differential Revision: D73276277


          [ET-VK] Enable int8 tiled compute shader to be used with buffer tensors

55cb072

## Context

As title. Allow the optimized int8 tiled compute shader to be usable for buffer-backed tensors as well.

## Changes

* Generate buffer variants for the int8 linear tiled shader
* Force the scales tensor to always be a buffer to reduce the number of shader variants that need to be generated.
* Generate an additional variant that computes only 1 output row
* Do not require output rows to be an exact multiple of 4 or 6 to use the tiled implementation

Differential Revision: [D73276277](https://our.internmc.facebook.com/intern/diff/D73276277/)

[ghstack-poisoned]

SS-JIA added a commit that referenced this pull request


          [ET-VK] Enable int8 tiled compute shader to be used with buffer tensors

ghstack-source-id: 279008193
Pull Request resolved: #10302

pytorch-bot bot commented Apr 18, 2025 (edited)

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10302

❌ 2 New Failures, 1 Pending

As of commit fa29ad0 with merge base 334af4a:

NEW FAILURES - The following jobs have failed:

* Lint / lintrunner / linux-job (gh): Lint for backends/cadence/aot/tests/test_replace_ops_passes.py
* pull / unittest-editable / linux / linux-job (gh): backends/xnnpack/test/ops/test_conv1d.py::TestConv1d::test_qs8_conv1d_batchnorm_seq

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label

Contributor

facebook-github-bot commented Apr 18, 2025

This pull request was exported from Phabricator. Differential Revision: D73276277

facebook-github-bot added the fb-exported label


          Update on "[ET-VK] Enable int8 tiled compute shader to be used with buffer tensors"

191e6c4

SS-JIA mentioned this pull request

[ET-VK] Add coop shader for int8 linear #10304

Merged


          Update on "[ET-VK] Enable int8 tiled compute shader to be used with buffer tensors"

a89de6c

Contributor

facebook-github-bot commented Apr 23, 2025

This pull request was exported from Phabricator. Differential Revision: D73276277


          Update on "[ET-VK] Enable int8 tiled compute shader to be used with buffer tensors"

ba2c565


          Update on "[ET-VK] Enable int8 tiled compute shader to be used with buffer tensors"

fa29ad0

yipjustin approved these changes

SS-JIA added the release notes: vulkan label

facebook-github-bot merged commit 9217e37 into gh/SS-JIA/214/base

83 of 87 checks passed

facebook-github-bot deleted the gh/SS-JIA/214/head branch

April 23, 2025 22:04

facebook-github-bot temporarily deployed to cherry-pick-bot with GitHub Actions (Inactive)

April 23, 2025 22:04

pytorchbot mentioned this pull request

[ET-VK] Enable int8 tiled compute shader to be used with buffer tensors #10415

Merged

Labels

CLA Signed fb-exported release notes: vulkan