[ET-VK] Implement missing Vulkan operators for Parakeet TDT model by SS-JIA · Pull Request #18059 · pytorch/executorch

SS-JIA · 2026-03-10T17:01:27Z

Stack from ghstack (oldest at bottom):

Add missing operators needed for Parakeet TDT model support:

New symint ops: sym_sub, sym_floordiv, sym_mul in SymIntOps.cpp;
register operator.floordiv and operator.mul as ephemeral ops in
op_registry.py
New tensor ops: bitwise_not (via unary_op shader with uint8 DTYPE),
logical_and (alias for bitwise_and dispatch)
Improve _to_copy: expand dtype support to FP_INT_BOOL_T and use
pick_io_storage_fn to restrict to CONTIGUOUS_BUFFER for non-fp
conversions
Fix where resize: compute output shape via broadcast across all tensor
inputs instead of always using the second input's shape
Add symint support to split: use extract_int_or_symint_list instead of
get_int_list in resize_split_node and split_with_sizes_copy_default
Mark scalar_tensor as supporting resize

Differential Revision: D95970159

cc @manuelcandales @digantdesai @cbilgin

Add missing operators needed for Parakeet TDT model support: - New symint ops: sym_sub, sym_floordiv, sym_mul in SymIntOps.cpp; register operator.floordiv and operator.mul as ephemeral ops in op_registry.py - New tensor ops: bitwise_not (via unary_op shader with uint8 DTYPE), logical_and (alias for bitwise_and dispatch) - Improve _to_copy: expand dtype support to FP_INT_BOOL_T and use pick_io_storage_fn to restrict to CONTIGUOUS_BUFFER for non-fp conversions - Fix where resize: compute output shape via broadcast across all tensor inputs instead of always using the second input's shape - Add symint support to split: use extract_int_or_symint_list instead of get_int_list in resize_split_node and split_with_sizes_copy_default - Mark scalar_tensor as supporting resize Differential Revision: [D95970159](https://our.internmc.facebook.com/intern/diff/D95970159/) [ghstack-poisoned]

pytorch-bot · 2026-03-10T17:01:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18059

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 11 Cancelled Jobs, 1 Unrelated Failure

As of commit d6d9825 with merge base 22174fa ():

NEW FAILURES - The following jobs have failed:

Build Presets / windows (pybind) / build (gh)
An unexpected error has occurred. Conda has prepared the above report.
pull / unittest-editable / macos / macos-job (gh)
export/tests/test_target_recipes.py::TestTargetRecipes::test_mv3_model

CANCELLED JOBS - The following jobs were cancelled. Please retry:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / test-moshi-linux / linux-job (gh) (matched linux rule in flaky-rules.json)
E: Failed to fetch http://security.ubuntu.com/ubuntu/pool/main/libs/libssh/libssh-gcrypt-4_0.9.6-2ubuntu0.22.04.6_amd64.deb 404 Not Found [IP: 185.125.190.83 80]

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-03-10T17:05:09Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

…T model" Add missing operators needed for Parakeet TDT model support: - New symint ops: sym_sub, sym_floordiv, sym_mul in SymIntOps.cpp; register operator.floordiv and operator.mul as ephemeral ops in op_registry.py - New tensor ops: bitwise_not (via unary_op shader with uint8 DTYPE), logical_and (alias for bitwise_and dispatch) - Improve _to_copy: expand dtype support to FP_INT_BOOL_T and use pick_io_storage_fn to restrict to CONTIGUOUS_BUFFER for non-fp conversions - Fix where resize: compute output shape via broadcast across all tensor inputs instead of always using the second input's shape - Add symint support to split: use extract_int_or_symint_list instead of get_int_list in resize_split_node and split_with_sizes_copy_default - Mark scalar_tensor as supporting resize Differential Revision: [D95970159](https://our.internmc.facebook.com/intern/diff/D95970159/) cc manuelcandales digantdesai cbilgin [ghstack-poisoned]

Pull Request resolved: #18059 Add missing operators needed for Parakeet TDT model support: - New symint ops: sym_sub, sym_floordiv, sym_mul in SymIntOps.cpp; register operator.floordiv and operator.mul as ephemeral ops in op_registry.py - New tensor ops: bitwise_not (via unary_op shader with uint8 DTYPE), logical_and (alias for bitwise_and dispatch) - Improve _to_copy: expand dtype support to FP_INT_BOOL_T and use pick_io_storage_fn to restrict to CONTIGUOUS_BUFFER for non-fp conversions - Fix where resize: compute output shape via broadcast across all tensor inputs instead of always using the second input's shape - Add symint support to split: use extract_int_or_symint_list instead of get_int_list in resize_split_node and split_with_sizes_copy_default - Mark scalar_tensor as supporting resize ghstack-source-id: 353546692 @exported-using-ghexport Differential Revision: [D95970159](https://our.internmc.facebook.com/intern/diff/D95970159/)

pytorch-bot bot added the module: vulkan Issues related to the Vulkan delegate and code under backends/vulkan/ label Mar 10, 2026

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 10, 2026

meta-codesync bot added fb-exported meta-exported labels Mar 10, 2026

ssjia added 4 commits March 11, 2026 09:52

SS-JIA mentioned this pull request Mar 16, 2026

[ET-VK] Fix exponential blowup in tag_memory_meta_pass repset tracing #18207

Merged

manuelcandales approved these changes Mar 17, 2026

View reviewed changes

ssjia added 2 commits March 17, 2026 11:27

meta-codesync bot merged commit 8539c47 into gh/SS-JIA/476/base Mar 18, 2026
204 of 220 checks passed

meta-codesync bot deleted the gh/SS-JIA/476/head branch March 18, 2026 01:48

meta-codesync bot temporarily deployed to cherry-pick-bot March 18, 2026 01:48 Inactive

pytorchbot mentioned this pull request Mar 18, 2026

[ET-VK] Implement missing Vulkan operators for Parakeet TDT model #18275

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ET-VK] Implement missing Vulkan operators for Parakeet TDT model#18059

[ET-VK] Implement missing Vulkan operators for Parakeet TDT model#18059
meta-codesync[bot] merged 8 commits intogh/SS-JIA/476/basefrom
gh/SS-JIA/476/head

SS-JIA commented Mar 10, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Mar 10, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SS-JIA commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18059

❌ 2 New Failures, 11 Cancelled Jobs, 1 Unrelated Failure

Uh oh!

github-actions bot commented Mar 10, 2026

This PR needs a release notes: label

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SS-JIA commented Mar 10, 2026 •

edited

Loading

pytorch-bot bot commented Mar 10, 2026 •

edited

Loading

This PR needs a `release notes:` label