Skip to content

[ET-VK] Add ANY_STORAGE support to embedding#18273

Merged
SS-JIA merged 4 commits intogh/SS-JIA/473/origfrom
gh/SS-JIA/474/orig
Mar 18, 2026
Merged

[ET-VK] Add ANY_STORAGE support to embedding#18273
SS-JIA merged 4 commits intogh/SS-JIA/473/origfrom
gh/SS-JIA/474/orig

Conversation

@pytorchbot
Copy link
Collaborator

This PR was created by the merge bot to help merge the original PR into the main branch.
ghstack PR number: #18057 by @SS-JIA
^ Please use this as the source of truth for the PR details, comments, and reviews
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/SS-JIA/474/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/SS-JIA/474/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/SS-JIA/473/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/SS-JIA/474/orig
Differential Revision: D95970161
@diff-train-skip-merge

Modernize embedding to support ANY_STORAGE. Add buffer and texture
shader variants using BufferMetadata/TextureMetadata with indexing.glslh.
Unify new dispatch path with add_storage_type_suffix and
graph.meta_ubo(). Legacy channels-packed texture path retained for
backward compatibility.

Pull Request resolved: #18057
ghstack-source-id: 353546689
@exported-using-ghexport

Differential Revision: [D95970161](https://our.internmc.facebook.com/intern/diff/D95970161/)
@pytorchbot pytorchbot requested a review from SS-JIA as a code owner March 18, 2026 01:48
@pytorch-bot
Copy link

pytorch-bot bot commented Mar 18, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18273

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ssjia added 3 commits March 17, 2026 21:51
Modernize argmax and argmin to support ANY_STORAGE via the
add_reduce_per_row_node dispatch path. Buffer shader uses
BufferMetadata with indexing.glslh. Custom workgroup sizing retained
for cooperative row-reduction algorithm with shared memory.

Pull Request resolved: #18058
ghstack-source-id: 353546687
@exported-using-ghexport

Differential Revision: [D95970165](https://our.internmc.facebook.com/intern/diff/D95970165/)
Pull Request resolved: #18059

Add missing operators needed for Parakeet TDT model support:

- New symint ops: sym_sub, sym_floordiv, sym_mul in SymIntOps.cpp;
  register operator.floordiv and operator.mul as ephemeral ops in
  op_registry.py
- New tensor ops: bitwise_not (via unary_op shader with uint8 DTYPE),
  logical_and (alias for bitwise_and dispatch)
- Improve _to_copy: expand dtype support to FP_INT_BOOL_T and use
  pick_io_storage_fn to restrict to CONTIGUOUS_BUFFER for non-fp
  conversions
- Fix where resize: compute output shape via broadcast across all tensor
  inputs instead of always using the second input's shape
- Add symint support to split: use extract_int_or_symint_list instead of
  get_int_list in resize_split_node and split_with_sizes_copy_default
- Mark scalar_tensor as supporting resize
ghstack-source-id: 353546692
@exported-using-ghexport

Differential Revision: [D95970159](https://our.internmc.facebook.com/intern/diff/D95970159/)
…linear ops

Pull Request resolved: #18061

Wire bias through the q4gsw and dq8ca_q4gsw quantized linear operators.
Add add_bias_to_out_tile() helper in the output tile computation header and call
it from all three shader variants (tiled, coop, dq8ca_tiled). Remove the bias
guard in the pattern matcher to allow biased linear layers.
ghstack-source-id: 353546681
@exported-using-ghexport

Differential Revision: [D95970172](https://our.internmc.facebook.com/intern/diff/D95970172/)
@SS-JIA SS-JIA merged commit 4dca7ea into gh/SS-JIA/473/orig Mar 18, 2026
26 of 27 checks passed
@SS-JIA SS-JIA deleted the gh/SS-JIA/474/orig branch March 18, 2026 01:52
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 18, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants