[MPS] Add index.Tensor and aten.logical_not #3221

DenisVieriu97 · 2024-04-22T23:12:48Z

Add missing llama ops for MPS delegate:

index.Tensor
logical_not

index.put works correctly when generating the first token, but gives incorrect results on the second token, so it remains disabled.
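
For readers unfamiliar with the two ops, their semantics can be sketched in pure Python on nested lists. This is only an illustration under stated assumptions: the helper names `index_tensor` and `logical_not` are hypothetical, the sketch covers just a single 1-D integer index along the first dimension, and it says nothing about how the MPS delegate actually implements the ops.

```python
from typing import List, Sequence


def index_tensor(rows: Sequence[Sequence[float]],
                 indices: Sequence[int]) -> List[List[float]]:
    """Gather rows by integer index, like x[idx] with a 1-D integer index.

    Hypothetical helper for illustration; not the MPS delegate code.
    """
    n = len(rows)
    out = []
    for i in indices:
        # Negative indices count from the end, as in Python and PyTorch.
        if not -n <= i < n:
            raise IndexError(f"index {i} is out of range for {n} rows")
        out.append(list(rows[i]))
    return out


def logical_not(values: Sequence[float]) -> List[bool]:
    """Elementwise NOT: zero maps to True, any nonzero value maps to False."""
    return [v == 0 for v in values]
```

For example, `index_tensor([[1, 2], [3, 4], [5, 6]], [2, 0])` gathers the third row followed by the first, and `logical_not([0, 1])` flips the truthiness of each element.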

Summary of changes:

Adds missing llama2 ops
Adds support for launching Metal kernels instead of MPSGraph ops (when MPSGraph does not support an op)

cc @cccclai , @shoumikhin

pytorch-bot · 2024-04-22T23:12:51Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/3221

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit 2b833bd with merge base 1eaed2b:

NEW FAILURE - The following job has failed:

Apple / test-demo-ios / macos-job (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 65

FLAKY - The following job failed but was likely due to flakiness present on trunk:

Apple / upload-frameworks-ios (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

cccclai · 2024-04-23T00:01:30Z

Thank you! The logical op actually comes from decomposing sdpa. If sdpa is not ready yet, you could use the simpler version in #3165, which has fewer ops than decomposing F.scaled_dot_product_attention.

facebook-github-bot · 2024-04-23T00:01:43Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

DenisVieriu97 · 2024-04-23T05:02:24Z

Apple / test-demo-ios / macos-job (pull_request) is failing because it uses the prebuilt mpsdelegate static lib stored in AWS (https://ossci-ios.s3.amazonaws.com/executorch/, "mps_backend": ["sha256": "97db0fd2b458ff4dae3f4e927d417b4ce88ef3bd4114759abe8372a05bac84ad"]), which no longer matches the runtime changes made in this PR. The job runs the new AOT changes against the older runtime, and the two are no longer compatible. I've checked locally, and Apple / test-demo-ios / macos-job passes with the new runtime.
Any ideas on how to fix this? (cc @cccclai, @shoumikhin)
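
One way to confirm locally which prebuilt artifact a job is picking up is to hash the file and compare it with the pinned digest. The following is a minimal sketch using only the Python standard library; the function names and any file path passed in are hypothetical, and it does not reflect how the CI job itself validates the download.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_pinned(path: str, pinned_sha256: str) -> bool:
    """Check a local artifact against a pinned digest (case-insensitive)."""
    return sha256_of_file(path) == pinned_sha256.strip().lower()
```

Calling `matches_pinned` with the path of a locally built static lib (a hypothetical path) and the `sha256` value quoted above would return `False` whenever the local build differs from the pinned prebuilt one.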

DenisVieriu97 · 2024-04-23T06:40:56Z

> Thank you! The logical op actually comes from decomposing sdpa. If sdpa is not ready yet, you could use the simpler version in #3165, which has fewer ops than decomposing F.scaled_dot_product_attention.

Thank you @cccclai . I'll take a look and create a new PR for those changes

cccclai · 2024-04-23T06:44:26Z

> > Thank you! The logical op actually comes from decomposing sdpa. If sdpa is not ready yet, you could use the simpler version in #3165, which has fewer ops than decomposing F.scaled_dot_product_attention.
>
> Thank you @cccclai . I'll take a look and create a new PR for those changes

Oh sorry, I pointed to the wrong PR. It should be #3037. Regarding the test failure, @shoumikhin probably knows more. I will ping him tomorrow.

facebook-github-bot · 2024-04-23T18:18:42Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-04-23T22:57:37Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-04-24T01:26:23Z

@cccclai merged this pull request in 02a6b66.

cccclai · 2024-04-24T01:45:55Z

@pytorchbot cherry-pick --onto release/0.2 -c regression

Summary:
Add missing llama ops for MPS delegate:

- `index.Tensor`
- `logical_not`

`index.put` works correctly for generating 1 token, but gives incorrect results on 2nd token. This remains disabled.

Summary of changes:

- Adds missing llama2 ops
- Adds support for launching Metal kernels instead of MPSGraph ops (if MPSGraph doesn't have the support)

cc cccclai , shoumikhin

Pull Request resolved: #3221

Reviewed By: shoumikhin

Differential Revision: D56447710

Pulled By: cccclai

fbshipit-source-id: 778a485df5e67d1afd006b42f07b69c8a3961223

(cherry picked from commit 02a6b66)

pytorchbot · 2024-04-24T01:50:20Z

Cherry picking #3221

The cherry pick PR is at #3267 and it is recommended to link a regression cherry pick PR with an issue

Details for Dev Infra team

Raised by workflow job

Summary:
Add missing llama ops for MPS delegate:

- `index.Tensor`
- `logical_not`

`index.put` works correctly for generating 1 token, but gives incorrect results on 2nd token. This remains disabled.

Summary of changes:

- Adds missing llama2 ops
- Adds support for launching Metal kernels instead of MPSGraph ops (if MPSGraph doesn't have the support)

cc cccclai , shoumikhin

Pull Request resolved: #3221

Reviewed By: shoumikhin

Differential Revision: D56447710

Pulled By: cccclai

fbshipit-source-id: 778a485df5e67d1afd006b42f07b69c8a3961223

(cherry picked from commit 02a6b66)

Co-authored-by: Denis Vieriu

facebook-github-bot added the CLA Signed label Apr 22, 2024

DenisVieriu97 changed the title ~~[MPS] Add index.Tensor and logical_not~~ [MPS] Add index.Tensor and aten.logical_not Apr 22, 2024

cccclai approved these changes Apr 23, 2024

DenisVieriu97 force-pushed the dev/denis/missing_llama_ops branch from d13bb4e to 1c9fda5 Compare April 23, 2024 01:07

DenisVieriu97 added 4 commits April 23, 2024 15:54

Add index.Tensor and logical_not

e22cbc3

Remove prints

653a682

More cleanup

26f80e7

Fix lint

2b833bd

DenisVieriu97 force-pushed the dev/denis/missing_llama_ops branch from 1c9fda5 to 2b833bd Compare April 23, 2024 22:54

facebook-github-bot closed this in 02a6b66 Apr 24, 2024

facebook-github-bot added the Merged label Apr 24, 2024

This was referenced Apr 25, 2024

fix llama readme #3339

Closed

disclaimer #3376

Closed
