Commit aa3f22c
Summary:
Skipping the `aten.index_put` op in Core ML delegation was a workaround that came at the cost of partitioning the Llama model into 13 pieces.
For better performance, we prefer to delegate the whole model to Core ML. Since Core ML has added the [necessary support](apple/coremltools#2190), it is time to revert this workaround.
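To illustrate why skipping a single op fragments delegation, here is a minimal sketch. It assumes a linear op sequence and a hypothetical helper `delegated_segments`; it is not the actual partitioner code or the real Llama op list, only a toy model of the effect described above.

```python
from itertools import groupby


def delegated_segments(ops, skipped):
    """Split a linear op sequence into maximal runs of ops that are not skipped.

    Hypothetical helper for illustration only; a real partitioner works on a graph.
    """
    return [
        list(run)
        for is_skipped, run in groupby(ops, key=lambda op: op in skipped)
        if not is_skipped
    ]


# Toy op sequence (an assumption, not the real Llama graph).
ops = [
    "aten.mm",
    "aten.add",
    "aten.index_put",
    "aten.mm",
    "aten.index_put",
    "aten.softmax",
]

# With the workaround: every skipped op cuts the delegated region in two.
with_workaround = delegated_segments(ops, skipped={"aten.index_put"})

# Without the workaround: the whole sequence stays in one delegated piece.
without_workaround = delegated_segments(ops, skipped=set())

print(len(with_workaround), len(without_workaround))
```

In this toy sequence each skipped op adds one more delegated piece, which is the kind of fragmentation the revert is meant to avoid.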
Pull Request resolved: #2975
Reviewed By: kirklandsign
Differential Revision: D56002979
Pulled By: cccclai
fbshipit-source-id: e7a7c8c43706cb57eba3e6f720b3d713bec5065b
(cherry picked from commit 7d4bafc)
Co-authored-by: yifan_shen3 <[email protected]>
1 parent 27e1a62 commit aa3f22c
1 file changed, 0 additions and 3 deletions.

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 663 | 663 | |
| 664 | 664 | |
| 665 | 665 | |
| 666 | | - |
| 667 | | - |
| 668 | | - |
| 669 | 666 | |
| 670 | 667 | |
| 671 | 668 | |