[Executorch][llama] Allow custom sdpa op replacement pass to leverage attention mask
Pull Request resolved: #10285
Previously, we assumed that the custom sdpa op always performs causal attention.
This diff adds an option to the module swap pass so that the custom sdpa op
uses an attention mask instead of causal attention.
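
To illustrate the difference between the two modes, here is a minimal pure-Python sketch of scaled dot-product attention that accepts either a causal flag or an explicit mask. The names `sdpa` and `causal_mask` are hypothetical and are not the ExecuTorch custom op or the pass touched by this diff. The sketch also assumes equal query and key lengths (no KV-cache offset).

```python
import math


def sdpa(q, k, v, mask=None, is_causal=False):
    """Toy scaled dot-product attention over lists of row vectors.

    mask[i][j] is True where query i may attend to key j.
    Hypothetical helper for illustration; not the ExecuTorch op.
    """
    if mask is not None and is_causal:
        raise ValueError("pass either mask or is_causal, not both")
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for i, qi in enumerate(q):
        scores = []
        for j, kj in enumerate(k):
            # Causal mode allows key j only if it is not ahead of query i.
            allowed = (j <= i) if is_causal else (mask is None or mask[i][j])
            s = sum(a * b for a, b in zip(qi, kj)) * scale
            scores.append(s if allowed else float("-inf"))
        # Numerically stable softmax; assumes at least one allowed key per row.
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        out.append([
            sum(w / total * vj[d] for w, vj in zip(weights, v))
            for d in range(len(v[0]))
        ])
    return out


def causal_mask(n):
    """Lower-triangular boolean mask equivalent to causal attention."""
    return [[j <= i for j in range(n)] for i in range(n)]


q = k = v = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
causal_out = sdpa(q, k, v, is_causal=True)
masked_out = sdpa(q, k, v, mask=causal_mask(3))
```

In this sketch, passing a lower-triangular mask reproduces the causal result, while any other mask (for example, one that allows every key) yields a different attention pattern. That extra flexibility is what a mask-based path offers.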
ghstack-source-id: 279292324
@exported-using-ghexport
Differential Revision: [D73222736](https://our.internmc.facebook.com/intern/diff/D73222736/)