See what happens if we export with max_seq_len (#11611)
Summary:
Testing suggests that when we export with only `tokens`, the token dimension in the dynamic shape cannot have a max of `max_seq_len`.
However, when we export with both `tokens` and `input_pos`, the token dimension max can be set to `max_seq_len`.
This diff fixes two things:
* Change the dynamic shape based on which inputs are exported.
* Change the pte's metadata `get_max_seq_len` and `get_max_context_len` based on the token dimension max value in the dynamic shape.
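
A minimal sketch of the two changes, in plain Python with no ExecuTorch or PyTorch dependency. The function name `plan_export`, the fallback of `max_seq_len - 1` for the tokens-only case, and the rule of capping both metadata values at the token dimension max are all assumptions made for illustration. The summary only says that `max_seq_len` does not work as the max when exporting with `tokens` alone, and that the metadata follows the token dimension max.

```python
from typing import Dict, Tuple


def plan_export(
    input_names: Tuple[str, ...],
    max_seq_len: int,
    max_context_len: int,
) -> Dict[str, int]:
    """Pick the token-dimension max and matching metadata (hypothetical helper)."""
    if "tokens" not in input_names:
        raise ValueError("export requires a `tokens` input")

    if "input_pos" in input_names:
        # With both `tokens` and `input_pos`, the full max_seq_len is usable.
        token_dim_max = max_seq_len
    else:
        # Assumption: fall back to one less than max_seq_len when only
        # `tokens` is exported. The real fallback value is not stated above.
        token_dim_max = max_seq_len - 1

    # In a real export this value would be the `max` of the dynamic dimension,
    # e.g. torch.export.Dim("token_dim", max=token_dim_max).
    return {
        "token_dim_max": token_dim_max,
        # Assumption: metadata never advertises more than the exported max.
        "get_max_seq_len": min(max_seq_len, token_dim_max),
        "get_max_context_len": min(max_context_len, token_dim_max),
    }


if __name__ == "__main__":
    print(plan_export(("tokens", "input_pos"), 128, 128))
    print(plan_export(("tokens",), 128, 128))
```

Keeping the metadata derived from the same value as the dynamic shape means a runtime that reads `get_max_seq_len` cannot request more tokens than the exported program accepts.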
Differential Revision: D76530379
Pulled By: larryliu0820