[CI] remove redundant checks in test_eager_matches_sdpa_inference
#36740
What does this PR do?
Follow-up to #36650
Removes redundant test cases in test_eager_matches_sdpa_inference, related to output_attentions.
SDPA has special attention mask preparation (e.g. see here) but, when we pass output_attentions, SDPA reverts to eager attention. That means that mask preparation for SDPA + output_attentions must be the same as eager. In #30652 we added a for loop to test the cases with output_attentions in (True, False), resulting in 24 new subtests.
That's testing overkill: all we need to confirm is that we're preparing the attention mask correctly with output_attentions=True. This PR removes all but 1 of the redundant tests.
@ydshieh with this PR, both CPU and GPU CI are faster than in v4.49.0 🤗
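For illustration, the reduction described above can be sketched as follows. This is not the code from the PR: the axis names and values (DTYPES, PADDING_SIDES, USE_MASK, ENABLE_KERNELS) and the helper build_cases are hypothetical, chosen only so that the base grid has 24 entries like the number quoted above. The idea is to keep the full grid for output_attentions=False and append a single output_attentions=True case instead of duplicating the whole grid.

```python
from itertools import product

# Hypothetical parameter axes (assumption): 3 * 2 * 2 * 2 = 24 combinations.
DTYPES = ("float16", "bfloat16", "float32")
PADDING_SIDES = ("left", "right")
USE_MASK = (True, False)
ENABLE_KERNELS = (True, False)


def build_cases():
    """Return (dtype, padding_side, use_mask, enable_kernels, output_attentions) tuples."""
    # Full grid, all with output_attentions=False.
    cases = [
        (dtype, side, use_mask, kernels, False)
        for dtype, side, use_mask, kernels in product(
            DTYPES, PADDING_SIDES, USE_MASK, ENABLE_KERNELS
        )
    ]
    # One extra case with output_attentions=True and a mask, enough to check
    # that mask preparation matches the eager path.
    cases.append(("float32", "left", True, False, True))
    return cases


cases = build_cases()
print(len(cases))  # 25, versus 48 if the whole grid were doubled
```

A list like this could then be fed to whatever parametrization mechanism the test suite uses; the single appended case keeps the mask-preparation check while avoiding the other 23 duplicates.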