[tests] Parameterized test_eager_matches_sdpa_inference
#36650
Conversation
Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. When it is ready for review, please click the "Ready for review" button.
```python
self.skipTest(reason="Model does not support output_attentions")
```

```python
# TODO: if we can also check with `batch_size=1` without being flaky?
for batch_size in [7]:
```
most of the diff in the loop is indentation :)
there are a few changes, to avoid overwriting this large test and, more importantly, to pressure us into standardizing model interfaces (see musicgen notes in this loop)
```python
# TODO: we shouldn't need to do this skip, i.e. the test would be composable from the model tester. CLIP-like
# models have a custom mixin, which we detect to skip this test.
if not any(".ModelTesterMixin" in str(base) for base in self.__class__.__bases__):
    self.skipTest(reason="CLIP-like models have a different `test_eager_matches_sdpa_inference`")
```
the differences here already existed -- this test was being overwritten using the same names.
With the parameterization we get new test names, hence this extra skip
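A minimal, self-contained sketch of how that `__bases__` check distinguishes the two mixins (the CLIP-style mixin name and the module paths below are invented for illustration; only `ModelTesterMixin` matches the real one):

```python
# Sketch of the skip condition: `str(base)` renders as
# "<class 'module.ClassName'>", so the leading "." in the needle ensures a
# custom mixin like "CLIPModelTesterMixin" does NOT match.
class ModelTesterMixin:
    pass

class CLIPModelTesterMixin:  # hypothetical CLIP-style custom mixin
    pass

# set module paths explicitly so the repr is deterministic in this sketch
ModelTesterMixin.__module__ = "tests.test_modeling_common"
CLIPModelTesterMixin.__module__ = "tests.models.clip.test_modeling_clip"

def uses_common_mixin(test_cls):
    # mirrors: any(".ModelTesterMixin" in str(base) for base in self.__class__.__bases__)
    return any(".ModelTesterMixin" in str(base) for base in test_cls.__bases__)

class BertLikeTests(ModelTesterMixin):
    pass

class CLIPLikeTests(CLIPModelTesterMixin):
    pass

print(uses_common_mixin(BertLikeTests))   # → True
print(uses_common_mixin(CLIPLikeTests))   # → False
```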
Ah, I didn't know changing this would make the new test names run ... I trust you on this being needed here.
```diff
-if torch_dtype == "float16" and not is_torch_fp16_available_on_device(torch_device):
+# convert shorthand name to torch.dtype
+if torch_dtype == "fp16":
```
float16 -> fp16 for a shorter test name :)
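For context, a tiny sketch of the shorthand mapping this refers to (the dict and function name below are illustrative; the real test resolves the string to an actual `torch.dtype`):

```python
# Shorthand dtype names keep the parameterized test names short; inside the
# test they are mapped back to real dtypes, e.g. getattr(torch, "float16").
SHORTHAND_TO_DTYPE = {"fp16": "float16", "fp32": "float32", "bf16": "bfloat16"}

def resolve_dtype_name(shorthand):
    return SHORTHAND_TO_DTYPE[shorthand]

print(resolve_dtype_name("fp16"))  # → float16
```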
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Force-pushed from 1bea90c to db015c4 (compare)
```python
# TODO: standardize the interfaces for musicgen models, see other todo in this test
if model.__class__.__name__ == "MusicgenMelodyForConditionalGeneration":
```
do you mean we need this because the new generate tests have new names?
maybe I misunderstand: the change here is to avoid this test being overwritten for musicgen models?
No :) For this specific model, which behaves differently, we have the two usual options:
- overwrite the test as usual
- add this exception
I went with the second to pressure me into having a second look at musicgen, which has many interface issues (same argument names, different meaning and expected shapes) 👀 These interface issues, in turn, are causing me issues whenever we fix generate, tests, ...
I can do a normal test overwrite if you think it's preferable.
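To illustrate the trade-off being discussed (the function and field names below are invented for illustration), the in-test exception pattern looks roughly like this, as opposed to a full overwrite in the model's own test file:

```python
# Hypothetical sketch: one shared test keeps a small, explicit branch for the
# divergent model instead of a full test overwrite in the model's test file.
def prepare_shared_test_inputs(model_class_name, batch_size=7):
    inputs = {"batch_size": batch_size, "interface_workaround": False}
    # TODO-style exception: musicgen melody models use the same argument
    # names with different meanings/expected shapes (per the discussion above)
    if model_class_name == "MusicgenMelodyForConditionalGeneration":
        inputs["interface_workaround"] = True
    return inputs
```

The branch keeps the exception visible in the shared test, which is what "pressures" a later interface standardization, whereas an overwrite would hide it in the model's test file.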
It's ok, I have no strong opinion, just wondering: with this if ... else ..., do musicgen models still run this test successfully without the overwrite? If so, that is nice.
Yes, they pass the test 🙌
ydshieh
left a comment
I only have a nit question
https://github.com/huggingface/transformers/pull/36650/files#r1993288313
I love the idea to make the test name clear in the summary.
I trust you without taking a look myself at the changes in the different test files.
One more nit: for the name in the test, we can make it even shorter by …
What does this PR do?
Problem
`test_eager_matches_sdpa_inference` on `main` is running many nested test cases (48!). This is not only a bad practice but also slows down our workflow: when it breaks, we need to parse the test outputs to see which configuration(s) broke. An example:

Which expands into:

[screenshot of the expanded test output]
Did it crash because of a specific parameterization? Or because we are running many subtests in one test? Answering those questions is not straightforward atm.
Fix
This PR replaces the test's `for` loops with `@parameterized.expand`, making sure the test name immediately identifies the test case. In the process, I've noticed many skips/overwrites are no longer needed. The test is still super ugly, and I've left a few TODOs for whenever we decide to touch the test again.
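The naming mechanics can be sketched with the standard library only (the real test uses `@parameterized.expand` from the `parameterized` package; the parameter values below are illustrative, not the actual grid):

```python
from itertools import product

# Instead of nesting `for` loops inside one opaque test, the parameter grid is
# expanded up front and each combination gets its own test name, so a CI
# failure points directly at the failing configuration.
DTYPES = ["fp16", "fp32", "bf16"]   # shorthand names keep test names short
PADDING_SIDES = ["left", "right"]

def expanded_test_names(base="test_eager_matches_sdpa_inference"):
    return [
        f"{base}_{idx}_{dtype}_padding_{side}"
        for idx, (dtype, side) in enumerate(product(DTYPES, PADDING_SIDES))
    ]

print(expanded_test_names()[0])   # → test_eager_matches_sdpa_inference_0_fp16_padding_left
print(len(expanded_test_names()))  # → 6
```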
An example -- what's being tested is now clear, minimal difference in run time ✨
