
Conversation

@zucchini-nlp (Member) commented Apr 18, 2025

What does this PR do?

FA2 with upcasted layer norm was failing for all VLMs. The issue lies in the VLM config structure, which is composite: modifying the base config doesn't update the sub-configs (which are also deep-copied into the model via XXX._from_config).

The solution is to apply the config update recursively to the model's children whenever a PreTrainedModel is found. I'm not sure we need any recursive calls in prepare_model_for_quantization, though; it doesn't seem to update the config.
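
For illustration, here is a minimal sketch of the idea (a hypothetical helper, not the exact code in this PR): walk the model's modules and mirror the config change onto every child that is itself a PreTrainedModel and therefore carries its own deep-copied config.

import torch.nn as nn
from transformers import PreTrainedModel

def update_config_recursively(model: nn.Module, **config_updates):
    # model.modules() yields the model itself plus all submodules, so the
    # sub-models of a composite VLM (e.g. its vision tower) are covered too
    for module in model.modules():
        if isinstance(module, PreTrainedModel):
            for key, value in config_updates.items():
                setattr(module.config, key, value)

# e.g. propagate an attention-implementation switch to every sub-config:
# update_config_recursively(model, _attn_implementation="flash_attention_2")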

@github-actions github-actions bot marked this pull request as draft April 18, 2025 09:15
@github-actions (Contributor) commented

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.

@zucchini-nlp zucchini-nlp marked this pull request as ready for review April 18, 2025 09:15
@zucchini-nlp (Member Author) commented

run-slow: janus

@github-actions (Contributor) commented

This comment contains run-slow, running the specified jobs:

models: ['models/janus']
quantizations: [] ...

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@zucchini-nlp zucchini-nlp changed the title [janus] fix tests [VLMs] fix flash-attention tests Apr 18, 2025
@zucchini-nlp zucchini-nlp requested a review from SunMarc April 18, 2025 10:49
@SunMarc (Member) left a comment

SGTM! We pop _pre_quantization_dtype when saving the config, which is why you don't see it after saving the model. I think we should also update it recursively!
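
For context, a hedged sketch of what a recursive cleanup could look like (the attribute walk below is illustrative, not the actual transformers serialization code):

from transformers import PretrainedConfig

def pop_key_recursively(config: PretrainedConfig, key: str = "_pre_quantization_dtype"):
    # drop the transient key from this config and from any nested sub-configs
    vars(config).pop(key, None)
    for value in list(vars(config).values()):
        if isinstance(value, PretrainedConfig):
            pop_key_recursively(value, key)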

@zucchini-nlp zucchini-nlp merged commit 1cfcbfc into huggingface:main Apr 24, 2025
20 checks passed
@BenjaminBossan (Member) commented

Hi @zucchini-nlp, this PR introduced a regression in PEFT. Before this PR, the model card looked like this:

---
library_name: peft
base_model: facebook/opt-125m
---

# Model Card for Model ID
[...]

Afterwards, the base_model meta info is missing:

---
library_name: peft
---

# Model Card for Model ID
[...]

Here is a reproducer:

# testfile.py (run with: pytest testfile.py; tmp_path is a pytest fixture)
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model_id = "facebook/opt-125m"

def test_save_peft_modelcard(tmp_path):
    model = AutoModelForCausalLM.from_pretrained(model_id)
    config = LoraConfig()
    model = get_peft_model(model, config)
    model.save_pretrained(tmp_path, safe_serialization=False)

    with open(f"{tmp_path}/README.md") as f:
        model_card = f.read()
    # the generated model card should still carry the base_model metadata
    assert "base_model: facebook/opt-125m" in model_card

Comment on lines -866 to -867
if "_name_or_path" in serializable_config_dict:
    del serializable_config_dict["_name_or_path"]
Member

Probably due to that; we don't remove this in to_dict(). cc @zucchini-nlp

Member Author

oops, sorry, will revert that
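
For reference, the revert would presumably just restore the guard deleted in the hunk above, roughly (placement assumed to be the config serialization helper that builds serializable_config_dict):

# restore the deleted guard so the transient key is stripped again before
# the config dict is serialized
if "_name_or_path" in serializable_config_dict:
    del serializable_config_dict["_name_or_path"]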

SunMarc pushed a commit that referenced this pull request Apr 28, 2025
zucchini-nlp added a commit to zucchini-nlp/transformers that referenced this pull request May 14, 2025
* fix one test

* fa2 ln test

* remove keys from config recursively

* fix

* fixup
zucchini-nlp added a commit to zucchini-nlp/transformers that referenced this pull request May 14, 2025