Skip to content

Conversation

@SunMarc
Copy link
Member

@SunMarc SunMarc commented Mar 12, 2025

What does this do

This PR simplifies a bit how we calculate modules that shouldn't be converted for the different quantization scheme.
Also, it fixes an issue with keep_in_fp32_modules as it is now Optional with default value None. This means that we should check beforehand with we want to use extend method.

@github-actions github-actions bot marked this pull request as draft March 12, 2025 14:29
@github-actions
Copy link
Contributor

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. When it is ready for review, please click the Ready for review button (at the bottom of the PR page).

@SunMarc SunMarc marked this pull request as ready for review March 12, 2025 14:30
@MekkCyber
Copy link
Contributor

LGTM Thanks !

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Member

@Cyrilvallez Cyrilvallez left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot!

@danielhanchen
Copy link
Contributor

Appreciate the fixes!

@SunMarc SunMarc merged commit cc3a361 into main Mar 12, 2025
24 checks passed
@SunMarc SunMarc deleted the fix-keep-in-fp32-quants branch March 12, 2025 22:43
@danielhanchen
Copy link
Contributor

:))

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants