Conversation

@pcuenca
Member

@pcuenca pcuenca commented Apr 29, 2025

Reviewed by @molbap privately.

#37852 should be merged too.

@github-actions github-actions bot marked this pull request as draft April 29, 2025 18:00
@github-actions
Contributor

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.

@pcuenca pcuenca marked this pull request as ready for review April 29, 2025 18:01
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@LysandreJik LysandreJik left a comment

Ok, LGTM!

Collaborator

@ArthurZucker ArthurZucker left a comment

I don’t think we should default to the static cache, but rather to the dynamic one!
Otherwise this should be alright!

@LysandreJik LysandreJik merged commit 63cd4c7 into main Apr 30, 2025
19 of 21 checks passed
@LysandreJik LysandreJik deleted the guard-updates branch April 30, 2025 08:34
zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request May 14, 2025
* Unhardcode use_chunked_attention, fix no_rope_layers

* Go back to exhaustive list of bools

* Conversion and modeling updates

* Fix rope

* Unhardcode rope

* Fix context length

* style

* Minor updates to conversion

* Use StaticCache

* Minor simplification

* DynamicCache 🤦

* Style

* Style

@ArthurZucker
Collaborator

This broke quite a few things, will have a look

5 participants