add sdpa to OPT #33298
Merged
Commits
57 commits
35e1a64 add sdpa to OPT
908e39b chore: remove redundant whitespace in OPTDecoder class
c84a4dd fixup
be32f92 bug fix
8063994 add sdpa and attention generate test
248029a fixup
b66e3d8 Refactor OPTAttention forward method for improved readability and mai…
579d60e undo refactor for _shape and key,val states
b105376 add OPT to doc, fixup didn't find it for some reason
c349632 change order
6dba8b0 change default attn_implemntation in testing to eager
989625b Merge branch 'main' into spda_opt (avishaiElmakies)
1d21751 [run-slow] opt
7233fda change test_eager_matches_sdpa_generate to the one llama
9bacdeb Update default attention implementation in testing common
5b38f78 [run-slow] opt
3f24a04 remove uneeded print
2efd25a [run-slow] opt
bdd9cb2 refactor model testers to have attn_implementation="eager"
f80e3b3 [run-slow] opt
7ea22eb convert test_eager_matches_sdpa_generate to opt-350M
b5547e7 bug fix when creating mask for opt
eaa8028 Merge branch 'main' into spda_opt
668e291 [run-slow] opt
d9d3bb3 if layer head mask default to eager
388d663 if head mask is not none fall to eager
e735ec4 [run-slow] opt
f94d574 Update src/transformers/models/opt/modeling_opt.py (avishaiElmakies)
e734d9d Clean up Unpack imports (#33631) (molbap)
34593ba Fix DPT /Dinov2 sdpa regression on main (#33660) (molbap)
6889d69 handle dependency errors in check_imports (#33622) (molbap)
d488c33 add back self.max_position_embeddings = config.max_position_embedding… (chengchengpei)
9990915 Fix Llava conversion for LlavaQwen2ForCausalLM with Clip vision tower… (Isotr0py)
3720eca Uniformize kwargs for Udop processor and update docs (#33628) (yonigozlan)
9b11d28 Generation: deprecate `PreTrainedModel` inheriting from `GenerationMi… (gante)
d3f8417 Enable BNB multi-backend support (#31098) (jiqing-feng)
52a0a75 Fix error string after refactoring into get_chat_template (#33652) (tibor-reiss)
400927e uniformize git processor (#33668) (yonigozlan)
3b0d24c Modular `transformers`: modularity and inheritance for new model addi… (ArthurZucker)
ef64c81 Fix CIs post merging modular transformers (#33681) (ArthurZucker)
6cd88aa Fixed docstring for cohere model regarding unavailability of prune_he… (mnauf)
4a457c1 Generation tests: update imagegpt input name, remove unused functions… (gante)
4deac16 Improve Error Messaging for Flash Attention 2 on CPU (#33655) (sizhky)
1f7d50a Gemma2: fix config initialization (`cache_implementation`) (#33684) (gante)
3e798fa Fix ByteLevel alphabet missing when Sequence pretokenizer is used (#3… (umarbutler)
9665ecc Uniformize kwargs for image-text-to-text processors (#32544) (yonigozlan)
e1839b9 🚨🚨 Setting default behavior of assisted decoding (#33657) (jmamou)
37da2d6 tests: fix pytorch tensor placement errors (#33485) (dvrogozh)
58c2b2b bump tokenizers, fix added tokens fast (#32535) (ArthurZucker)
f0bb0a8 [Pixtral] Improve docs, rename model (#33491) (NielsRogge)
34a9142 fix code quality after merge (ArthurZucker)
6aeec65 HFQuantizer implementation for compressed-tensors library (#31704) (bfineran)
3e69375 Merge branch 'main' into spda_opt (avishaiElmakies)
a9b18dc update model card for opt
9876dbb add batch size to inference table
ff35bbc [slow-run] opt
cfd1209 [run-slow] opt
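
Commits d9d3bb3 and 388d663 describe falling back to the eager attention path when a head mask is supplied. The snippet below is only a rough sketch of that dispatch rule, not the code from `modeling_opt.py`: the helper name `select_attention_path` and its signature are hypothetical, and it assumes the only fallback condition is a non-`None` head mask, since that is the only one the commit titles mention.

```python
from typing import Optional, Sequence


def select_attention_path(
    requested: str,
    layer_head_mask: Optional[Sequence[float]] = None,
) -> str:
    """Hypothetical helper: report which attention path would run.

    Assumption (taken from the commit titles only): when SDPA is requested
    but a head mask is given, the layer falls back to eager attention.
    """
    if requested == "sdpa" and layer_head_mask is not None:
        # A head mask is present, so use the eager path instead of SDPA.
        return "eager"
    # Otherwise keep whatever implementation was requested.
    return requested


# No head mask: the requested SDPA path is kept.
print(select_attention_path("sdpa"))
# Head mask present: fall back to eager.
print(select_attention_path("sdpa", layer_head_mask=[1.0, 0.0]))
```

Returning the chosen path as a string keeps the sketch testable in isolation; the real layer would call the corresponding forward implementation instead.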