-
Notifications
You must be signed in to change notification settings - Fork 207
[OpenVINO] Support Qwen3-next #1523
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+689
−6
Merged
Changes from 40 commits
Commits
Show all changes
49 commits
Select commit
Hold shift + click to select a range
61d0f3e
[OpenVINO] Support Qwen3-next
rkazants ea6b4b3
Fix config and add base patching
rkazants 7e37aae
Extend patching
rkazants 8bc1c5a
Initial patching for linear attention
rkazants 26a4b65
Patch recurrent gated delta rule
rkazants a0e8d3c
Use module extension for conversion of chunked_attention_cell
rkazants 486a4f8
Implement conversion extension for chunked gated delta rule cell
rkazants f623e57
Patch sparse moe block
rkazants e76f243
Use core_attn_out
rkazants b191d59
Fix use of mask
rkazants 0b1bb21
Correct shape for recurrent_state in config file
rkazants 6a3d22f
Re-write patch for MoE
rkazants 9df28e3
9ddaad9
Merge remote-tracking branch 'upstream/main' into support_qwen3_next
rkazants 6384b9f
Apply code-formatting
rkazants f66862a
Fix previous commit with main merge
rkazants f4af348
Re-patch sparse MoE
rkazants 500810f
Merge remote-tracking branch 'upstream/main' into support_qwen3_next
rkazants aee20f4
Merge remote-tracking branch 'upstream/main' into support_qwen3_next
rkazants 92ec0e5
Fix code formatting
rkazants 3e1c66f
Add tests for qwen3 next
rkazants 5f45761
Unify representation for CausalConv1d
rkazants d49c7cd
Apply code-formatting
rkazants 9874d8c
Leave only one GatedDeltaNet representation
rkazants f1dd676
Fix support for other models
rkazants 2665dc9
Fix test_decoder.py
rkazants 162bb72
Use chunk size equal to one
rkazants f940262
Apply suggestion from @rkazants
rkazants 0112422
Move to recurrent gated delta net
rkazants 6a6bb5b
Merge remote-tracking branch 'origin/support_qwen3_next' into support…
rkazants 0bbc2a1
Apply code-formatting
rkazants 15a3aee
Fix inference
rkazants 70d75ed
Add comments to patching and config code
rkazants d9c233c
Apply code-formatting
rkazants 2956e0a
Use the right decoder patcher
rkazants 5657efd
Fix test_export test
rkazants 1f32570
No beam search support for Qwen3-next
rkazants 203726f
Update tests/openvino/test_decoder.py
rkazants b4d7505
Remove unneeded cached function calls for chunked gdn
rkazants b453da7
Update optimum/exporters/openvino/model_patcher.py
rkazants 33cb551
Handle bf16 weights
rkazants c0e1311
Apply suggestion from @rkazants
rkazants e8c4702
Apply suggestion from @rkazants
rkazants 111e69d
Apply suggestion from @rkazants
rkazants 1a91c7b
Apply suggestion from @rkazants
rkazants 5238cff
Apply suggestion from @rkazants
rkazants e789e75
Comment patch_recurrent_gated_delta_rule
rkazants 169033e
Move convert_recurrent_attention_cell to internal module _ov_ops.py
rkazants 1b835bb
Apply code formatting
rkazants File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.