[OpenVINO] Support Qwen3-next#1523
Conversation
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
There was a problem hiding this comment.
Pull request overview
Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
There was a problem hiding this comment.
Pull request overview
Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
|
FYI After fixes in OV for group-wise quantization, I was able to measure the accuracy for default int4 and I got (CPU, WWB Similarity): 0.960114 |
|
Thank you 🙏 |
|
Hello, Roman. @rkazants Can we convert models from the Intel repository to OpenVINO? AutoRound models. https://huggingface.co/Intel/Qwen3-Coder-Next-int4-AutoRound |
Hi @savvadesogle, Can you try this command for conversion: optimum-cli export openvino -m Intel/Qwen3-Coder-Next-int4-AutoRound Qwen3-Coder-Next-int4-AutoRound@ljaljushkin, @MaximProshin, @mvafin, did you see any problems for converting this quantized model from NNCF perspective or PyTorch FE? Best regards, |
What does this PR do?
Example of conversion cmd-line for
Qwen/Qwen3-Next-80B-A3B-Instruct:optimum-cli export openvino -m Qwen/Qwen3-Next-80B-A3B-Instruct Qwen3-Next-80B-A3B-InstructExample of inference for
Qwen/Qwen3-Next-80B-A3B-Instructusing OpenVINO backend:Before submitting