Skip to content

[XPU] fix weight scale shape#42725

Merged
jikunshang merged 2 commits into
vllm-project:mainfrom
zufangzhu:zufang/fix_weight_scale
May 17, 2026
Merged

[XPU] fix weight scale shape#42725
jikunshang merged 2 commits into
vllm-project:mainfrom
zufangzhu:zufang/fix_weight_scale

Conversation

@zufangzhu
Copy link
Copy Markdown
Contributor

update weight scale shape

Test CI

Signed-off-by: Zhu, Zufang <zufang.zhu@intel.com>
@mergify mergify Bot added the intel-gpu Related to Intel GPU label May 15, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request modifies the process_weights_after_loading method in the XPU scaled matrix multiplication kernel to transpose and update the weight_scale parameter. A review comment points out that this transformation is currently performed unconditionally, which could lead to incorrect tensor shapes if the weight is already in the target layout. The reviewer suggests moving the weight_scale transposition inside the conditional block to ensure it remains consistent with the weight's layout.

Comment thread vllm/model_executor/kernels/linear/scaled_mm/xpu.py
@jikunshang jikunshang added the ready ONLY add when PR is ready to merge/full CI is needed label May 17, 2026
@jikunshang jikunshang merged commit 0fa8884 into vllm-project:main May 17, 2026
58 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

intel-gpu Related to Intel GPU ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants