[AMD] Optimize Qwen3-VL decode - fuse QK-norm + 3D mRoPE + KV cache write #21458

Merged
HaiShaw merged 6 commits into sgl-project:main from yctseng0211:fused_qk_norm_mrope_decode on Apr 1, 2026

Commits

Commits on Mar 27, 2026

Commits on Apr 1, 2026