Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[CI/build] Abort CI if pre-commit fails ci/build documentation Improvements or additions to documentation
#25168 opened Sep 18, 2025 by tmuttaki Loading…
5 tasks
[GDN] cherry-pick bugfix for scaled_dot_kkt from upstream FLA. ready ONLY add when PR is ready to merge/full CI is needed
#25167 opened Sep 18, 2025 by sighingnow Loading…
[CPU] Disable oneDNN linear on non-x86 platforms
#25166 opened Sep 18, 2025 by bigPYJ1151 Loading…
1 of 5 tasks
draft AFD implementation for step3 documentation Improvements or additions to documentation frontend needs-rebase v1
#25162 opened Sep 18, 2025 by Oliver-ss Loading…
5 tasks
refactor: abstract graph mode support into platform interface rocm Related to AMD ROCm
#25161 opened Sep 18, 2025 by yiz-liu Loading…
2 tasks
[Rocm] [quantization] support quark wmxfp4 for gptoss gpt-oss Related to GPT-OSS models rocm Related to AMD ROCm
#25159 opened Sep 18, 2025 by haoyangli-amd Draft
[Model] Let more pooling models support tp&pp.
#25152 opened Sep 18, 2025 by noooop Draft
5 tasks
[Docs] adjust docs link documentation Improvements or additions to documentation
#25151 opened Sep 18, 2025 by tangming1996 Loading…
5 tasks
[bugfix] fix MHA for models like OpenGVLab/InternVL3_5-38B
#25146 opened Sep 18, 2025 by yma11 Loading…
5 tasks
[XPU][bugfix] fix rope for llama4 and deepseek deepseek Related to DeepSeek models llama Related to Llama models
#25145 opened Sep 18, 2025 by yma11 Loading…
5 tasks
[Bugfix] Parse SpeculativeConfig Error
#25142 opened Sep 18, 2025 by yyzxw Loading…
5 tasks
[GPUModelRunner] Split code related to kv cache init to a separate file ci/build ready ONLY add when PR is ready to merge/full CI is needed v1
#25139 opened Sep 18, 2025 by heheda12345 Loading…
5 tasks
[spec decode] Fix MTP inference path for MiMo-7B model documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed speculative-decoding
#25136 opened Sep 18, 2025 by zixi-qi Loading…
5 tasks
Llamas 3.1 405B fp4 changes upstreaming from 355_wip llama Related to Llama models
#25135 opened Sep 18, 2025 by maleksan85 Loading…
[Bugfix] Fix ShardedStateLoader support for DeepSeek models with MLA scaling parameters deepseek Related to DeepSeek models
#25133 opened Sep 18, 2025 by lirong-lirong Loading…
6 of 8 tasks
[Kernel] Support DCP for Triton backend deepseek Related to DeepSeek models v1
#25132 opened Sep 18, 2025 by frank-wei Loading…
5 tasks
[V0 Deprecation] Remove V0 output processor ci/build frontend needs-rebase ready ONLY add when PR is ready to merge/full CI is needed v1
#25131 opened Sep 18, 2025 by WoosukKwon Loading…
Moves source compilation to build stage ci/build documentation Improvements or additions to documentation
#25129 opened Sep 18, 2025 by bbartels Loading…
5 tasks
ProTip! no:milestone will show everything without a milestone.