-
-
Notifications
You must be signed in to change notification settings - Fork 10.2k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[P/D][Nixl] Test PR kv-connector Label
kv-connector
#25175
opened Sep 18, 2025 by
NickLucche
•
Draft
TokenizerBase#convert_ids_to_tokens adds support for single id conversion
#25173
opened Sep 18, 2025 by
usberkeley
•
Draft
5 tasks
[CI/build] Abort CI if pre-commit fails
ci/build
documentation
Improvements or additions to documentation
#25168
opened Sep 18, 2025 by
tmuttaki
Loading…
5 tasks
[GDN] cherry-pick bugfix for scaled_dot_kkt from upstream FLA.
ready
ONLY add when PR is ready to merge/full CI is needed
#25167
opened Sep 18, 2025 by
sighingnow
Loading…
[CPU] Disable oneDNN linear on non-x86 platforms
#25166
opened Sep 18, 2025 by
bigPYJ1151
Loading…
1 of 5 tasks
Optimize KV cache distribution for asymmetric pipeline parallelism
frontend
v1
#25164
opened Sep 18, 2025 by
gholmes829
Loading…
4 of 5 tasks
[Docs] Fix warnings in mkdocs build (continued)
frontend
#25163
opened Sep 18, 2025 by
Zerohertz
Loading…
draft AFD implementation for step3
documentation
Improvements or additions to documentation
frontend
needs-rebase
v1
#25162
opened Sep 18, 2025 by
Oliver-ss
Loading…
5 tasks
refactor: abstract graph mode support into platform interface
rocm
Related to AMD ROCm
#25161
opened Sep 18, 2025 by
yiz-liu
Loading…
2 tasks
[Bugfix]: prevent crash when sampled tokens exceed max_model_len
v1
#25160
opened Sep 18, 2025 by
nicole-lihui
Loading…
1 of 5 tasks
[Rocm] [quantization] support quark wmxfp4 for gptoss
gpt-oss
Related to GPT-OSS models
rocm
Related to AMD ROCm
#25159
opened Sep 18, 2025 by
haoyangli-amd
•
Draft
[Docs] adjust docs link
documentation
Improvements or additions to documentation
#25151
opened Sep 18, 2025 by
tangming1996
Loading…
5 tasks
[bugfix] fix MHA for models like OpenGVLab/InternVL3_5-38B
#25146
opened Sep 18, 2025 by
yma11
Loading…
5 tasks
[XPU][bugfix] fix rope for llama4 and deepseek
deepseek
Related to DeepSeek models
llama
Related to Llama models
#25145
opened Sep 18, 2025 by
yma11
Loading…
5 tasks
[GPUModelRunner] Split code related to kv cache init to a separate file
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#25139
opened Sep 18, 2025 by
heheda12345
Loading…
5 tasks
[Bugfix][CPU] Add placeholder to avoid import errors when using fused_moe ops on platforms without triton
#25137
opened Sep 18, 2025 by
bigPYJ1151
Loading…
2 of 5 tasks
[spec decode] Fix MTP inference path for MiMo-7B model
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
speculative-decoding
#25136
opened Sep 18, 2025 by
zixi-qi
Loading…
5 tasks
Llamas 3.1 405B fp4 changes upstreaming from 355_wip
llama
Related to Llama models
#25135
opened Sep 18, 2025 by
maleksan85
Loading…
[Bugfix] Fix ShardedStateLoader support for DeepSeek models with MLA scaling parameters
deepseek
Related to DeepSeek models
#25133
opened Sep 18, 2025 by
lirong-lirong
Loading…
6 of 8 tasks
[Kernel] Support DCP for Triton backend
deepseek
Related to DeepSeek models
v1
#25132
opened Sep 18, 2025 by
frank-wei
Loading…
5 tasks
[V0 Deprecation] Remove V0 output processor
ci/build
frontend
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#25131
opened Sep 18, 2025 by
WoosukKwon
Loading…
Moves source compilation to build stage
ci/build
documentation
Improvements or additions to documentation
#25129
opened Sep 18, 2025 by
bbartels
Loading…
5 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.