Skip to content
Merged
Show file tree
Hide file tree
Changes from 5 commits
Commits
Show all changes
59 commits
Select commit Hold shift + click to select a range
e6d17fd
disable piecewise CUDA graph when MNNVL allreduce fusion is active
wenscarl Apr 16, 2026
a03c834
Merge remote-tracking branch 'origin/main' into ar_debug2
wenscarl May 5, 2026
04b50a4
docs: sync LMSYS SGLang blog cards
github-actions[bot] May 6, 2026
25f2d52
address review comments
wenscarl May 7, 2026
b7f5aa7
Merge branch 'main' into ar_debug2
wenscarl May 7, 2026
9e1b1bf
Merge branch 'main' into ar_debug2
b8zhong May 7, 2026
1d67ee1
Merge branch 'sgl-project:main' into main
wenscarl May 8, 2026
5727e6e
Merge remote-tracking branch 'origin/main' into ar_debug2
wenscarl May 8, 2026
5af1e3d
Merge branch 'main' into ar_debug2
b8zhong May 11, 2026
3cec562
Merge branch 'main' into ar_debug2
b8zhong May 12, 2026
085105d
Fix mnnvl ar fusion cuda graph capture fall back.
wenscarl May 12, 2026
00f6ebe
Skip bailout.
wenscarl May 12, 2026
6fe02b6
Add fallback and remove split op
wenscarl May 13, 2026
ab3c171
Merge branch 'sgl-project:main' into main
wenscarl May 13, 2026
1fb01bb
Simplify sm100 check
wenscarl May 13, 2026
86bf470
Merge remote-tracking branch 'shuw/main' into ar_debug2
wenscarl May 13, 2026
1b6279d
Update comments
wenscarl May 13, 2026
63ca041
fix index.mdx
wenscarl May 13, 2026
7b156cb
Merge branch 'main' into ar_debug2
b8zhong May 14, 2026
06ac53e
Merge branch 'main' into ar_debug2
wenscarl May 14, 2026
3e5ef52
Merge branch 'main' into ar_debug2
wenscarl May 15, 2026
6bcff35
Merge branch 'main' into ar_debug2
wenscarl May 15, 2026
8f6caaa
fix: clear flashinfer_allreduce_fusion_backend in deterministic infer…
wenscarl May 15, 2026
22bd707
Merge branch 'main' into ar_debug2
wenscarl May 15, 2026
bd08193
Merge branch 'main' into ar_debug2
wenscarl May 16, 2026
6714746
Merge branch 'main' into ar_debug2
wenscarl May 19, 2026
ed0b000
Try force thinking.
wenscarl May 19, 2026
796796d
Merge branch 'main' into ar_debug2
wenscarl May 19, 2026
a69896a
Merge branch 'main' into ar_debug2
wenscarl May 20, 2026
0fe1bd7
Fix CI test.
wenscarl May 20, 2026
87e7b6d
Fix typo
wenscarl May 20, 2026
b1ce506
Merge branch 'main' into ar_debug2
wenscarl May 20, 2026
1427a29
Merge branch 'main' into ar_debug2
wenscarl May 20, 2026
1fe43ce
Merge branch 'main' into ar_debug2
b8zhong May 22, 2026
3087da4
Resolve PR 23402 conflicts and address FlashInfer backend review
wenscarl May 29, 2026
40e5f2d
Revert FlashInfer PosixFD transport override
wenscarl May 29, 2026
6e66963
Remove overly strict MNNVL GPU count assertion
wenscarl May 29, 2026
477135e
Update test.
wenscarl Jun 1, 2026
e50d723
Fix typo.
wenscarl Jun 1, 2026
a80c2ab
Fix lint.
wenscarl Jun 1, 2026
cc67728
Fix lint.
wenscarl Jun 1, 2026
3797eb3
Merge branch 'main' into ar_debug2
wenscarl Jun 1, 2026
ecb3de8
Fix lint.
wenscarl Jun 2, 2026
c3223e4
Merge branch 'main' into ar_debug2
b8zhong Jun 2, 2026
f53f87e
Enable piecewise cuda graph due to bug fix.
wenscarl Jun 4, 2026
fe64b62
Update comments
wenscarl Jun 8, 2026
862fd11
force thinking and update backend
wenscarl Jun 8, 2026
7a64841
Enable sm90 singlenode
wenscarl Jun 9, 2026
58cc77b
Merge branch 'main' into ar_debug2
mmangkad Jun 9, 2026
8e23959
Upd help text.
wenscarl Jun 9, 2026
aa90f8c
Merge remote-tracking branch 'origin/main' into ar_debug2
Fridge003 Jun 10, 2026
bef6a6b
enhance test
Fridge003 Jun 10, 2026
1253302
Merge branch 'main' into ar_debug2
mmangkad Jun 11, 2026
9726f00
Merge branch 'main' into ar_debug2
b8zhong Jun 11, 2026
c831285
Merge branch 'main' into ar_debug2
b8zhong Jun 12, 2026
abfff2b
Merge branch 'main' into ar_debug2
mmangkad Jun 13, 2026
7200ed6
Merge branch 'main' into ar_debug2
mmangkad Jun 15, 2026
18e038e
more
Jun 15, 2026
053edd1
Merge branch 'main' into ar_debug2
b8zhong Jun 15, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion python/sglang/srt/layers/communicator.py
Original file line number Diff line number Diff line change
Expand Up @@ -163,7 +163,7 @@ def apply_flashinfer_allreduce_fusion(batch_size: int):
and batch_size > 0
and batch_size <= FUSE_ALLREDUCE_MAX_BATCH_SIZE
and not is_dp_attention_enabled()
and get_global_server_args().enable_flashinfer_allreduce_fusion
and get_global_server_args().flashinfer_allreduce_fusion_backend is not None
and not is_flashinfer_allreduce_unavailable()
)

Expand Down
Loading
Loading