fix legacy deepep path for flashinfer_cutedsl #22925
Merged
Collaborator
/tag-and-rerun-ci
Collaborator
#21339 also has some changes to the weight loading/processing, can you check if those interfere with the deepep path also?
ch-wan approved these changes on Apr 17, 2026
zhangying098 pushed a commit to zhangying098/sglang that referenced this pull request on Apr 23, 2026
kyx1999 pushed a commit to KMSorSMS/sglang that referenced this pull request on Apr 27, 2026
Motivation
The recent PR #21339 accidentally made it impossible to use the existing cutedsl MoE backend together with the DeepEP a2a backend (--moe-runner-backend flashinfer_cutedsl --moe-a2a-backend deepep). The regression was reported in "Recipe bug: flashinfer_cutedsl moe-runner-backend incompatible with deepep a2a-backend" (#39).
This PR restores the previous DeepEP behavior without changing the existing auto backend resolution or the generic runner setup logic.
Modifications
Restore the legacy DeepEP low-latency flashinfer_cutedsl MoE path by skipping MoeRunner initialization when flashinfer_cutedsl is used with moe_a2a_backend=deepep. This preserves the new standard CuteDSL path for moe_a2a_backend=none while keeping the old masked DeepEP execution path unchanged.
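The gating described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `MoeConfig`, `uses_legacy_deepep_cutedsl`, and `should_init_moe_runner` are hypothetical names, and only the backend strings (`flashinfer_cutedsl`, `deepep`, `none`) come from the description.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MoeConfig:
    """Hypothetical container for the two backend settings named in the PR."""

    runner_backend: str  # value passed as --moe-runner-backend
    a2a_backend: str     # value passed as --moe-a2a-backend


def uses_legacy_deepep_cutedsl(cfg: MoeConfig) -> bool:
    # The legacy masked DeepEP path applies only to this exact combination.
    return cfg.runner_backend == "flashinfer_cutedsl" and cfg.a2a_backend == "deepep"


def should_init_moe_runner(cfg: MoeConfig) -> bool:
    # Assumption: every other combination keeps the generic runner setup,
    # including flashinfer_cutedsl with a2a_backend="none".
    return not uses_legacy_deepep_cutedsl(cfg)


legacy = MoeConfig(runner_backend="flashinfer_cutedsl", a2a_backend="deepep")
standard = MoeConfig(runner_backend="flashinfer_cutedsl", a2a_backend="none")

print(should_init_moe_runner(legacy))    # False
print(should_init_moe_runner(standard))  # True
```

Keeping the check as a narrow predicate on the two settings mirrors the stated goal of leaving auto backend resolution and the generic runner setup untouched.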
Accuracy Tests
N/A
Speed Tests and Profiling
N/A