Skip to content

Commit 5fc4b60

Browse files
authored
Add sanity check for max_running_requests (#5016)
1 parent b868526 commit 5fc4b60

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

python/sglang/srt/managers/tp_worker.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -116,6 +116,7 @@ def __init__(
116116
),
117117
self.model_runner.req_to_token_pool.size,
118118
)
119+
assert self.max_running_requests > 0, "max_running_request is zero"
119120
self.max_req_len = min(
120121
self.model_config.context_len - 1,
121122
self.max_total_num_tokens - 1,

0 commit comments

Comments
 (0)