Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Commits
eb4ba1c2
Commit
eb4ba1c2
authored
Oct 29, 2025
by
maxiao1
Browse files
update UNBALANCED_MODEL_LOADING_TIMEOUT_S=3600
parent
4b9b337b
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
python/sglang/srt/model_executor/model_runner.py
python/sglang/srt/model_executor/model_runner.py
+1
-1
No files found.
python/sglang/srt/model_executor/model_runner.py
View file @
eb4ba1c2
...
@@ -203,7 +203,7 @@ _is_xpu_xmx_available = xpu_has_xmx_support()
...
@@ -203,7 +203,7 @@ _is_xpu_xmx_available = xpu_has_xmx_support()
SGLANG_CI_SMALL_KV_SIZE
=
os
.
getenv
(
"SGLANG_CI_SMALL_KV_SIZE"
,
None
)
SGLANG_CI_SMALL_KV_SIZE
=
os
.
getenv
(
"SGLANG_CI_SMALL_KV_SIZE"
,
None
)
# Detect stragger ranks in model loading
# Detect stragger ranks in model loading
UNBALANCED_MODEL_LOADING_TIMEOUT_S
=
3600
0
UNBALANCED_MODEL_LOADING_TIMEOUT_S
=
3600
# the ratio of mamba cache pool size to max_running_requests, it will be safe when it is larger than 2 (yizhang2077)
# the ratio of mamba cache pool size to max_running_requests, it will be safe when it is larger than 2 (yizhang2077)
MAMBA_CACHE_SIZE_MAX_RUNNING_REQUESTS_RATIO
=
3
MAMBA_CACHE_SIZE_MAX_RUNNING_REQUESTS_RATIO
=
3
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment