Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
dynamo
Commits
f208208b
"vllm/vscode:/vscode.git/clone" did not exist on "2abd97592f947c041ba70329532f0cf62dd8971f"
Unverified
Commit
f208208b
authored
Apr 23, 2026
by
ishandhanani
Committed by
GitHub
Apr 23, 2026
Browse files
fix(sglang): disable piecewise CUDA graph in launch scripts (#8609)
parent
c5a60a04
Changes
4
Show whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
9 additions
and
0 deletions
+9
-0
examples/backends/sglang/launch/agg.sh
examples/backends/sglang/launch/agg.sh
+1
-0
examples/backends/sglang/launch/agg_router.sh
examples/backends/sglang/launch/agg_router.sh
+2
-0
examples/backends/sglang/launch/disagg.sh
examples/backends/sglang/launch/disagg.sh
+2
-0
examples/backends/sglang/launch/disagg_router.sh
examples/backends/sglang/launch/disagg_router.sh
+4
-0
No files found.
examples/backends/sglang/launch/agg.sh
View file @
f208208b
...
...
@@ -85,6 +85,7 @@ python3 -m "$WORKER_MODULE" \
--trust-remote-code
\
--skip-tokenizer-init
\
--enable-metrics
\
--disable-piecewise-cuda-graph
\
$GPU_MEM_ARGS
\
"
${
TRACE_ARGS
[@]
}
"
\
"
${
EXTRA_ARGS
[@]
}
"
&
...
...
examples/backends/sglang/launch/agg_router.sh
View file @
f208208b
...
...
@@ -86,6 +86,7 @@ python3 -m dynamo.sglang \
--trust-remote-code
\
"
${
KV_EVENTS_ARGS_1
[@]
}
"
\
--enable-metrics
\
--disable-piecewise-cuda-graph
\
$GPU_MEM_ARGS
\
"
${
TRACE_ARGS
[@]
}
"
&
...
...
@@ -98,6 +99,7 @@ CUDA_VISIBLE_DEVICES=1 python3 -m dynamo.sglang \
--trust-remote-code
\
"
${
KV_EVENTS_ARGS_2
[@]
}
"
\
--enable-metrics
\
--disable-piecewise-cuda-graph
\
$GPU_MEM_ARGS
\
"
${
TRACE_ARGS
[@]
}
"
&
...
...
examples/backends/sglang/launch/disagg.sh
View file @
f208208b
...
...
@@ -81,6 +81,7 @@ python3 -m dynamo.sglang \
--port
40000
\
--disaggregation-transfer-backend
nixl
\
--enable-metrics
\
--disable-piecewise-cuda-graph
\
$GPU_MEM_ARGS
\
"
${
TRACE_ARGS
[@]
}
"
&
...
...
@@ -97,6 +98,7 @@ CUDA_VISIBLE_DEVICES=1 python3 -m dynamo.sglang \
--host
0.0.0.0
\
--disaggregation-transfer-backend
nixl
\
--enable-metrics
\
--disable-piecewise-cuda-graph
\
$GPU_MEM_ARGS
\
"
${
TRACE_ARGS
[@]
}
"
&
...
...
examples/backends/sglang/launch/disagg_router.sh
View file @
f208208b
...
...
@@ -76,6 +76,7 @@ python3 -m dynamo.sglang \
--kv-events-config
'{"publisher":"zmq","topic":"kv-events","endpoint":"tcp://*:5557"}'
\
--disaggregation-transfer-backend
nixl
\
--enable-metrics
\
--disable-piecewise-cuda-graph
\
"
${
TRACE_ARGS
[@]
}
"
&
# run prefill worker
...
...
@@ -91,6 +92,7 @@ CUDA_VISIBLE_DEVICES=1 python3 -m dynamo.sglang \
--kv-events-config
'{"publisher":"zmq","topic":"kv-events","endpoint":"tcp://*:5558"}'
\
--disaggregation-transfer-backend
nixl
\
--enable-metrics
\
--disable-piecewise-cuda-graph
\
"
${
TRACE_ARGS
[@]
}
"
&
# run decode worker
...
...
@@ -106,6 +108,7 @@ CUDA_VISIBLE_DEVICES=3 python3 -m dynamo.sglang \
--kv-events-config
'{"publisher":"zmq","topic":"kv-events","endpoint":"tcp://*:5560"}'
\
--disaggregation-transfer-backend
nixl
\
--enable-metrics
\
--disable-piecewise-cuda-graph
\
"
${
TRACE_ARGS
[@]
}
"
&
# run decode worker
...
...
@@ -121,6 +124,7 @@ CUDA_VISIBLE_DEVICES=2 python3 -m dynamo.sglang \
--kv-events-config
'{"publisher":"zmq","topic":"kv-events","endpoint":"tcp://*:5559"}'
\
--disaggregation-transfer-backend
nixl
\
--enable-metrics
\
--disable-piecewise-cuda-graph
\
"
${
TRACE_ARGS
[@]
}
"
&
# Wait for any worker to exit (keeps script running)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment