OpenDAS / dynamo
Commit 2a24e4aa (Unverified)
Authored Oct 20, 2025 by ishandhanani; committed by GitHub Oct 21, 2025
fix: fix the ooms in gb200 default instructions (#3768)
Parent: 9a723734
Showing 2 changed files with 9 additions and 6 deletions
docs/backends/sglang/dsr1-wideep-gb200.md (+5 -6)
docs/backends/sglang/multinode-examples.md (+4 -0)
docs/backends/sglang/dsr1-wideep-gb200.md (view file @ 2a24e4aa)
@@ -59,7 +59,6 @@ MC_TE_METRIC=true \
  SGLANG_DISAGGREGATION_HEARTBEAT_MAX_FAILURE=100000 \
  SGLANG_DISAGGREGATION_BOOTSTRAP_TIMEOUT=100000 \
  SGLANG_DISAGGREGATION_WAITING_TIMEOUT=100000 \
- SGLANG_MOONCAKE_CUSTOM_MEM_POOL=True \
  MC_FORCE_MNNVL=1 \
  NCCL_MNNVL_ENABLE=1 \
  NCCL_CUMEM_ENABLE=1 \
@@ -98,7 +97,7 @@ python3 -m dynamo.sglang \
  --disable-cuda-graph \
  --chunked-prefill-size 16384 \
  --max-total-tokens 32768 \
- --mem-fraction-static 0.8 \
+ --mem-fraction-static 0.82 \
  --log-level debug
  ```
@@ -128,10 +127,10 @@ python3 -m dynamo.sglang \
  --disaggregation-mode decode \
  --dist-init-addr ${HEAD_DECODE_NODE_IP}:29500 \
  --disaggregation-bootstrap-port 30001 \
- --nnodes 12 \
+ --nnodes 2 \
  --node-rank 0 \
- --tp-size 48 \
+ --tp-size 8 \
- --dp-size 48 \
+ --dp-size 8 \
  --enable-dp-attention \
  --host 0.0.0.0 \
  --decode-log-interval 1 \
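The decode hunk above changes `--nnodes`, `--tp-size`, and `--dp-size` together (12/48/48 becomes 2/8/8). A minimal sketch of that relationship follows, assuming 4 GPUs per node. That figure is consistent with both the old and new values but is not stated in this diff, and `expected_tp_size` is a hypothetical helper name, not part of dynamo or SGLang.

```python
# Assumption: each node contributes 4 GPUs (not stated in the diff).
GPUS_PER_NODE = 4


def expected_tp_size(nnodes: int, gpus_per_node: int = GPUS_PER_NODE) -> int:
    """Return the GPU count spanned by a worker group of `nnodes` nodes."""
    if nnodes < 1 or gpus_per_node < 1:
        raise ValueError("nnodes and gpus_per_node must be positive")
    return nnodes * gpus_per_node


old_tp = expected_tp_size(12)  # 48, matches the removed --tp-size / --dp-size
new_tp = expected_tp_size(2)   # 8, matches the added --tp-size / --dp-size
```

If the node count changes again, recomputing the value this way keeps the three flags in step, under the same per-node assumption.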
@@ -155,4 +154,4 @@ python3 -m dynamo.sglang \
  --log-level debug
  ```
- On the other decode nodes (this example has 2 total decode nodes), run the same command but change `--node-rank` to 1
\ No newline at end of file
+ On the other decode nodes (this example has 2 total decode nodes), run the same command but change `--node-rank` to 1.
\ No newline at end of file
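The instruction above means every decode node runs the same command with only `--node-rank` differing. A minimal sketch that lists the rank-specific flags per node follows. `decode_rank_flags` is a hypothetical helper, and it emits only flags that appear in this diff.

```python
def decode_rank_flags(nnodes: int) -> list[list[str]]:
    """Return the --nnodes/--node-rank flag pair for each decode node."""
    if nnodes < 1:
        raise ValueError("nnodes must be positive")
    return [
        ["--nnodes", str(nnodes), "--node-rank", str(rank)]
        for rank in range(nnodes)
    ]


# Two decode nodes, as in this example: ranks 0 and 1.
for flags in decode_rank_flags(2):
    print(" ".join(flags))
```

The remaining flags (`--dist-init-addr`, `--tp-size`, and so on) stay identical across nodes, so they are left out of the sketch.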
docs/backends/sglang/multinode-examples.md (view file @ 2a24e4aa)
@@ -39,6 +39,7 @@ python3 -m dynamo.sglang \
  --disaggregation-mode prefill \
  --disaggregation-transfer-backend nixl \
  --disaggregation-bootstrap-port 30001 \
+ --load-balance-method round_robin \
  --host 0.0.0.0 \
  --mem-fraction-static 0.82
  ```
@@ -60,6 +61,7 @@ python3 -m dynamo.sglang \
  --disaggregation-transfer-backend nixl \
  --disaggregation-bootstrap-port 30001 \
  --host 0.0.0.0 \
+ --load-balance-method round_robin \
  --mem-fraction-static 0.82
  ```
@@ -80,6 +82,7 @@ python3 -m dynamo.sglang \
  --disaggregation-transfer-backend nixl \
  --disaggregation-bootstrap-port 30001 \
  --host 0.0.0.0 \
+ --prefill-round-robin-balance \
  --mem-fraction-static 0.82
  ```
@@ -100,6 +103,7 @@ python3 -m dynamo.sglang \
  --disaggregation-transfer-backend nixl \
  --disaggregation-bootstrap-port 30001 \
  --host 0.0.0.0 \
+ --prefill-round-robin-balance \
  --mem-fraction-static 0.82
  ```
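The flags added in this file (`--load-balance-method round_robin` and `--prefill-round-robin-balance`) both name round-robin balancing. As a generic illustration of that technique only, and not SGLang's actual implementation, the sketch below hands requests to workers in rotating order. `assign_round_robin` and its arguments are hypothetical names.

```python
from itertools import cycle


def assign_round_robin(request_ids, num_workers):
    """Map each request to a worker index, rotating through workers in order."""
    if num_workers < 1:
        raise ValueError("num_workers must be positive")
    workers = cycle(range(num_workers))
    return {request_id: next(workers) for request_id in request_ids}


# Five requests across two workers alternate 0, 1, 0, 1, 0.
assignment = assign_round_robin(["a", "b", "c", "d", "e"], 2)
```

Round-robin ignores per-worker load, so its appeal is an even and predictable spread of requests rather than adaptivity.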