Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
dynamo
Commits
78ffe3d3
Unverified
Commit
78ffe3d3
authored
Aug 04, 2025
by
atchernych
Committed by
GitHub
Aug 04, 2025
Browse files
fix: Adjust frontEnd thrasholds (#2288)
parent
30f33524
Changes
8
Hide whitespace changes
Inline
Side-by-side
Showing
8 changed files
with
44 additions
and
40 deletions
+44
-40
components/backends/sglang/deploy/agg.yaml
components/backends/sglang/deploy/agg.yaml
+4
-4
components/backends/sglang/deploy/agg_router.yaml
components/backends/sglang/deploy/agg_router.yaml
+4
-4
components/backends/sglang/deploy/disagg.yaml
components/backends/sglang/deploy/disagg.yaml
+4
-4
components/backends/vllm/deploy/agg.yaml
components/backends/vllm/deploy/agg.yaml
+16
-12
components/backends/vllm/deploy/agg_router.yaml
components/backends/vllm/deploy/agg_router.yaml
+4
-4
components/backends/vllm/deploy/disagg.yaml
components/backends/vllm/deploy/disagg.yaml
+4
-4
components/backends/vllm/deploy/disagg_planner.yaml
components/backends/vllm/deploy/disagg_planner.yaml
+4
-4
components/backends/vllm/deploy/disagg_router.yaml
components/backends/vllm/deploy/disagg_router.yaml
+4
-4
No files found.
components/backends/sglang/deploy/agg.yaml
View file @
78ffe3d3
...
@@ -12,10 +12,10 @@ spec:
...
@@ -12,10 +12,10 @@ spec:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
8000
port
:
8000
initialDelaySeconds
:
6
0
initialDelaySeconds
:
2
0
periodSeconds
:
60
periodSeconds
:
5
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
10
failureThreshold
:
3
readinessProbe
:
readinessProbe
:
exec
:
exec
:
command
:
command
:
...
...
components/backends/sglang/deploy/agg_router.yaml
View file @
78ffe3d3
...
@@ -12,10 +12,10 @@ spec:
...
@@ -12,10 +12,10 @@ spec:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
8000
port
:
8000
initialDelaySeconds
:
6
0
initialDelaySeconds
:
2
0
periodSeconds
:
60
periodSeconds
:
5
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
10
failureThreshold
:
3
readinessProbe
:
readinessProbe
:
exec
:
exec
:
command
:
command
:
...
...
components/backends/sglang/deploy/disagg.yaml
View file @
78ffe3d3
...
@@ -12,10 +12,10 @@ spec:
...
@@ -12,10 +12,10 @@ spec:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
8000
port
:
8000
initialDelaySeconds
:
6
0
initialDelaySeconds
:
2
0
periodSeconds
:
60
periodSeconds
:
5
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
10
failureThreshold
:
3
readinessProbe
:
readinessProbe
:
exec
:
exec
:
command
:
command
:
...
...
components/backends/vllm/deploy/agg.yaml
View file @
78ffe3d3
...
@@ -12,20 +12,20 @@ spec:
...
@@ -12,20 +12,20 @@ spec:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
8000
port
:
8000
initialDelaySeconds
:
6
0
initialDelaySeconds
:
2
0
periodSeconds
:
60
periodSeconds
:
5
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
10
failureThreshold
:
3
readinessProbe
:
readinessProbe
:
exec
:
exec
:
command
:
command
:
-
/bin/sh
-
/bin/sh
-
-c
-
-c
-
'
curl
-s
http://localhost:8000/health
|
jq
-e
".status
==
\"healthy\""'
-
'
curl
-s
http://localhost:8000/health
|
jq
-e
".status
==
\"healthy\""'
initialDelaySeconds
:
6
0
initialDelaySeconds
:
1
0
periodSeconds
:
60
periodSeconds
:
5
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
10
failureThreshold
:
3
dynamoNamespace
:
vllm-agg
dynamoNamespace
:
vllm-agg
componentType
:
main
componentType
:
main
replicas
:
1
replicas
:
1
...
@@ -51,15 +51,17 @@ spec:
...
@@ -51,15 +51,17 @@ spec:
httpGet
:
httpGet
:
path
:
/live
path
:
/live
port
:
9090
port
:
9090
periodSeconds
:
5
initialDelaySeconds
:
30
timeoutSeconds
:
30
periodSeconds
:
10
failureThreshold
:
1
timeoutSeconds
:
5
failureThreshold
:
3
readinessProbe
:
readinessProbe
:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
9090
port
:
9090
initialDelaySeconds
:
30
periodSeconds
:
10
periodSeconds
:
10
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
60
failureThreshold
:
60
dynamoNamespace
:
vllm-agg
dynamoNamespace
:
vllm-agg
componentType
:
worker
componentType
:
worker
...
@@ -86,7 +88,9 @@ spec:
...
@@ -86,7 +88,9 @@ spec:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
9090
port
:
9090
initialDelaySeconds
:
10
periodSeconds
:
10
periodSeconds
:
10
timeoutSeconds
:
5
failureThreshold
:
60
failureThreshold
:
60
image
:
nvcr.io/nvidian/nim-llm-dev/vllm-runtime:dep-233.17
image
:
nvcr.io/nvidian/nim-llm-dev/vllm-runtime:dep-233.17
workingDir
:
/workspace/components/backends/vllm
workingDir
:
/workspace/components/backends/vllm
...
...
components/backends/vllm/deploy/agg_router.yaml
View file @
78ffe3d3
...
@@ -12,10 +12,10 @@ spec:
...
@@ -12,10 +12,10 @@ spec:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
8000
port
:
8000
initialDelaySeconds
:
6
0
initialDelaySeconds
:
2
0
periodSeconds
:
60
periodSeconds
:
5
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
10
failureThreshold
:
3
readinessProbe
:
readinessProbe
:
exec
:
exec
:
command
:
command
:
...
...
components/backends/vllm/deploy/disagg.yaml
View file @
78ffe3d3
...
@@ -15,10 +15,10 @@ spec:
...
@@ -15,10 +15,10 @@ spec:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
8000
port
:
8000
initialDelaySeconds
:
6
0
initialDelaySeconds
:
2
0
periodSeconds
:
60
periodSeconds
:
5
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
10
failureThreshold
:
3
readinessProbe
:
readinessProbe
:
exec
:
exec
:
command
:
command
:
...
...
components/backends/vllm/deploy/disagg_planner.yaml
View file @
78ffe3d3
...
@@ -22,10 +22,10 @@ spec:
...
@@ -22,10 +22,10 @@ spec:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
8000
port
:
8000
initialDelaySeconds
:
6
0
initialDelaySeconds
:
2
0
periodSeconds
:
60
periodSeconds
:
5
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
10
failureThreshold
:
3
readinessProbe
:
readinessProbe
:
exec
:
exec
:
command
:
command
:
...
...
components/backends/vllm/deploy/disagg_router.yaml
View file @
78ffe3d3
...
@@ -15,10 +15,10 @@ spec:
...
@@ -15,10 +15,10 @@ spec:
httpGet
:
httpGet
:
path
:
/health
path
:
/health
port
:
8000
port
:
8000
initialDelaySeconds
:
6
0
initialDelaySeconds
:
2
0
periodSeconds
:
60
periodSeconds
:
5
timeoutSeconds
:
30
timeoutSeconds
:
5
failureThreshold
:
10
failureThreshold
:
3
readinessProbe
:
readinessProbe
:
exec
:
exec
:
command
:
command
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment