Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
dynamo
Commits
c675fd1b
"vscode:/vscode.git/clone" did not exist on "09b26bf6b39df6fe9e2e1c635932af19fa8a6718"
Unverified
Commit
c675fd1b
authored
Jun 04, 2025
by
Hongkuan Zhou
Committed by
GitHub
Jun 04, 2025
Browse files
fix: prefillqueue stream name in load-planner (#1377)
parent
5c9a2d49
Changes
4
Show whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
1 addition
and
10 deletions
+1
-10
components/planner/src/dynamo/planner/defaults.py
components/planner/src/dynamo/planner/defaults.py
+0
-1
docs/architecture/planner.md
docs/architecture/planner.md
+0
-1
docs/guides/planner_benchmark/benchmark_planner.md
docs/guides/planner_benchmark/benchmark_planner.md
+0
-1
examples/llm/components/planner.py
examples/llm/components/planner.py
+1
-7
No files found.
components/planner/src/dynamo/planner/defaults.py
View file @
c675fd1b
...
@@ -17,7 +17,6 @@
...
@@ -17,7 +17,6 @@
# Source of truth for planner defaults
# Source of truth for planner defaults
class
PlannerDefaults
:
class
PlannerDefaults
:
namespace
=
"dynamo"
namespace
=
"dynamo"
served_model_name
=
"vllm"
environment
=
"local"
environment
=
"local"
no_operation
=
False
no_operation
=
False
log_dir
=
None
log_dir
=
None
...
...
docs/architecture/planner.md
View file @
c675fd1b
...
@@ -110,7 +110,6 @@ dynamo serve graphs.disagg:Frontend -f disagg.yaml --Planner.environment=local -
...
@@ -110,7 +110,6 @@ dynamo serve graphs.disagg:Frontend -f disagg.yaml --Planner.environment=local -
Configuration options:
Configuration options:
* `namespace` (str, default: "dynamo"): Target namespace for planner operations
* `namespace` (str, default: "dynamo"): Target namespace for planner operations
* `environment` (str, default: "local"): Target environment (local, kubernetes)
* `environment` (str, default: "local"): Target environment (local, kubernetes)
* `served-model-name` (str, default: "vllm"): Target model name
* `no-operation` (bool, default: false): Run in observation mode only
* `no-operation` (bool, default: false): Run in observation mode only
* `log-dir` (str, default: None): Tensorboard log directory
* `log-dir` (str, default: None): Tensorboard log directory
* `adjustment-interval` (int, default: 30): Seconds between adjustments
* `adjustment-interval` (int, default: 30): Seconds between adjustments
...
...
docs/guides/planner_benchmark/benchmark_planner.md
View file @
c675fd1b
...
@@ -54,7 +54,6 @@ dynamo serve graphs.disagg_router:Frontend -f disagg_1p1d.yml
...
@@ -54,7 +54,6 @@ dynamo serve graphs.disagg_router:Frontend -f disagg_1p1d.yml
genai-perf profile
\
genai-perf profile
\
--tokenizer
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
\
--tokenizer
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
\
-m
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
\
-m
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
\
--service-kind
openai
\
--endpoint-type
chat
\
--endpoint-type
chat
\
--url
http://localhost:8000
\
--url
http://localhost:8000
\
--streaming
\
--streaming
\
...
...
examples/llm/components/planner.py
View file @
c675fd1b
...
@@ -64,7 +64,7 @@ class Planner:
...
@@ -64,7 +64,7 @@ class Planner:
self
.
_prefill_queue_nats_server
=
os
.
getenv
(
self
.
_prefill_queue_nats_server
=
os
.
getenv
(
"NATS_SERVER"
,
"nats://localhost:4222"
"NATS_SERVER"
,
"nats://localhost:4222"
)
)
self
.
_prefill_queue_stream_name
=
self
.
args
.
served_model_name
self
.
_prefill_queue_stream_name
=
f
"
{
self
.
namespace
}
_prefill_queue"
self
.
prefill_client
:
Any
|
None
=
None
self
.
prefill_client
:
Any
|
None
=
None
self
.
workers_client
:
Any
|
None
=
None
self
.
workers_client
:
Any
|
None
=
None
...
@@ -411,12 +411,6 @@ if __name__ == "__main__":
...
@@ -411,12 +411,6 @@ if __name__ == "__main__":
default
=
PlannerDefaults
.
namespace
,
default
=
PlannerDefaults
.
namespace
,
help
=
"Namespace planner will look at"
,
help
=
"Namespace planner will look at"
,
)
)
parser
.
add_argument
(
"--served-model-name"
,
type
=
str
,
default
=
PlannerDefaults
.
served_model_name
,
help
=
"Model name that is being served (used for prefill queue name)"
,
)
parser
.
add_argument
(
parser
.
add_argument
(
"--no-operation"
,
"--no-operation"
,
action
=
"store_true"
,
action
=
"store_true"
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment