"vscode:/vscode.git/clone" did not exist on "6f5b460c883b53a7bd2d0f6c1c9539bf226042f3"
Unverified Commit 8bd37c96 authored by Anant Sharma's avatar Anant Sharma Committed by GitHub
Browse files

refactor: move backend deploy, launch and slurm files from components to examples (#3849)


Signed-off-by: default avatarAnant Sharma <anants@nvidia.com>
parent 78359046
...@@ -60,9 +60,9 @@ spec: ...@@ -60,9 +60,9 @@ spec:
- No worker-side configuration changes needed - No worker-side configuration changes needed
**Complete K8s Examples:** **Complete K8s Examples:**
- [TRT-LLM aggregated router example](../../components/backends/trtllm/deploy/agg_router.yaml) - [TRT-LLM aggregated router example](../../examples/backends/trtllm/deploy/agg_router.yaml)
- [vLLM aggregated router example](../../components/backends/vllm/deploy/agg_router.yaml) - [vLLM aggregated router example](../../examples/backends/vllm/deploy/agg_router.yaml)
- [SGLang aggregated router example](../../components/backends/sglang/deploy/agg_router.yaml) - [SGLang aggregated router example](../../examples/backends/sglang/deploy/agg_router.yaml)
- [Distributed inference tutorial](../../examples/basics/kubernetes/Distributed_Inference/agg_router.yaml) - [Distributed inference tutorial](../../examples/basics/kubernetes/Distributed_Inference/agg_router.yaml)
**For A/B Testing and Advanced K8s Setup:** **For A/B Testing and Advanced K8s Setup:**
......
...@@ -34,10 +34,10 @@ Learn fundamental Dynamo concepts through these introductory examples: ...@@ -34,10 +34,10 @@ Learn fundamental Dynamo concepts through these introductory examples:
These examples show how Dynamo broadly works using major inference engines. These examples show how Dynamo broadly works using major inference engines.
If you want to see advanced, framework-specific deployment patterns and best practices, check out the [Components Workflows](../components/backends/) directory: If you want to see advanced, framework-specific deployment patterns and best practices, check out the [Examples Backends](../examples/backends/) directory:
- **[vLLM](../components/backends/vllm/)** – vLLM-specific deployment and configuration - **[vLLM](backends/vllm/)** – vLLM-specific deployment and configuration
- **[SGLang](../components/backends/sglang/)** – SGLang integration examples and workflows - **[SGLang](backends/sglang/)** – SGLang integration examples and workflows
- **[TensorRT-LLM](../components/backends/trtllm/)** – TensorRT-LLM workflows and optimizations - **[TensorRT-LLM](backends/trtllm/)** – TensorRT-LLM workflows and optimizations
## Deployment Examples ## Deployment Examples
......
...@@ -62,7 +62,7 @@ resources: ...@@ -62,7 +62,7 @@ resources:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
args: args:
- "python3" - "python3"
- "-m" - "-m"
......
...@@ -25,7 +25,7 @@ spec: ...@@ -25,7 +25,7 @@ spec:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
command: command:
- python3 - python3
- -m - -m
......
...@@ -28,7 +28,7 @@ spec: ...@@ -28,7 +28,7 @@ spec:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
command: command:
- python3 - python3
- -m - -m
......
...@@ -28,7 +28,7 @@ spec: ...@@ -28,7 +28,7 @@ spec:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
command: command:
- python3 - python3
- -m - -m
......
...@@ -36,7 +36,7 @@ spec: ...@@ -36,7 +36,7 @@ spec:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
command: command:
- python3 - python3
- -m - -m
...@@ -73,7 +73,7 @@ spec: ...@@ -73,7 +73,7 @@ spec:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
command: command:
- python3 - python3
- -m - -m
......
...@@ -26,7 +26,7 @@ spec: ...@@ -26,7 +26,7 @@ spec:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
command: command:
- python3 - python3
- -m - -m
...@@ -62,7 +62,7 @@ spec: ...@@ -62,7 +62,7 @@ spec:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
command: command:
- python3 - python3
- -m - -m
......
...@@ -50,7 +50,7 @@ spec: ...@@ -50,7 +50,7 @@ spec:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
command: command:
- python3 - python3
args: args:
...@@ -86,7 +86,7 @@ spec: ...@@ -86,7 +86,7 @@ spec:
extraPodSpec: extraPodSpec:
mainContainer: mainContainer:
image: my-registry/sglang-runtime:my-tag image: my-registry/sglang-runtime:my-tag
workingDir: /workspace/components/backends/sglang workingDir: /workspace/examples/backends/sglang
command: command:
- python3 - python3
args: args:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment