Unverified Commit 24cb926e authored by atchernych's avatar atchernych Committed by GitHub
Browse files

docs: Clean index.rst (#2104)

parent a2cb1c33
../../../../components/backends/llm/README.md
\ No newline at end of file
../../../../../components/backends/sglang/docs/multinode-examples.md
\ No newline at end of file
../../../../components/backends/trtllm/README.md
\ No newline at end of file
../../../../components/backends/vllm/README.md
\ No newline at end of file
../../../deploy/metrics/docker-compose.yml
\ No newline at end of file
../../../../examples/runtime/hello_world/README.md
\ No newline at end of file
../../../guides/dynamo_deploy/operator_deployment.md
\ No newline at end of file
...@@ -67,7 +67,7 @@ Ensure you have the source code checked out and are in the `dynamo` directory: ...@@ -67,7 +67,7 @@ Ensure you have the source code checked out and are in the `dynamo` directory:
### Set Environment Variables ### Set Environment Variables
Our examples use the [`nvcr.io`](nvcr.io/nvidia/ai-dynamo/) but you can setup your own values if you use another docker registry. Our examples use the [`nvcr.io`](https://nvcr.io/nvidia/ai-dynamo/) but you can setup your own values if you use another docker registry.
```bash ```bash
export NAMESPACE=dynamo-cloud # or whatever you prefer. export NAMESPACE=dynamo-cloud # or whatever you prefer.
......
...@@ -45,26 +45,26 @@ The examples below assume you build the latest image yourself from source. If us ...@@ -45,26 +45,26 @@ The examples below assume you build the latest image yourself from source. If us
:margin: 0 :margin: 0
:padding: 3 4 0 0 :padding: 3 4 0 0
.. grid-item-card:: :doc:`Hello World </examples/hello_world>` .. grid-item-card:: :doc:`Hello World <examples/runtime/hello_world/README>`
:link: /examples/hello_world :link: examples/runtime/hello_world/README
:link-type: doc :link-type: doc
Demonstrates the basic concepts of Dynamo by creating a simple multi-service pipeline. Demonstrates the basic concepts of Dynamo by creating a simple GPU-unaware graph
.. grid-item-card:: :doc:`LLM Deployment </examples/llm_deployment>` .. grid-item-card:: :doc:`LLM Serving with VLLM <components/backends/vllm/README>`
:link: /examples/llm_deployment :link: components/backends/vllm/README
:link-type: doc :link-type: doc
Presents examples and reference implementations for deploying Large Language Models (LLMs) in various configurations. Presents examples and reference implementations for deploying Large Language Models (LLMs) in various configurations with VLLM.
.. grid-item-card:: :doc:`Multinode </examples/multinode>` .. grid-item-card:: :doc:`Multinode with SGLang <components/backends/sglang/docs/multinode-examples>`
:link: /examples/multinode :link: components/backends/sglang/docs/multinode-examples
:link-type: doc :link-type: doc
Demonstrates deployment for disaggregated serving on 3 nodes using `nvidia/Llama-3.1-405B-Instruct-FP8`. Demonstrates disaggregated serving on several nodes.
.. grid-item-card:: :doc:`TensorRT-LLM </examples/trtllm>` .. grid-item-card:: :doc:`TensorRT-LLM <components/backends/trtllm/README>`
:link: /examples/trtllm :link: components/backends/trtllm/README
:link-type: doc :link-type: doc
Presents TensorRT-LLM examples and reference implementations for deploying Large Language Models (LLMs) in various configurations. Presents TensorRT-LLM examples and reference implementations for deploying Large Language Models (LLMs) in various configurations.
...@@ -110,7 +110,7 @@ The examples below assume you build the latest image yourself from source. If us ...@@ -110,7 +110,7 @@ The examples below assume you build the latest image yourself from source. If us
Dynamo Deploy Quickstart <guides/dynamo_deploy/quickstart.md> Dynamo Deploy Quickstart <guides/dynamo_deploy/quickstart.md>
Dynamo Cloud Kubernetes Platform <guides/dynamo_deploy/dynamo_cloud.md> Dynamo Cloud Kubernetes Platform <guides/dynamo_deploy/dynamo_cloud.md>
Manual Helm Deployment <guides/dynamo_deploy/manual_helm_deployment.md> Manual Helm Deployment <deploy/helm/README.md>
GKE Setup Guide <guides/dynamo_deploy/gke_setup.md> GKE Setup Guide <guides/dynamo_deploy/gke_setup.md>
Minikube Setup Guide <guides/dynamo_deploy/minikube.md> Minikube Setup Guide <guides/dynamo_deploy/minikube.md>
Model Caching with Fluid <guides/dynamo_deploy/model_caching_with_fluid.md> Model Caching with Fluid <guides/dynamo_deploy/model_caching_with_fluid.md>
...@@ -126,22 +126,22 @@ The examples below assume you build the latest image yourself from source. If us ...@@ -126,22 +126,22 @@ The examples below assume you build the latest image yourself from source. If us
:hidden: :hidden:
:caption: API :caption: API
Python API <API/python_bindings.md>
NIXL Connect API <API/nixl_connect/README.md> NIXL Connect API <API/nixl_connect/README.md>
.. toctree:: .. toctree::
:hidden: :hidden:
:caption: Examples :caption: Examples
Aggregated and Disaggregated Deployment <examples/disagg_skeleton.md> Hello World <examples/runtime/hello_world/README.md>
LLM Deployment Examples <examples/llm_deployment.md> LLM Deployment Examples using VLLM <components/backends/vllm/README.md>
Multinode Examples <examples/multinode.md> Multinode Examples using SGLang <components/backends/sglang/docs/multinode-examples.md>
LLM Deployment Examples using TensorRT-LLM <examples/trtllm.md> LLM Deployment Examples using TensorRT-LLM <components/backends/trtllm/README.md>
.. toctree:: .. toctree::
:hidden: :hidden:
:caption: Reference :caption: Reference
Glossary <dynamo_glossary.md> Glossary <dynamo_glossary.md>
KVBM Reading <architecture/kvbm_reading.md> KVBM Reading <architecture/kvbm_reading.md>
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment