Unverified Commit 39d645e5 authored by Jonathan Tong's avatar Jonathan Tong Committed by GitHub
Browse files

docs: migrate Fern docs from fern/ into docs/ (#6206)


Signed-off-by: default avatarJont828 <jt572@cornell.edu>
parent d381e6ff
...@@ -37,8 +37,8 @@ docker compose -f deploy/docker-compose.yml up -d ...@@ -37,8 +37,8 @@ docker compose -f deploy/docker-compose.yml up -d
## Components ## Components
- [Frontend](/components/src/dynamo/frontend/README.md) - HTTP API endpoint that receives requests and forwards them to the decode worker - [Frontend](/components/src/dynamo/frontend/README.md) - HTTP API endpoint that receives requests and forwards them to the decode worker
- [vLLM Prefill Worker](/docs/backends/vllm/README.md) - Specialized worker for prefill phase execution - [vLLM Prefill Worker](/docs/pages/backends/vllm/README.md) - Specialized worker for prefill phase execution
- [vLLM Decode Worker](/docs/backends/vllm/README.md) - Specialized worker that handles requests and decides between local/remote prefill - [vLLM Decode Worker](/docs/pages/backends/vllm/README.md) - Specialized worker that handles requests and decides between local/remote prefill
```mermaid ```mermaid
--- ---
......
...@@ -4,8 +4,8 @@ This example demonstrates running Dynamo across multiple nodes with **KV-aware r ...@@ -4,8 +4,8 @@ This example demonstrates running Dynamo across multiple nodes with **KV-aware r
For more information about the core concepts, see: For more information about the core concepts, see:
- [Dynamo Disaggregated Serving](../../../docs/design_docs/disagg_serving.md) - [Dynamo Disaggregated Serving](../../../docs/pages/design-docs/disagg-serving.md)
- [KV Cache Routing](../../../docs/components/router/README.md) - [KV Cache Routing](../../../docs/pages/components/router/README.md)
## Architecture Overview ## Architecture Overview
...@@ -65,7 +65,7 @@ This is particularly beneficial for: ...@@ -65,7 +65,7 @@ This is particularly beneficial for:
- **Similar queries**: Common prefixes are computed once and reused - **Similar queries**: Common prefixes are computed once and reused
- **Batch processing**: Related requests can be routed to workers with shared context - **Batch processing**: Related requests can be routed to workers with shared context
For detailed technical information about how KV routing works, see the [Router Guide](../../../docs/components/router/router_guide.md). For detailed technical information about how KV routing works, see the [Router Guide](../../../docs/pages/components/router/router-guide.md).
## Prerequisites ## Prerequisites
...@@ -88,7 +88,7 @@ Install Dynamo with [SGLang](https://docs.sglang.io/) support: ...@@ -88,7 +88,7 @@ Install Dynamo with [SGLang](https://docs.sglang.io/) support:
pip install ai-dynamo[sglang] pip install ai-dynamo[sglang]
``` ```
For more information about the SGLang backend and its integration with Dynamo, see the [SGLang Backend Documentation](../../../docs/backends/sglang/README.md). For more information about the SGLang backend and its integration with Dynamo, see the [SGLang Backend Documentation](../../../docs/pages/backends/sglang/README.md).
### 3. Network Requirements ### 3. Network Requirements
...@@ -475,7 +475,7 @@ python -m dynamo.frontend \ ...@@ -475,7 +475,7 @@ python -m dynamo.frontend \
--router-temperature 0.0 # Temperature for probabilistic routing (0 = deterministic) --router-temperature 0.0 # Temperature for probabilistic routing (0 = deterministic)
``` ```
For more advanced configuration options including custom worker selection, block size tuning, and alternative indexing strategies, see the [Router Guide](../../../docs/components/router/router_guide.md). For more advanced configuration options including custom worker selection, block size tuning, and alternative indexing strategies, see the [Router Guide](../../../docs/pages/components/router/router-guide.md).
## Cleanup ## Cleanup
......
...@@ -17,4 +17,4 @@ limitations under the License. ...@@ -17,4 +17,4 @@ limitations under the License.
# Example: Multi-node TRTLLM Workers with Dynamo on Slurm # Example: Multi-node TRTLLM Workers with Dynamo on Slurm
See [here](/docs/backends/trtllm/multinode) for how to setup this example. See [here](/docs/pages/backends/trtllm/multinode) for how to setup this example.
...@@ -32,7 +32,7 @@ if [[ -z ${IMAGE} ]]; then ...@@ -32,7 +32,7 @@ if [[ -z ${IMAGE} ]]; then
echo "ERROR: You need to set the IMAGE environment variable to the " \ echo "ERROR: You need to set the IMAGE environment variable to the " \
"Dynamo+TRTLLM docker image or .sqsh file from 'enroot import' " \ "Dynamo+TRTLLM docker image or .sqsh file from 'enroot import' " \
"See how to build one from source here: " \ "See how to build one from source here: " \
"https://github.com/ai-dynamo/dynamo/tree/main/docs/backends/trtllm/README.md#build-container" "https://github.com/ai-dynamo/dynamo/tree/main/docs/pages/backends/trtllm/README.md#build-container"
exit 1 exit 1
fi fi
......
...@@ -35,7 +35,7 @@ if [[ -z ${IMAGE} ]]; then ...@@ -35,7 +35,7 @@ if [[ -z ${IMAGE} ]]; then
echo "ERROR: You need to set the IMAGE environment variable to the " \ echo "ERROR: You need to set the IMAGE environment variable to the " \
"Dynamo+TRTLLM docker image or .sqsh file from 'enroot import' " \ "Dynamo+TRTLLM docker image or .sqsh file from 'enroot import' " \
"See how to build one from source here: " \ "See how to build one from source here: " \
"https://github.com/ai-dynamo/dynamo/tree/main/docs/backends/trtllm/README.md#build-container" "https://github.com/ai-dynamo/dynamo/tree/main/docs/pages/backends/trtllm/README.md#build-container"
exit 1 exit 1
fi fi
......
...@@ -18,7 +18,7 @@ docker compose -f deploy/docker-compose.yml up -d ...@@ -18,7 +18,7 @@ docker compose -f deploy/docker-compose.yml up -d
## Components ## Components
- [Frontend](/components/src/dynamo/frontend/README.md) - A built-in component that launches an OpenAI compliant HTTP server, a pre-processor, and a router in a single process - [Frontend](/components/src/dynamo/frontend/README.md) - A built-in component that launches an OpenAI compliant HTTP server, a pre-processor, and a router in a single process
- [vLLM Backend](/docs/backends/vllm/README.md) - A built-in component that runs vLLM within the Dynamo runtime - [vLLM Backend](/docs/pages/backends/vllm/README.md) - A built-in component that runs vLLM within the Dynamo runtime
```mermaid ```mermaid
--- ---
......
...@@ -88,4 +88,4 @@ python3 client.py --middle ...@@ -88,4 +88,4 @@ python3 client.py --middle
- Both modes demonstrate the same cancellation behavior - Both modes demonstrate the same cancellation behavior
- The middle server shows how to properly forward context in proxy scenarios - The middle server shows how to properly forward context in proxy scenarios
For more details on the request cancellation architecture, refer to the [architecture documentation](../../../docs/fault_tolerance/request_cancellation.md). For more details on the request cancellation architecture, refer to the [architecture documentation](../../../docs/pages/fault-tolerance/request-cancellation.md).
...@@ -97,7 +97,7 @@ Hello star! ...@@ -97,7 +97,7 @@ Hello star!
Note that this a very simple degenerate example which does not demonstrate the standard Dynamo FrontEnd-Backend deployment. The hello-world client is not a web server, it is a one-off function which sends the predefined text "world,sun,moon,star" to the backend. The example is meant to show the HelloWorldWorker. As such you will only see the HelloWorldWorker pod in deployment. The client will run and exit and the pod will not be operational. Note that this a very simple degenerate example which does not demonstrate the standard Dynamo FrontEnd-Backend deployment. The hello-world client is not a web server, it is a one-off function which sends the predefined text "world,sun,moon,star" to the backend. The example is meant to show the HelloWorldWorker. As such you will only see the HelloWorldWorker pod in deployment. The client will run and exit and the pod will not be operational.
Follow the [Quickstart Guide](/docs/kubernetes/README.md) to install Dynamo Kubernetes Platform. Follow the [Quickstart Guide](/docs/pages/kubernetes/README.md) to install Dynamo Kubernetes Platform.
Then deploy to kubernetes using Then deploy to kubernetes using
```bash ```bash
......
...@@ -39,7 +39,7 @@ gpu-operator nvidia-driver-daemonset-xxxxx 1/1 ...@@ -39,7 +39,7 @@ gpu-operator nvidia-driver-daemonset-xxxxx 1/1
## Step 3: Deploy Dynamo Kubernetes Operator ## Step 3: Deploy Dynamo Kubernetes Operator
Follow the [Deploying Inference Graphs to Kubernetes](../../../docs/kubernetes/README.md) guide to install Dynamo on your AKS cluster. Follow the [Deploying Inference Graphs to Kubernetes](../../../docs/pages/kubernetes/README.md) guide to install Dynamo on your AKS cluster.
Validate that the Dynamo pods are running: Validate that the Dynamo pods are running:
...@@ -56,7 +56,7 @@ kubectl get pods -n dynamo-system ...@@ -56,7 +56,7 @@ kubectl get pods -n dynamo-system
## Step 4: Deploy and Test a Model ## Step 4: Deploy and Test a Model
Follow the [Deploy Model/Workflow](../../../docs/kubernetes/installation_guide.md#next-steps) guide to deploy and test a model on your AKS cluster. Follow the [Deploy Model/Workflow](../../../docs/pages/kubernetes/installation-guide.md#next-steps) guide to deploy and test a model on your AKS cluster.
## AKS Storage options for Model Caching and Runtime Data ## AKS Storage options for Model Caching and Runtime Data
......
...@@ -71,7 +71,7 @@ kubectl create secret generic hf-token-secret \ ...@@ -71,7 +71,7 @@ kubectl create secret generic hf-token-secret \
## Install Dynamo Kubernetes Platform ## Install Dynamo Kubernetes Platform
[See installation steps](/docs/kubernetes/installation_guide.md#overview) [See installation steps](/docs/pages/kubernetes/installation-guide.md#overview)
After installation, verify the installation: After installation, verify the installation:
......
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
File suppressed by a .gitattributes entry or the file's encoding is unsupported.
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment