"lib/discovery/src/vscode:/vscode.git/clone" did not exist on "538d3035219ab62ee82f948c40d6b2be2081f1ba"
Unverified Commit f0e1f226 authored by atchernych's avatar atchernych Committed by GitHub
Browse files

fix: fix recipe docs (#4472)


Signed-off-by: default avatarAnna Tchernych <atchernych@nvidia.com>
parent 2100f6aa
......@@ -176,12 +176,13 @@ For Llama-3-70B with vLLM (Aggregated), an example of integration with the Infer
Follow to Follow [Deploy Inference Gateway Section 2](../deploy/inference-gateway/README.md#2-deploy-inference-gateway) to install GAIE. Then apply manifests.
Update the containers.epp.image in the deployment file, i.e. llama-3-70b/vllm/agg/gaie/k8s-manifests/epp/deployment.yaml
This should be the same image you have used for your deployment.
```bash
export DEPLOY_PATH=llama-3-70b/vllm/agg/
#DEPLOY_PATH=<model>/<framework>/<mode>/
kubectl apply -R -f "$DEPLOY_PATH/gaie/k8s-manifests" -n "$NAMESPACE"
```
### DeepSeek-R1 on GB200 (Multi-node)
......
......@@ -38,7 +38,7 @@ spec:
containers:
- name: epp
image: nvcr.io/nvstaging/ai-dynamo/dynamo-frontend:0.7.0rc2-amd64
image: nvcr.io/nvidia/ai-dynamo/vllm-runtime:my-tag
imagePullPolicy: IfNotPresent
resources:
requests:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment