docs: Document that front-end needs access to model config files (#6327)

Signed-off-by: Graham King <grahamk@nvidia.com>

docs: Document that front-end needs access to model config files (#6327)
Signed-off-by: Graham King <grahamk@nvidia.com>
87c5c7bf · Graham King · GitHub · ce1a6f49 · 87c5c7bf
Unverified Commit 87c5c7bf authored Feb 17, 2026 by Graham King Committed by GitHub Feb 17, 2026
Hide whitespace changes
Inline Side-by-side

Showing with 7 additions and 1 deletion

docs/pages/components/frontend/README.md docs/pages/components/frontend/README.md +7 -1

No files found.
--- a/docs/pages/components/frontend/README.md
+++ b/docs/pages/components/frontend/README.md
@@ -34,7 +34,13 @@ The Dynamo Frontend is the API gateway for serving LLM inference requests. It pr
 python -m dynamo.frontend --http-port 8000
 ```
-This starts an OpenAI-compatible HTTP server with integrated preprocessing and routing. Backends are auto-discovered when they call `register_model`.
+This starts an OpenAI-compatible HTTP server with integrated pre/post processing and routing. Backends are auto-discovered when they call `register_model`.
+The frontend does the pre and post processing. To do this it will need access to the model configuration files: `config.json`, `tokenizer.json`, `tokenizer_config.json`, etc. It does not need the weights.
+Frontend will download the files it needs from Hugging Face, no setup is required. However we recommend setting up [modelexpress-server](https://github.com/ai-dynamo/modelexpress) and a shared folder such as a Kubernetes PVC. This ensures the model is only downloaded once across the whole cluster.
+If the model is not available on Hugging Face, such as a private or customized model, you will need to make the model files available locally at the same file path as on the backend. The backend's `--model-path <here>` will need to exist on the frontend and contain at least the configuration (JSON) files.
 ### KServe gRPC Frontend