[Doc] Fix issues in integrations/llamastack.md (#24428)

Signed-off-by: windsonsea <haifeng.yao@daocloud.io>

[Doc] Fix issues in integrations/llamastack.md (#24428)
Signed-off-by: windsonsea <haifeng.yao@daocloud.io>
c2a8b08f · Michael Yao · GitHub · f4962a6d · c2a8b08f
Unverified Commit c2a8b08f authored Sep 08, 2025 by Michael Yao Committed by GitHub Sep 08, 2025
Show whitespace changes
Inline Side-by-side

Showing with 7 additions and 7 deletions

docs/deployment/integrations/llamastack.md docs/deployment/integrations/llamastack.md +7 -7

No files found.
--- a/docs/deployment/integrations/llamastack.md
+++ b/docs/deployment/integrations/llamastack.md
 # Llama Stack

-vLLM is also available via [Llama Stack](https://github.com/meta-llama/llama-stack) .
+vLLM is also available via [Llama Stack](https://github.com/llamastack/llama-stack).

 To install Llama Stack, run

@@ -8,9 +8,9 @@ To install Llama Stack, run
 pip install llama-stack -q
 ```

-## Inference using OpenAI Compatible API
+## Inference using OpenAI-Compatible API

-Then start Llama Stack server pointing to your vLLM server with the following configuration:
+Then start the Llama Stack server and configure it to point to your vLLM server with the following settings:

 ```yaml
 inference:
@@ -20,15 +20,15 @@ inference:
      url: http://127.0.0.1:8000
 ```

-Please refer to [this guide](https://llama-stack.readthedocs.io/en/latest/distributions/self_hosted_distro/remote-vllm.html) for more details on this remote vLLM provider.
+Please refer to [this guide](https://llama-stack.readthedocs.io/en/latest/providers/inference/remote_vllm.html) for more details on this remote vLLM provider.

-## Inference via Embedded vLLM
+## Inference using Embedded vLLM

-An [inline vLLM provider](https://github.com/meta-llama/llama-stack/tree/main/llama_stack/providers/inline/inference/vllm)
+An [inline provider](https://github.com/llamastack/llama-stack/tree/main/llama_stack/providers/inline/inference)
 is also available. This is a sample of configuration using that method:

 ```yaml
-inference
+inference:
  - provider_type: vllm
    config:
      model: Llama3.1-8B-Instruct