[Doc] Add vllm-metal to hardware plugin documentation (#31174)

Signed-off-by: mgoin <mgoin64@gmail.com>

[Doc] Add vllm-metal to hardware plugin documentation (#31174)
Signed-off-by: mgoin <mgoin64@gmail.com>
95863540 · Michael Goin · GitHub · b10f41c8 · 95863540 · 95863540
Unverified Commit 95863540 authored Dec 22, 2025 by Michael Goin Committed by GitHub Dec 22, 2025
Showing with 4 additions and 0 deletions

docs/getting_started/installation/README.md docs/getting_started/installation/README.md +1 -0

docs/getting_started/installation/cpu.apple.inc.md docs/getting_started/installation/cpu.apple.inc.md +3 -0

No files found.
--- a/docs/getting_started/installation/README.md
+++ b/docs/getting_started/installation/README.md
@@ -28,3 +28,4 @@ The backends below live **outside** the main `vllm` repository and follow the
 | Cambricon MLU | `vllm-mlu` | <https://github.com/Cambricon/vllm-mlu> |
 | Baidu Kunlun XPU | N/A, install from source | <https://github.com/baidu/vLLM-Kunlun> |
 | Sophgo TPU | N/A, install from source | <https://github.com/sophgo/vllm-tpu> |
+| Apple Silicon (Metal) | N/A, install from source | <https://github.com/vllm-project/vllm-metal> |
--- a/docs/getting_started/installation/cpu.apple.inc.md
+++ b/docs/getting_started/installation/cpu.apple.inc.md
@@ -4,6 +4,9 @@ vLLM has experimental support for macOS with Apple Silicon. For now, users must

 Currently the CPU implementation for macOS supports FP32 and FP16 datatypes.

+!!! tip "GPU-Accelerated Inference with vLLM-Metal"
+    For GPU-accelerated inference on Apple Silicon using Metal, check out [vllm-metal](https://github.com/vllm-project/vllm-metal), a community-maintained hardware plugin that uses MLX as the compute backend.
+
 # --8<-- [end:installation]
 # --8<-- [start:requirements]