Unverified Commit 09ccc9c8 authored by Hongxia Yang's avatar Hongxia Yang Committed by GitHub
Browse files

[Documentation][AMD] Add information about prebuilt ROCm vLLM docker for perf...


[Documentation][AMD] Add information about prebuilt ROCm vLLM docker for perf validation purpose (#12281)
Signed-off-by: default avatarHongxia Yang <hongxyan@amd.com>
parent 69196a9b
...@@ -13,6 +13,14 @@ vLLM supports AMD GPUs with ROCm 6.2. ...@@ -13,6 +13,14 @@ vLLM supports AMD GPUs with ROCm 6.2.
Currently, there are no pre-built ROCm wheels. Currently, there are no pre-built ROCm wheels.
However, the [AMD Infinity hub for vLLM](https://hub.docker.com/r/rocm/vllm/tags) offers a prebuilt, optimized
docker image designed for validating inference performance on the AMD Instinct™ MI300X accelerator.
```{tip}
Please check [LLM inference performance validation on AMD Instinct MI300X](https://rocm.docs.amd.com/en/latest/how-to/performance-validation/mi300x/vllm-benchmark.html)
for instructions on how to use this prebuilt docker image.
```
### Build wheel from source ### Build wheel from source
0. Install prerequisites (skip if you are already in an environment/docker with the following installed): 0. Install prerequisites (skip if you are already in an environment/docker with the following installed):
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment