[ROCm] Effort to reduce the number of environment variables in command line (#17229)

Signed-off-by: Hongxia Yang <hongxia.yang@amd.com>

[ROCm] Effort to reduce the number of environment variables in command line (#17229)
Signed-off-by: Hongxia Yang <hongxia.yang@amd.com>
90d0a54c · Hongxia Yang · GitHub · 7a0a146c · 90d0a54c
Unverified Commit 90d0a54c authored May 01, 2025 by Hongxia Yang Committed by GitHub Apr 30, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 9 additions and 0 deletions

docker/Dockerfile.rocm docker/Dockerfile.rocm +9 -0

No files found.
--- a/docker/Dockerfile.rocm
+++ b/docker/Dockerfile.rocm
@@ -114,6 +114,15 @@ COPY --from=export_vllm /examples ${COMMON_WORKDIR}/vllm/examples
 ENV RAY_EXPERIMENTAL_NOSET_ROCR_VISIBLE_DEVICES=1
 ENV TOKENIZERS_PARALLELISM=false

+# ENV that can improve safe tensor loading, and end-to-end time
+ENV SAFETENSORS_FAST_GPU=1
+
+# User-friendly environment setting for multi-processing to avoid below RuntimeError.
+# RuntimeError: Cannot re-initialize CUDA in forked subprocess. To use CUDA with multiprocessing,
+# you must use the 'spawn' start method 
+# See https://pytorch.org/docs/stable/notes/multiprocessing.html#cuda-in-multiprocessing
+ENV VLLM_WORKER_MULTIPROC_METHOD=spawn
+
 # Performance environment variable.
 ENV HIP_FORCE_DEV_KERNARG=1