Commit 3de75787 authored by luopl's avatar luopl
Browse files

Update README.md

parent d44ba1a0
...@@ -66,7 +66,6 @@ vLLM Deployment(vllm官方暂不支持AFD,只支持非分离模式部署): ...@@ -66,7 +66,6 @@ vLLM Deployment(vllm官方暂不支持AFD,只支持非分离模式部署):
``` ```
#head节点执行 #head节点执行
#Tensor Parallelism #Tensor Parallelism
#暂不支持attention data parallelism
VLLM_USE_NN=0 VLLM_USE_FLASH_ATTN_PA=0 vllm serve /path/to/step3 \ VLLM_USE_NN=0 VLLM_USE_FLASH_ATTN_PA=0 vllm serve /path/to/step3 \
--reasoning-parser step3 \ --reasoning-parser step3 \
...@@ -80,6 +79,7 @@ VLLM_USE_NN=0 VLLM_USE_FLASH_ATTN_PA=0 vllm serve /path/to/step3 \ ...@@ -80,6 +79,7 @@ VLLM_USE_NN=0 VLLM_USE_FLASH_ATTN_PA=0 vllm serve /path/to/step3 \
--port $PORT_SERVING --port $PORT_SERVING
``` ```
` 暂不支持attention data parallelism`
- Client Request Examples - Client Request Examples
``` ```
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment