Commit 6a231a76 authored by luopl's avatar luopl
Browse files

Update README.md

parent a5b16b63
Pipeline #2169 failed with stages
in 0 seconds
...@@ -227,7 +227,7 @@ print("Chat response:", chat_response) ...@@ -227,7 +227,7 @@ print("Chat response:", chat_response)
### 单机多卡 ### 单机多卡
``` ```
python ./inference_vllm/multi_dcu_inference.py HIP_VISIBLE_DEVICES=0,1,2,3 python ./inference_vllm/multi_dcu_inference.py
``` ```
其中,MODEL_PATH为模型路径,tensor_parallel_size=4为使用卡数,messages为需要输入的内容。 其中,MODEL_PATH为模型路径,tensor_parallel_size=4为使用卡数,messages为需要输入的内容。
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment