Commit 6a231a76 authored by luopl's avatar luopl
Browse files

Update README.md

parent a5b16b63
Pipeline #2169 failed with stages
in 0 seconds
......@@ -227,7 +227,7 @@ print("Chat response:", chat_response)
### 单机多卡
```
python ./inference_vllm/multi_dcu_inference.py
HIP_VISIBLE_DEVICES=0,1,2,3 python ./inference_vllm/multi_dcu_inference.py
```
其中,MODEL_PATH为模型路径,tensor_parallel_size=4为使用卡数,messages为需要输入的内容。
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment