Commit 9186c857 authored by luopl

Update README.md

parent 945eec2a
......@@ -113,9 +113,9 @@ python ./inference/inference_vllm/Qwen1.5-14b_multi_dcu_inference.py
Here, `prompts` holds the prompt strings, `model` is the model path, and `tensor_parallel_size=4` is the number of cards used.
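As a rough illustration of how these three parameters fit together, the sketch below wires them into vLLM's `LLM` and `SamplingParams` classes. It is not the content of `Qwen1.5-14b_multi_dcu_inference.py`, which this diff does not show. The helper names (`build_engine_kwargs`, `format_result`, `run_inference`), the sampling values, and the `max_tokens` default are assumptions chosen for illustration.

```python
from typing import Any, Dict, List


def build_engine_kwargs(model_path: str, num_cards: int) -> Dict[str, Any]:
    """Collect the engine arguments described above (hypothetical helper)."""
    if num_cards < 1:
        raise ValueError("num_cards must be at least 1")
    # model: path to the model weights; tensor_parallel_size: number of cards.
    return {"model": model_path, "tensor_parallel_size": num_cards}


def format_result(prompt: str, generated_text: str) -> str:
    """Format one result in the 'Prompt: ..., Generated text: ...' style."""
    return f"Prompt: {prompt!r}, Generated text: {generated_text!r}"


def run_inference(
    prompts: List[str],
    model_path: str,
    num_cards: int = 4,
    max_tokens: int = 16,
) -> List[str]:
    """Generate one completion per prompt and return the formatted lines."""
    # Imported lazily so the helpers above stay usable without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(**build_engine_kwargs(model_path, num_cards))
    # Sampling values are illustrative assumptions, not taken from the README.
    sampling_params = SamplingParams(
        temperature=0.8, top_p=0.95, max_tokens=max_tokens
    )
    outputs = llm.generate(prompts, sampling_params)
    lines = [format_result(o.prompt, o.outputs[0].text) for o in outputs]
    for line in lines:
        print(line)
    return lines
```

On a machine with four cards, calling `run_inference(["The capital of France is"], model_path, num_cards=4)` with `model_path` set to a local Qwen1.5-14B checkpoint directory would print lines in the same `Prompt: ..., Generated text: ...` format as the result section. The generated text itself depends on the model and the sampling settings.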
## result
Accelerator cards used: 4 × DCU-K100-64G
```
Prompt: 'The capital of France is', Generated text: '______.(\u3000\u3000)\nA. New York\nB. Paris\n'
```
### Accuracy
Model: Qwen1.5-14B-Chat; data (LoRA finetune): alpaca_gpt4_zh; 4 cards; ZeRO-3 training.
......