Update README.md

a3983b3d · laibao · 49753ddd · a3983b3d
Commit a3983b3d authored Dec 13, 2024 by laibao
Hide whitespace changes
Inline Side-by-side

Showing with 1 addition and 1 deletion

README.md README.md +1 -1

No files found.
--- a/README.md
+++ b/README.md
@@ -105,7 +105,7 @@ python benchmarks/benchmark_throughput.py --num-prompts 1 --model meta-llama/Lla
 其中`--num-prompts`是batch数，`--model`为模型路径，`--dataset`为使用的数据集，`-tp`为使用卡数，`dtype="float16"`为推理数据类型，如果模型权重是bfloat16,需要修改为float16推理。`-q gptq`为使用gptq量化模型进行推理。
-### api服务推理性能测试
+### openAI api服务推理性能测试
 1、启动服务端：
 ```bash
 python -m vllm.entrypoints.openai.api_server  --model meta-llama/Llama-2-7b-chat-hf  --dtype float16 --enforce-eager -tp 1