Commit a8ff88a6 authored by zhuwenwen's avatar zhuwenwen
Browse files

update readme

parent 737a1a31
...@@ -111,7 +111,7 @@ python benchmarks/benchmark_throughput.py --num-prompts 1 --model Qwen/Qwen1.5-7 ...@@ -111,7 +111,7 @@ python benchmarks/benchmark_throughput.py --num-prompts 1 --model Qwen/Qwen1.5-7
### api服务推理性能测试 ### api服务推理性能测试
1、启动服务端: 1、启动服务端:
```bash ```bash
python -m vllm.entrypoints.api_server --model Qwen/Qwen1.5-7B-Chat --dtype float16 --enforce-eager -tp 1 python -m vllm.entrypoints.openai.api_server --model Qwen/Qwen1.5-7B-Chat --dtype float16 --enforce-eager -tp 1
``` ```
2、启动客户端: 2、启动客户端:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment