Commit 2093b1aa authored by luopl

Update README.md

parent 705e2c5a
......@@ -72,14 +72,12 @@ pip install pycountry
### vllm
#### Single-node inference
Start the vLLM server:
```bash
## Launch with vllm serve
vllm serve stepfun-ai/Step-3.5-Flash \
--port 8001 \
--tensor-parallel-size 16 \
--distributed-executor-backend ray \
--tensor-parallel-size 8 \
--enable-expert-parallel \
--disable-cascade-attn \
--reasoning-parser step3p5 \
......