Commit 2093b1aa authored by luopl's avatar luopl
Browse files

Update README.md

parent 705e2c5a
...@@ -72,14 +72,12 @@ pip install pycountry ...@@ -72,14 +72,12 @@ pip install pycountry
### vllm ### vllm
#### 单机推理 #### 单机推理
启动vllm server
```bash ```bash
## serve启动 ## serve启动
vllm serve stepfun-ai/Step-3.5-Flash \ vllm serve stepfun-ai/Step-3.5-Flash \
--port 8001 \ --port 8001 \
--tensor-parallel-size 16 \ --tensor-parallel-size 8 \
--distributed-executor-backend ray \
--enable-expert-parallel \ --enable-expert-parallel \
--disable-cascade-attn \ --disable-cascade-attn \
--reasoning-parser step3p5 \ --reasoning-parser step3p5 \
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment