Commit f842f435 authored by raojy's avatar raojy 💬
Browse files

Update README.md

parent 347262c4
......@@ -57,7 +57,7 @@ docker run -it \
#### 单机推理
```bash
## serve启动
vllm serve /public/home/raojy/project/model_code/qwen36 \
vllm serve Qwen/Qwen3.6-35B-A3B \
--port 8001 \
--trust-remote-code \
--dtype bfloat16 \
......@@ -68,7 +68,7 @@ vllm serve /public/home/raojy/project/model_code/qwen36 \
## client访问
curl -X POST "http://localhost:8001/v1/chat/completions" -H "Content-Type: application/json" -d '{
"model": "/public/home/raojy/project/model_code/qwen36",
"model": "Qwen/Qwen3.6-35B-A3B",
"messages": [
{"role": "system", "content": "你是一个有用的助手。"},
{"role": "user", "content": "你好,请做一下简单的自我介绍。"}
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment