"git@developer.sourcefind.cn:jerrrrry/infinicore.git" did not exist on "6dccaed2fa8338d68e42e87634c712771f728c98"
Commit f842f435 authored by raojy's avatar raojy 💬
Browse files

Update README.md

parent 347262c4
...@@ -57,7 +57,7 @@ docker run -it \ ...@@ -57,7 +57,7 @@ docker run -it \
#### 单机推理 #### 单机推理
```bash ```bash
## serve启动 ## serve启动
vllm serve /public/home/raojy/project/model_code/qwen36 \ vllm serve Qwen/Qwen3.6-35B-A3B \
--port 8001 \ --port 8001 \
--trust-remote-code \ --trust-remote-code \
--dtype bfloat16 \ --dtype bfloat16 \
...@@ -68,7 +68,7 @@ vllm serve /public/home/raojy/project/model_code/qwen36 \ ...@@ -68,7 +68,7 @@ vllm serve /public/home/raojy/project/model_code/qwen36 \
## client访问 ## client访问
curl -X POST "http://localhost:8001/v1/chat/completions" -H "Content-Type: application/json" -d '{ curl -X POST "http://localhost:8001/v1/chat/completions" -H "Content-Type: application/json" -d '{
"model": "/public/home/raojy/project/model_code/qwen36", "model": "Qwen/Qwen3.6-35B-A3B",
"messages": [ "messages": [
{"role": "system", "content": "你是一个有用的助手。"}, {"role": "system", "content": "你是一个有用的助手。"},
{"role": "user", "content": "你好,请做一下简单的自我介绍。"} {"role": "user", "content": "你好,请做一下简单的自我介绍。"}
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment