Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Qwen3.6_vllm
Commits
347262c4
Commit
347262c4
authored
Apr 18, 2026
by
raojy
💬
Browse files
Update README.md
parent
97223342
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
2 deletions
+4
-2
README.md
README.md
+4
-2
No files found.
README.md
View file @
347262c4
...
...
@@ -57,12 +57,14 @@ docker run -it \
#### 单机推理
```
bash
## serve启动
vllm serve
Qwen/Qwen3.6-35B-A3B
\
vllm serve
/public/home/raojy/project/model_code/qwen36
\
--port
8001
\
--trust-remote-code
\
--dtype
bfloat16
\
--tensor-parallel-size
4
\
--gpu-memory-utilization
0.925
--gpu-memory-utilization
0.925
\
--enable-auto-tool-choice
\
--tool-call-parser
qwen3_coder
## client访问
curl
-X
POST
"http://localhost:8001/v1/chat/completions"
-H
"Content-Type: application/json"
-d
'{
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment