Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
ModelZoo
Qwen3.5
Commits
bfb39f34
Commit
bfb39f34
authored
Jun 04, 2026
by
raojy
💬
Browse files
Update README.md
parent
09c6cf7c
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
3 deletions
+3
-3
README.md
README.md
+3
-3
No files found.
README.md
View file @
bfb39f34
...
...
@@ -182,7 +182,7 @@ curl http://localhost:8001/v1/chat/completions \
### SGLang
#### 单机推理
##### BF16
1.
serve启动,以
`Qwen
/Qwen3.6-27
B`
为例(此命令适用于非K100AI芯片)
1.
serve启动,以
`Qwen
3.5-35B-A3
B`
为例(此命令适用于非K100AI芯片)
```
bash
export
SGLANG_ENABLE_SPEC_V2
=
1
export
SGLANG_USE_FUSED_TOPK_SOFTMAX
=
1
...
...
@@ -202,7 +202,7 @@ sglang serve --model-path Qwen/Qwen3.5-35B-A3B \
--trust-remote-code
\
--chunked-prefill-size
-1
--context-length
8192
```
2.
serve启动,以
`Qwen
/Qwen3.6-27
B`
为例(此命令适用于K100AI芯片)
2.
serve启动,以
`Qwen
3.5-35B-A3
B`
为例(此命令适用于K100AI芯片)
```
bash
export
SGLANG_ENABLE_SPEC_V2
=
1
export
SGLANG_USE_FUSED_TOPK_SOFTMAX
=
1
...
...
@@ -230,7 +230,7 @@ sglang serve --model-path Qwen/Qwen3.5-35B-A3B \
curl http://localhost:8001/v1/chat/completions
\
-H
"Content-Type: application/json"
\
-d
'{
"model": "Qwen/Qwen3.
6-27B-FP8
",
"model": "Qwen/Qwen3.
5-35B-A3B
",
"messages": [
{"role": "user", "content": "Type \"I love Qwen3.5\" backwards"}
],
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment