Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Qwen3-Coder-Next_vllm
Commits
58ab51f3
Commit
58ab51f3
authored
Mar 31, 2026
by
raojy
💬
Browse files
Update README.md
parent
91544c88
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
4 deletions
+2
-4
README.md
README.md
+2
-4
No files found.
README.md
View file @
58ab51f3
...
@@ -76,9 +76,7 @@ vllm serve Qwen/Qwen3-Coder-Next \
...
@@ -76,9 +76,7 @@ vllm serve Qwen/Qwen3-Coder-Next \
--served-model-name
Qwen3-Coder-Next
\
--served-model-name
Qwen3-Coder-Next
\
--dtype
bfloat16
\
--dtype
bfloat16
\
--trust-remote-code
\
--trust-remote-code
\
--tensor-parallel-size
4
\
--tensor-parallel-size
8
\
--gpu-memory-utilization
0.95
\
--max-model-len
8192
\
--port
8000
--port
8000
## client访问
## client访问
...
@@ -110,7 +108,7 @@ curl http://localhost:8000/v1/chat/completions \
...
@@ -110,7 +108,7 @@ curl http://localhost:8000/v1/chat/completions \
|
**模型名称**
|
**权重大小**
|
**DCU型号**
|
**最低卡数需求**
|
**下载地址**
|
|
**模型名称**
|
**权重大小**
|
**DCU型号**
|
**最低卡数需求**
|
**下载地址**
|
| :------------------: | :----------: | :-----------: | :--------------: | :----------------------------------------------------------: |
| :------------------: | :----------: | :-----------: | :--------------: | :----------------------------------------------------------: |
| Qwen3-Coder-Next | 80B | BW1000 |
4
|
[
Hugging Face
](
https://huggingface.co/Qwen/Qwen3-Coder-Next
)
|
| Qwen3-Coder-Next | 80B | BW1000 |
8
|
[
Hugging Face
](
https://huggingface.co/Qwen/Qwen3-Coder-Next
)
|
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment