Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
laibao
Qwen2.5_vllm
Commits
cf627af9
Commit
cf627af9
authored
Oct 12, 2024
by
laibao
Browse files
Update README.md
parent
29507030
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
11 additions
and
13 deletions
+11
-13
README.md
README.md
+11
-13
No files found.
README.md
View file @
cf627af9
...
@@ -80,18 +80,16 @@ conda create -n qwen2.5_vllm python=3.10
...
@@ -80,18 +80,16 @@ conda create -n qwen2.5_vllm python=3.10
### 模型下载
### 模型下载
| 基座模型 | chat模型 | GPTQ模型 | AWQ模型 |
| 基座模型 | chat模型 | GPTQ模型 | AWQ模型 |
| -------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------
|
----------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------- |
| -------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------
--------------- | ----------------
----------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------
-------------
|
|
[
Qwen2.5 3B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-3B
)
|
[
Qwen2.5 3B Instruct
](
http://113.200.138.88:18080/aimodels/
Q
wen
-7B-Chat
)
|
[
Qwen2.5-3B-Instruct-GPTQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/qwen2.5-3b-instruct-gptq-int4
)
|
[
Qwen2.5-3B-Instruct-AWQ
](
http://113.200.138.88:18080/aimodels/qwen/qwen2.5-3b-instruct-awq
)
|
|
[
Qwen2.5 3B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-3B
)
|
[
Qwen2.5 3B Instruct
](
http://113.200.138.88:18080/aimodels/
q
wen
2.5-3b-instruct
)
|
[
Qwen2.5-3B-Instruct-GPTQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/qwen2.5-3b-instruct-gptq-int4
)
|
[
Qwen2.5-3B-Instruct-AWQ
](
http://113.200.138.88:18080/aimodels/qwen/qwen2.5-3b-instruct-awq
)
|
|
[
Qwen2.5-7B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-7B
)
|
[
Qwen2.5 7B Instruct
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-7B-Instruct
)
|
[
Qwen2.5-7B-Instruct-GPTQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/qwen2.5-7b-instruct-gptq-int4
)
|
[
Qwen-7B-
Chat
](
http://113.200.138.88:18080/aimodels/
Q
wen
-7B-Chat
)
|
|
[
Qwen2.5-7B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-7B
)
|
[
Qwen2.5 7B Instruct
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-7B-Instruct
)
|
[
Qwen2.5-7B-Instruct-GPTQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/qwen2.5-7b-instruct-gptq-int4
)
|
[
Qwen
2.5
-7B-
Instruct-AWQ
](
http://113.200.138.88:18080/aimodels/
q
wen
/qwen2.5-7b-instruct-awq
)
|
|
[
Qwen2.5-14B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-14B
)
|
[
Qwen-14B-
Cha
t
](
http://
113.200.138.88:18080/aimodels
/Qwen-14B-
Chat
)
|
[
Qwen-14B-
Cha
t-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen-14B-
Chat
-Int4
)
|
[
Qwen
-7B-Chat
](
http://
113.200.138.88:18080/aimodels/Qwen-7B-Chat
)
|
|
[
Qwen2.5-14B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-14B
)
|
[
Qwen
2.5
-14B-
Instruc
t
](
http
s
://
huggingface.co/Qwen
/Qwen
2.5
-14B-
Instruct
)
|
[
Qwen
2.5
-14B-
Instruc
t-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen
2.5
-14B-
Instruct-GPTQ
-Int4
)
|
[
Qwen
2.5-14B-Instruct-AWQ
](
http
s
://
huggingface.co/Qwen/Qwen2.5-14B-Instruct-AWQ
)
|
|
[
Qwen2.5-32B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-32B
)
|
[
Qwen
-72B-Cha
t
](
http://113.200.138.88:18080/aimodels/
Qwen-72B-Chat
)
|
[
Qwen
-72B-Cha
t-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen
-72B-Chat
-Int4
)
|
[
Qwen
-7B-Chat
](
http://
113.200.138.88:18080/aimodels/Qwen-7B-Chat
)
|
|
[
Qwen2.5-32B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-32B
)
|
[
Qwen
2.5-32B-Instruc
t
](
http://113.200.138.88:18080/aimodels/
qwen/Qwen2.5-32B-Instruct
)
|
[
Qwen
2.5-32B-Instruc
t-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen
2.5-32B-Instruct-GPTQ
-Int4
)
|
[
Qwen
2.5-32B-Instruct-AWQ
](
http
s
://
huggingface.co/Qwen/Qwen2.5-32B-Instruct-AWQ
)
|
|
[
Qwen2.5-72B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-72B
)
|
[
Qwen
1
.5-7B-
Cha
t
](
http
s
://
huggingface.co/Q
wen/Qwen
1
.5-7B-
Chat
)
|
[
Qwen
1
.5-7B-
Cha
t-GPTQ-Int4
](
http
s
://
huggingface.co/Q
wen/Qwen
1
.5-7B-
Cha
t-GPTQ-Int4
)
|
[
Qwen
1
.5-7B-
Chat-AWQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/Qwen
1
.5-7B-
Chat-AWQ
)
|
|
[
Qwen2.5-72B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-72B
)
|
[
Qwen
2
.5-7
2
B-
Instruc
t
](
http://
113.200.138.88:18080/aimodels/q
wen/Qwen
2
.5-7
2
B-
Instruct
)
|
[
Qwen
2
.5-7
2
B-
Instruc
t-GPTQ-Int4
](
http://
113.200.138.88:18080/aimodels/q
wen/Qwen
2
.5-7
2
B-
Instruc
t-GPTQ-Int4
)
|
[
Qwen
2
.5-7
2
B-
Instruct-AWQ
](
http://113.200.138.88:18080/aimodels/qwen/Qwen
2
.5-7
2
B-
Instruct-AWQ
)
|
|
[
Qwen2.5 Coder 1.5B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-Coder-1.5B
)
|
[
Qwen
1
.5-
14B-Cha
t
](
http://113.200.138.88:18080/aimodels/qwen/Qwen
1
.5-
14B-Chat
)
|
[
Qwen
1
.5-
14B-Cha
t-GPTQ-Int4
](
http
s
://
huggingface.co/Q
wen/Qwen
1
.5-
14B-Chat-GPTQ-Int4
)
|
[
Qwen
1
.5-
14B-Chat-AWQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/
Q
wen
1
.5-
14B-Chat-AWQ
)
|
|
[
Qwen2.5 Coder 1.5B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-Coder-1.5B
)
|
[
Qwen
2
.5-
Coder-1.5B-Instruc
t
](
http://113.200.138.88:18080/aimodels/qwen/Qwen
2
.5-
Coder-1.5B-Instruct
)
|
[
Qwen
2
.5-
Coder-1.5B-Instruc
t-GPTQ-Int4
](
http://
113.200.138.88:18080/aimodels/q
wen/Qwen
2
.5-
Coder-1.5B-Instruct-GPTQ-Int4
)
|
[
Qwen
2
.5-
Coder-1.5B-Instruct-AWQ
](
http://113.200.138.88:18080/aimodels/qwen/
q
wen
2
.5-
coder-1.5b-instruct-awq
)
|
|
[
Qwen2.5 Coder 7B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-Coder-7B
)
|
[
Qwen
1
.5
-32B-Chat
](
http://113.200.138.88:18080/aimodels/Qwen
1
.5-
32B-Chat
)
|
[
Qwen
1
.5
-32B-Chat-
GPTQ
-
Int4
](
http://113.200.138.88:18080/aimodels/Qwen
1
.5-
32B-Cha
t-GPTQ-Int4
)
|
[
Qwen
1
.5
-32B-Chat-AWQ-Int4
](
http
s
://
huggingface.co/Q
wen/Qwen
1
.5-
32B-Chat-AWQ
)
|
|
[
Qwen2.5 Coder 7B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-Coder-7B
)
|
[
Qwen
2
.5
Coder 7B
](
http://113.200.138.88:18080/aimodels/
qwen/
Qwen
2
.5-
Coder-7B
)
|
[
Qwen
2
.5
Coder 7B Instruct
GPTQ
Int4
](
http://113.200.138.88:18080/aimodels/
qwen/
Qwen
2
.5-
Coder-7B-Instruc
t-GPTQ-Int4
)
|
[
Qwen
2
.5
Coder 7B Instruct AWQ
](
http://
113.200.138.88:18080/aimodels/q
wen/Qwen
2
.5-
Coder-7B-Instruct-AWQ
)
|
|
[
Qwen2.5 Math 1.5B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-Math-1.5B
)
|
[
Qwen1.5-72B-Chat
](
http://113.200.138.88:18080/aimodels/Qwen1.5-72B-Chat
)
|
[
Qwen1.5-72B-Chat-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen1.5-72B-Chat-GPTQ-Int4
)
|
[
Qwen1.5-72B-Chat-AWQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/Qwen1.5-72B-Chat-AWQ
)
|
|
[
Qwen2.5 Math 1.5B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-Math-1.5B
)
|
[
Qwen1.5-72B-Chat
](
http://113.200.138.88:18080/aimodels/Qwen1.5-72B-Chat
)
|
[
Qwen1.5-72B-Chat-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen1.5-72B-Chat-GPTQ-Int4
)
|
[
Qwen1.5-72B-Chat-AWQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/Qwen1.5-72B-Chat-AWQ
)
|
|
[
Qwen2.5 Math 7B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-Math-7B
)
|
[
Qwen1.5-110B-Chat
](
http://113.200.138.88:18080/aimodels/Qwen1.5-110B-Chat
)
|
[
Qwen1.5-110B-Chat-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen1.5-110B-Chat-GPTQ-Int4
)
|
[
Qwen1.5-110B-Chat-AWQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/Qwen1.5-110B-Chat-AWQ
)
|
|
[
Qwen2.5 Math 7B
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2.5-Math-7B
)
|
[
Qwen1.5-110B-Chat
](
http://113.200.138.88:18080/aimodels/Qwen1.5-110B-Chat
)
|
[
Qwen1.5-110B-Chat-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen1.5-110B-Chat-GPTQ-Int4
)
|
[
Qwen1.5-110B-Chat-AWQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/Qwen1.5-110B-Chat-AWQ
)
|
|
[
Qwen2-7B
](
http://113.200.138.88:18080/aimodels/Qwen2-7B
)
|
[
Qwen2-7B-Instruct
](
http://113.200.138.88:18080/aimodels/Qwen2-7B-Instruct
)
|
[
Qwen2-7B-Instruct-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen2-7B-Instruct-GPTQ-Int4
)
|
[
Qwen2-7B-Instruct-AWQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2-7B-Instruct-AWQ
)
|
|
[
Qwen2-72B
](
http://113.200.138.88:18080/aimodels/Qwen2-72B
)
|
[
Qwen2-72B-Instruct
](
http://113.200.138.88:18080/aimodels/Qwen2-72B-Instruct
)
|
[
Qwen2-72B-Instruct-GPTQ-Int4
](
https://huggingface.co/Qwen/Qwen2-72B-Instruct-GPTQ-Int4
)
|
[
Qwen2-72B-Instruct-AWQ-Int4
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2-72B-Instruct-AWQ
)
|
### 离线批量推理
### 离线批量推理
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment