Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
2d0a73a3
Commit
2d0a73a3
authored
Jul 24, 2024
by
zhuwenwen
Browse files
update gptq relative path
parent
0e7df36b
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
4 additions
and
7 deletions
+4
-7
README.md
README.md
+2
-5
csrc/quantization/gptq/setup.py
csrc/quantization/gptq/setup.py
+2
-2
No files found.
README.md
View file @
2d0a73a3
...
@@ -59,17 +59,14 @@ git clone http://developer.hpccube.com/codes/OpenDAS/vllm.git # 根据需要的
...
@@ -59,17 +59,14 @@ git clone http://developer.hpccube.com/codes/OpenDAS/vllm.git # 根据需要的
```
```
1. 编译whl包并安装
1. 编译whl包并安装
VLLM_INSTALL_PUNICA_KERNELS=1 python setup.py bdist_wheel
VLLM_INSTALL_PUNICA_KERNELS=1 python setup.py bdist_wheel
python csrc/quantization/gptq/setup.py bdist_wheel
cd dist
cd dist
pip install vllm*
pip install vllm*
cd csrc/quantization/gptq
python setup.py bdist_wheel
cd dist
pip install gptq_kernel
pip install gptq_kernel
2. 源码编译安装
2. 源码编译安装
VLLM_INSTALL_PUNICA_KERNELS=1 python3 setup.py install
VLLM_INSTALL_PUNICA_KERNELS=1 python3 setup.py install
cd csrc/quantization/gptq
python csrc/quantization/gptq/setup.py install
python setup.py install
```
```
#### 运行基础环境准备
#### 运行基础环境准备
...
...
csrc/quantization/gptq/setup.py
View file @
2d0a73a3
...
@@ -24,8 +24,8 @@ setup(
...
@@ -24,8 +24,8 @@ setup(
CUDAExtension
(
CUDAExtension
(
name
=
"gptq_kernels"
,
name
=
"gptq_kernels"
,
sources
=
[
sources
=
[
"
.
/torch_bindings.cpp"
,
"
csrc/quantization/gptq
/torch_bindings.cpp"
,
"
.
/q_gemm.cu"
,
"
csrc/quantization/gptq
/q_gemm.cu"
,
],
],
extra_compile_args
=
extra_compile_args
,
extra_compile_args
=
extra_compile_args
,
)
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment