Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
kecinstone
2024pra-vllm
Repository
01a5d18a537b65a156cfa1a77706693a24c869c1
Switch branch/tag
2024-pra-vllm
vllm
model_executor
layers
quantization
gptq.py
Find file
Blame
History
Permalink
Add Support for 2/3/8-bit GPTQ Quantization Models (#2330)
· 01a5d18a
CHU Tianxiang
authored
Feb 29, 2024
01a5d18a
gptq.py
7.25 KB
Edit
Web IDE
Replace gptq.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace gptq.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.