Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
e519ae097ac4c126d3d8b2f3bb5ee54f696cf8bc
Switch branch/tag
vllm_cscc
vllm
model_executor
models
llama.py
Find file
Blame
History
Permalink
[ Misc ] non-uniform quantization via `compressed-tensors` for `Llama` (#6515)
· dbe55885
Robert Shaw
authored
Jul 18, 2024
dbe55885
llama.py
20.7 KB
Edit
Web IDE
Replace llama.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace llama.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.