Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
fb16eea48b4e0ac74502d1b55fb7307af2135a69
Switch branch/tag
vllm_cscc
vllm
model_executor
models
mllama.py
Find file
Blame
History
Permalink
[Bugfix] Revert QKVCrossParallelLinear usage in Mllama to keep BNB quantization work (#14498)
· fb16eea4
Isotr0py
authored
Mar 09, 2025
Signed-off-by:
Isotr0py
<
2037008807@qq.com
>
fb16eea4
mllama.py
62.9 KB
Edit
Web IDE
Replace mllama.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace mllama.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.