Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
2cd402e1692417b7645e4ece11bc2ab91072f47c
Switch branch/tag
vllm_cscc
vllm
model_executor
layers
linear.py
Find file
Blame
History
Permalink
[ Bugfix ] Enabling Loading Models With Fused QKV/MLP on Disk with FP8 (#5921)
· 2cd402e1
Robert Shaw
authored
Jun 28, 2024
Co-authored-by:
Robert Shaw
<
rshaw@neuralmagic
>
2cd402e1
linear.py
34.7 KB
Edit
Web IDE
Replace linear.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace linear.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.