OpenDAS / vllm_cscc
Commit: c83310174055bb124ea2197885b652efd59b7a0f
Path: csrc/quantization/fp8/nvidia/quant_utils.cuh
[Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (#4535)
Cody Yu authored May 09, 2024 · c8331017
quant_utils.cuh
18.6 KB