Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
5f6d10c14c17122e6d711a4829ee0ca672e07f6f
Switch branch/tag
vllm_cscc
csrc
quantization
fp8
nvidia
quant_utils.cuh
22 May, 2024
1 commit
[CI/Build] Enforce style for C++ and CUDA code with `clang-format` (#4722)
· 5f6d10c1
Michael Goin
authored
May 22, 2024
5f6d10c1
10 May, 2024
1 commit
[Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (#4535)
· c8331017
Cody Yu
authored
May 09, 2024
c8331017