Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
e661d5946a424d2396b719c582faebdfe6f7421a
Switch branch/tag
vllm_cscc
csrc
quantization
fp8
nvidia
quant_utils.cuh
05 Aug, 2024
1 commit
[CI/Build] Suppress divide-by-zero and missing return statement warnings (#7001)
· 6e4852ce
Tyler Michael Smith
authored
Aug 05, 2024
6e4852ce
30 Jul, 2024
1 commit
[Kernel] Squash a few more warnings (#6914)
· cbbc9044
Tyler Michael Smith
authored
Jul 30, 2024
cbbc9044
22 May, 2024
1 commit
[CI/Build] Enforce style for C++ and CUDA code with `clang-format` (#4722)
· 5f6d10c1
Michael Goin
authored
May 22, 2024
5f6d10c1
10 May, 2024
1 commit
[Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (#4535)
· c8331017
Cody Yu
authored
May 09, 2024
c8331017