OpenDAS
vllm_cscc
06d490282f2bab6922137eb5230be9df5ebbe9c4
vllm_cscc/csrc/quantization/fp4/nvfp4_quant_kernels.cu
[NVFP4][Perf] Tune NVFP4 input quant kernel for small batch size (#30897) · 06d49028
Michael Goin authored Dec 21, 2025
Signed-off-by: mgoin <mgoin64@gmail.com>
nvfp4_quant_kernels.cu
5.2 KB