Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Repository
e0917e6bd0fbbbbc8ba3db48ae26f07366ab9a0c
Switch branch/tag
sglang
sgl-kernel
csrc
gemm
per_token_quant_fp8.cu
Find file
Blame
History
Permalink
Remove vllm ops scaled fp8 quant and accelerate per token quant by 20-28% (#4215)
· e0917e6b
Stefan He
authored
Mar 12, 2025
Co-authored-by:
Stefan He
<
bhe@linkedin.com
>
e0917e6b
per_token_quant_fp8.cu
3.08 KB
Edit
Web IDE
Replace per_token_quant_fp8.cu
×
Attach a file by drag & drop or
click to upload
Commit message
Replace per_token_quant_fp8.cu
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.