Commits for sglang/sgl-kernel/benchmark/bench_per_token_quant_fp8.py at 3c9740d2004fff7801433efa88e2dcfdd828bc0a
12 Apr, 2025
1 commit
update variable naming and comments for rocm (#5299) · 3c9740d2 · Zhaoyi Li authored Apr 12, 2025
13 Mar, 2025
2 commits
fix accuracy issue (#4376) · 2937387a · Yineng Zhang authored Mar 13, 2025
Fix per token fp8 quant precision (#4362) · 4068e012 · Qingquan Song authored Mar 12, 2025
07 Mar, 2025
3 commits
[Refactor] Reducing code duplication across FP8 CUDA quantization kernels (#4163) · 95085d65 · Stefan He authored Mar 06, 2025
Add sgl_per_token_quant_fp8 (#4089) · 63ee26d1 · Stefan He authored Mar 06, 2025
[quant kernel] sgl-kernel support per_tensor_quant fp8 (#3786) · ad55f171 · Xiaoyu Zhang authored Mar 07, 2025