change
sglang
Commits
31da75abedcecda39a274b5efd96a8a1247d1537
sglang/sgl-kernel/benchmark/bench_per_token_group_quant_8bit.py
24 Mar, 2025
1 commit
[Quant Kernel] refactored per token group quant fp8 to support int8 up-to 2x faster (#4396)
65c24c28 · Chunan Zeng authored Mar 23, 2025
07 Mar, 2025
1 commit
[Refactor] Reducing code duplication across FP8 CUDA quantization kernels (#4163)
95085d65 · Stefan He authored Mar 06, 2025
19 Feb, 2025
1 commit
use warp shuffle style reduce and flashinfer vectorize (#3628)
55a7ec38 · Xiaoyu Zhang authored Feb 19, 2025
11 Feb, 2025
1 commit
optimize per token group quant fp8 (#3490)
bb418ced · Xiaoyu Zhang authored Feb 11, 2025