Project: change / sglang
Commit: 915140fd18c9ff4193e994e6d756ea762a52240a
File: python/sglang/srt/layers/quantization/modelopt_quant.py
[NVIDIA] Add Low Latency NVFP4 decode kernels from Flashinfer (#8552) · 915140fd
azhurkevich authored Aug 04, 2025
Co-authored-by: Cheng Wan <cwan@x.ai>
modelopt_quant.py (45.5 KB)