- 19 Nov, 2025 3 commits
- 18 Nov, 2025 2 commits
- 17 Nov, 2025 2 commits
- 15 Nov, 2025 2 commits
- 14 Nov, 2025 11 commits
-
-
lizhigong authored
使用groupgemm完成高吞吐模式适配。 See merge request OpenDAS/sglang!24
-
yiqa authored
-
yiqa authored
-
yiqa authored
-
yiqa authored
# Conflicts: # python/sglang/srt/layers/quantization/slimquant_w4a8_marlin.py
-
yiqa authored
-
yiqa authored
-
maxiao1 authored
适配w8a8_marlin See merge request OpenDAS/sglang!23
-
maxiao1 authored
-
maxiao1 authored
算子融合 See merge request OpenDAS/sglang!22
-
maxiao1 authored
-
- 13 Nov, 2025 3 commits
- 12 Nov, 2025 5 commits
- 11 Nov, 2025 6 commits
- 10 Nov, 2025 6 commits
-
-
lizhigong authored
-
lizhigong authored
-
maxiao1 authored
V0.5.4 dev liucong See merge request OpenDAS/sglang!16
-
maxiao1 authored
-
lizhigong authored
fix bug on flash attention use for chunkprefill and radix cache See merge request OpenDAS/sglang!14
-
lizhigong authored
添加环境变量SGLANG_USE_LIGHTOP 控制 lightop的融合rotaty_emb和moe_gated算子,默认禁用;修复RMSNorm:forward_hip中的错误逻辑 See merge request OpenDAS/sglang!13
-