- 12 Jun, 2025 1 commit
-
-
lizhigong authored
-
- 06 Jun, 2025 1 commit
-
-
lizhigong authored
-
- 05 Jun, 2025 2 commits
- 04 Jun, 2025 4 commits
- 03 Jun, 2025 2 commits
- 30 May, 2025 2 commits
- 29 May, 2025 8 commits
-
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
增加w8a8 线性gemm triton优化 See merge request dcutoolkit/deeplearing/vllm!128
-
zhuwenwen authored
-
lizhigong authored
-
gaoqiong authored
-
zhuwenwen authored
-
zhuwenwen authored
[feat]量化模型添加支持 moe_fused_gate kernel,并使用VLLM_ENABLE_MOE_FUSED_GATE环境变量控制开关,默认打开 See merge request dcutoolkit/deeplearing/vllm!127
-
- 28 May, 2025 8 commits
-
-
王敏 authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
修复0.8.5 config找不到的bug See merge request dcutoolkit/deeplearing/vllm!126
-
zhuwenwen authored
[feat]适配sgl moe_fused_gate kernel See merge request dcutoolkit/deeplearing/vllm!125
-
- 27 May, 2025 3 commits
- 26 May, 2025 9 commits
-
-
zhuwenwen authored
-
zhuwenwen authored
debug and fix tbo error in mtp See merge request dcutoolkit/deeplearing/vllm!124
-
lizhigong authored
-
zhuwenwen authored
fix tbo support deepseek mtp See merge request dcutoolkit/deeplearing/vllm!123
-
lizhigong authored
-
zhuwenwen authored
V0.8.5.post1 dev wm See merge request dcutoolkit/deeplearing/vllm!122
-
王敏 authored
-
王敏 authored
-
zhuwenwen authored
-