- 29 Sep, 2025 2 commits
- 28 Sep, 2025 3 commits
- 26 Sep, 2025 12 commits
-
-
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
feat: moe_align_block_size: update the lightop interface and add EP support. See merge request dcutoolkit/deeplearing/vllm!216
-
jujl1 authored
-
-
zhuwenwen authored
-
zhuwenwen authored
add PD P do pp See merge request dcutoolkit/deeplearing/vllm!215
-
xiabo authored
-
zhuwenwen authored
[fix] pp+mtp bs 1 correctness See merge request dcutoolkit/deeplearing/vllm!214
-
zhuwenwen authored
add shared_output and routed_scaling_factor of CompressedTensorsW8A8Int8MoEMethod
-
lizhigong authored
-
- 25 Sep, 2025 4 commits
- 24 Sep, 2025 5 commits
- 23 Sep, 2025 9 commits
-
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
yangql authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
deepseek-r1-w4a8: mlp/moe call the silu-mul-quant fusion. See merge request dcutoolkit/deeplearing/vllm!209
-
zhuwenwen authored
Fix the P waiting issue. See merge request dcutoolkit/deeplearing/vllm!210
-
- 22 Sep, 2025 2 commits
- 20 Sep, 2025 1 commit
-
-
SAC_fanth authored
-
- 18 Sep, 2025 2 commits