- 09 Oct, 2025 1 commit
-
-
zhuwenwen authored
lmcache support pd See merge request dcutoolkit/deeplearing/vllm!219
-
- 06 Oct, 2025 1 commit
-
-
yangshj1 authored
-
- 05 Oct, 2025 2 commits
- 30 Sep, 2025 1 commit
-
-
zhuwenwen authored
-
- 29 Sep, 2025 5 commits
- 26 Sep, 2025 12 commits
-
-
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
feat: moe_align_block_size 更新lightop 接口,加入对ep的支持 See merge request dcutoolkit/deeplearing/vllm!216
-
jujl1 authored
-
-
zhuwenwen authored
-
zhuwenwen authored
add PD P do pp See merge request dcutoolkit/deeplearing/vllm!215
-
xiabo authored
-
zhuwenwen authored
[fix] pp+mtp bs 1 correctness See merge request dcutoolkit/deeplearing/vllm!214
-
zhuwenwen authored
add shared_output and routed_scaling_factor of CompressedTensorsW8A8Int8MoEMethod
-
lizhigong authored
-
- 25 Sep, 2025 5 commits
- 24 Sep, 2025 7 commits
- 23 Sep, 2025 6 commits