- 24 Oct, 2025 1 commit
-
-
zhuwenwen authored
support prefix cache on kme; fix the error in test_moe caused by moe align not supporting 511 and 211; multi-modal switching to torch implementation on z100l&k100
-
- 20 Oct, 2025 2 commits
- 17 Oct, 2025 3 commits
- 16 Oct, 2025 2 commits
- 15 Oct, 2025 11 commits
-
-
yangql authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
Remove DPSK_FP16_QUICK and add shared_output interfaces for awq and blockwiseint8. See merge request dcutoolkit/deeplearing/vllm!228
-
yangql authored
-
zhuwenwen authored
Remove DPSK_FP16_QUICK and add shared_output interfaces for awq and blockwiseint8. See merge request dcutoolkit/deeplearing/vllm!227
-
zhuwenwen authored
-
zhuwenwen authored
-
yangql authored
-
zhuwenwen authored
-
- 14 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 13 Oct, 2025 14 commits
-
-
zhuwenwen authored
-
zhuwenwen authored
Remove code related to all2all ep. See merge request dcutoolkit/deeplearing/vllm!226
-
王敏 authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
Merge branch 'v0.9.2-dev-ds' of ssh://10.16.6.30:10022/dcutoolkit/deeplearing/vllm into v0.9.2-dev-ds
-
-
zhuwenwen authored
[fix] Fix the vmfault reported when dp is enabled and ep is not enabled. See merge request dcutoolkit/deeplearing/vllm!225
-
- 12 Oct, 2025 4 commits
- 11 Oct, 2025 2 commits