- 23 Oct, 2025 3 commits
- 21 Oct, 2025 2 commits
- 20 Oct, 2025 4 commits
- 17 Oct, 2025 2 commits
- 16 Oct, 2025 3 commits
- 15 Oct, 2025 6 commits
- 13 Oct, 2025 11 commits
- 11 Oct, 2025 6 commits
- 10 Oct, 2025 3 commits
set USE_FUSED_RMS_QUANT=1 and USE_FUSED_SILU_MUL_QUANT=1
fix the error in test_moe caused by moe align not supporting 511 multi-modal switching to torch implementation on z100l&k100
修复awq 的mtp中的blockint8的问题 See merge request dcutoolkit/deeplearing/vllm!229
删除DPSK_FP16_QUICK,以及增加awq和blockwiseint8的shared_output接口 See merge request dcutoolkit/deeplearing/vllm!228
去掉all2all ep相关代码 See merge request dcutoolkit/deeplearing/vllm!226
fix pd send async perfomance See merge request dcutoolkit/deeplearing/vllm!224
support tbo and pd async send cache See merge request dcutoolkit/deeplearing/vllm!223