- 18 Jun, 2025 2 commits
- 17 Jun, 2025 6 commits
-
-
zhuwenwen authored
V0.8.5 zero overhead See merge request dcutoolkit/deeplearing/vllm!140
-
zhuwenwen authored
[fix] Fix parallel decoding and MoE unit test issues See merge request dcutoolkit/deeplearing/vllm!141
-
王敏 authored
-
王敏 authored
[fix] 1. Fix parallel decoding unit tests for medusa, scorer, etc.; 2. Fix MoE kernel unit test issues and optimize the code; 3. Fix the test_compare_nonflashinfer_backend unit test issue in rejection_sampler
-
lizhigong authored
-
yangql authored
-
- 16 Jun, 2025 1 commit
-
-
yangql authored
-
- 13 Jun, 2025 10 commits
-
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
[fix] Fix parallel decoding integration and mtp-related unit test issues See merge request dcutoolkit/deeplearing/vllm!139
-
王敏 authored
-
王敏 authored
-
xuxz authored
-
zhuwenwen authored
[fix] Fix parallel decoding eagle- and mlp-related unit test issues See merge request dcutoolkit/deeplearing/vllm!138
-
王敏 authored
-
gaoqiong authored
-
zhuwenwen authored
-
- 12 Jun, 2025 10 commits
- 10 Jun, 2025 1 commit
-
-
zhuwenwen authored
-
- 09 Jun, 2025 3 commits
- 07 Jun, 2025 1 commit
-
-
yangql authored
-
- 06 Jun, 2025 4 commits
- 05 Jun, 2025 2 commits