- 19 Nov, 2025 1 commit
-
-
zhuwenwen authored
deepseekv2-w4a8支持custom-rms-quant融合 See merge request dcutoolkit/deeplearing/vllm!259
-
- 18 Nov, 2025 1 commit
-
-
wujl5 authored
-
- 17 Nov, 2025 5 commits
- 14 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 13 Nov, 2025 5 commits
- 12 Nov, 2025 4 commits
- 11 Nov, 2025 2 commits
- 10 Nov, 2025 4 commits
- 09 Nov, 2025 2 commits
- 08 Nov, 2025 1 commit
-
-
王敏 authored
-
- 07 Nov, 2025 7 commits
-
-
zhuwenwen authored
-
zhuwenwen authored
-
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
fix:修复pp卡住问题 See merge request dcutoolkit/deeplearing/vllm!245
-
laibao authored
- 更新环境变量以控制流水线并行调度的最小注入。 - 从 Request 类中移除 num_output_placeholders,并调整 Scheduler 逻辑以使用新的最小注入功能。 - 增强 Scheduler,根据批次队列状态管理最小进度注入。
-
- 06 Nov, 2025 4 commits
- 05 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 04 Nov, 2025 2 commits