- 07 Nov, 2025 4 commits
- 06 Nov, 2025 5 commits
- 04 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 03 Nov, 2025 7 commits
- 01 Nov, 2025 2 commits
- 31 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 29 Oct, 2025 4 commits
-
-
zhuwenwen authored
# The first IP is the first node of D, and the second IP is the second node of D. Configuration: export IP_CONFIG_FILE=/data/xiabo/w4a8_1/ip_config.txt 192.168.1.1 192.168.1.100 192.168.1.2 192.168.1.101 192.168.1.3 192.168.1.102 10.16.1.75 10.16.1.76
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
Fix the decode side hanging during dataset inference when asynchronous sending is enabled with PD separation. See merge request dcutoolkit/deeplearing/vllm!235
-
- 28 Oct, 2025 1 commit
-
-
maxiao1 authored
-
- 27 Oct, 2025 2 commits
- 24 Oct, 2025 1 commit
-
-
zhuwenwen authored
support prefix cache on kme; fix the error in test_moe caused by moe align not supporting 511 and 211; multi-modal switching to torch implementation on z100l & k100
-
- 21 Oct, 2025 1 commit
-
-
王敏 authored
# Conflicts:
#	vllm/model_executor/layers/fused_moe/ep_moe/layer.py
-
- 20 Oct, 2025 2 commits
- 17 Oct, 2025 3 commits
- 16 Oct, 2025 2 commits
- 15 Oct, 2025 4 commits