- 08 May, 2026 3 commits
- 06 May, 2026 1 commit
  - 王敏 authored
    Conflicts: vllm/v1/worker/gpu_model_runner.py
- 24 Apr, 2026 3 commits
  - flyingdown authored
  - chenhw5 authored
  - yangql authored
- 23 Apr, 2026 3 commits
- 22 Apr, 2026 4 commits
- 20 Apr, 2026 1 commit
  - zhangyh15 authored
- 18 Apr, 2026 5 commits
  - 王敏 authored
  - zhangzbb authored
    [DOCs] add vllm v0.15.1 dcu use readme.md file including introduction, supported models, installation, PD, and EP usage
  - zhangzbb authored
    [DOC] add vllm v0.15.1 dcu use readme.md file including introduction, supported models, installation, PD, and EP usage
  - wanglong3 authored
  - wangmin6 authored
- 17 Apr, 2026 1 commit
  - 王敏 authored
- 16 Apr, 2026 1 commit
  - chenhw5 authored
- 15 Apr, 2026 1 commit
  - wanghl6 authored
- 11 Apr, 2026 1 commit
  - laibao authored
- 10 Apr, 2026 4 commits
- 08 Apr, 2026 2 commits
- 03 Apr, 2026 2 commits
- 02 Apr, 2026 2 commits
- 01 Apr, 2026 5 commits
- 30 Mar, 2026 1 commit
  - zhangzbb authored
    [BUGFIX] Fix the incorrect epsilon parameter order of the fused RMS RoPE in Qwen3-MoE Attention. See merge request dcutoolkit/deeplearing/vllm!539