- 11 Feb, 2026 3 commits
- 10 Feb, 2026 6 commits
- 09 Feb, 2026 4 commits
- 06 Feb, 2026 2 commits
- 03 Feb, 2026 2 commits
- 29 Jan, 2026 1 commit
-
zhuwenwen authored
-
- 28 Jan, 2026 3 commits
- 27 Jan, 2026 5 commits
- 26 Jan, 2026 4 commits
- 23 Jan, 2026 4 commits
- 22 Jan, 2026 1 commit
-
王敏 authored
-
- 21 Jan, 2026 5 commits
-
zhuwenwen authored
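Add a VLLM_USE_FUSED_RMS_ROPE branch that takes the fused path; register torch.ops.vllm.rms_rotary_embedding_fuse (via direct_register_custom_op); convert cos_sin_cache to the target device/dtype automatically and cache the result, avoiding a repeated copy on every call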
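The caching idea in this commit message can be illustrated with a minimal, dependency-free sketch. It is not the code from the commit: `use_fused_rms_rope`, `ConvertedCache`, and the `convert` callback are hypothetical names, and the default of "1" for the flag is an assumption. In a real PyTorch setting the callback would presumably be something like `tensor.to(device=device, dtype=dtype)`.

```python
import os


def use_fused_rms_rope(env=None):
    """Return True when the fused path is enabled.

    Assumption: the flag is read as a string and defaults to "1".
    """
    env = os.environ if env is None else env
    return env.get("VLLM_USE_FUSED_RMS_ROPE", "1") == "1"


class ConvertedCache:
    """Memoize a device/dtype conversion so it runs once per target.

    `convert` is a hypothetical callback standing in for a tensor
    conversion; it receives (source, device, dtype).
    """

    def __init__(self, source, convert):
        self._source = source
        self._convert = convert
        self._converted = {}   # (device, dtype) -> converted object
        self.conversions = 0   # number of real conversions performed

    def get(self, device, dtype):
        key = (device, dtype)
        if key not in self._converted:
            # First request for this target: convert once and remember it.
            self._converted[key] = self._convert(self._source, device, dtype)
            self.conversions += 1
        return self._converted[key]
```

Keying the cache on the (device, dtype) pair means a second request for the same target returns the stored object instead of copying again, while a request for a different dtype still triggers its own one-time conversion.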
-
zhuwenwen authored
for qwen3, VLLM_USE_FUSED_RMS_ROPE=1 (default)
-
zhuwenwen authored
feat: Support fp8 channel-wise matmul. See merge request dcutoolkit/deeplearing/vllm!380
-
wanglong3 authored
-
zhuwenwen authored
V0.11.0 dev kvscale See merge request dcutoolkit/deeplearing/vllm!378
-