- 12 Mar, 2026 1 commit
-
-
wanghl6 authored
-
- 09 Mar, 2026 1 commit
-
-
wanghl6 authored
-
- 11 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 06 Feb, 2026 2 commits
- 28 Jan, 2026 2 commits
- 27 Jan, 2026 2 commits
- 26 Jan, 2026 1 commit
-
-
zhuwenwen authored
-
- 17 Jan, 2026 1 commit
-
-
zhuwenwen authored
-
- 16 Jan, 2026 1 commit
-
-
zhuwenwen authored
add VLLM_USE_FUSED_CACHE_QUANT_BMM_MLA to use fused rmsnorm + contiguous + rope(for dpsk-v3) + concat_and_cache_mla + q quant, control bmm(todo) + cat +mla (fp8)
-
- 26 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 23 Dec, 2025 1 commit
-
-
zhuwenwen authored
update VLLM_USE_LIGHTOP_RMS_ROPE_CONCAT impl
-
- 17 Dec, 2025 2 commits
- 16 Dec, 2025 2 commits
- 05 Dec, 2025 1 commit
-
-
zhuwenwen authored
set VLLM_REJECT_SAMPLE_OPT=1 for dpsk-v3
-
- 01 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 10 Nov, 2025 2 commits
- 08 Nov, 2025 2 commits
- 06 Nov, 2025 2 commits
- 26 Sep, 2025 1 commit
-
-
zhuwenwen authored
-
- 24 Sep, 2025 1 commit
-
-
zhuwenwen authored
[FIX] 修复mtp和VLLM_USE_TRITON_CAT不能一起开的bug
-
- 23 Sep, 2025 1 commit
-
-
yangql authored
-
- 16 Sep, 2025 1 commit
-
-
王敏 authored
-
- 13 Sep, 2025 1 commit
-
-
zhuwenwen authored
update the default values of VLLM_USE_TRITON_CAT and VLLM_USE_LIGHT_OP to True
-
- 10 Sep, 2025 3 commits
- 09 Sep, 2025 2 commits
- 15 Aug, 2025 2 commits
- 13 Aug, 2025 1 commit
-
-
王敏 authored
-
- 10 Aug, 2025 1 commit
-
-
王敏 authored
-