- 16 Jan, 2026 1 commit
-
-
zhuwenwen authored
add VLLM_USE_FUSED_CACHE_QUANT_BMM_MLA to use fused rmsnorm + contiguous + rope(for dpsk-v3) + concat_and_cache_mla + q quant, control bmm(todo) + cat +mla (fp8)
-
- 23 Dec, 2025 1 commit
-
-
zhuwenwen authored
update VLLM_USE_LIGHTOP_RMS_ROPE_CONCAT impl
-
- 19 Dec, 2025 2 commits
- 18 Dec, 2025 2 commits
- 17 Dec, 2025 4 commits
- 16 Dec, 2025 2 commits
- 07 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 01 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 23 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 19 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 13 Nov, 2025 2 commits
- 12 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 03 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 31 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 29 Oct, 2025 2 commits
- 17 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 16 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 14 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 13 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 10 Oct, 2025 3 commits
- 09 Oct, 2025 1 commit
-
-
zhuwenwen authored
This reverts merge request !219
-
- 28 Sep, 2025 1 commit
-
-
yangql authored
-
- 24 Sep, 2025 2 commits
- 23 Sep, 2025 1 commit
-
-
yangql authored
-
- 17 Sep, 2025 1 commit
-
-
lizhigong authored
-
- 16 Sep, 2025 1 commit
-
-
yangshj1 authored
-
- 13 Sep, 2025 1 commit
-
-
zhuwenwen authored
-
- 11 Sep, 2025 1 commit
-
-
zhuwenwen authored
-
- 10 Sep, 2025 1 commit
-
-
zhuwenwen authored
-