- 12 Mar, 2026 1 commit
-
-
wanghl6 authored
-
- 09 Mar, 2026 1 commit
-
-
wanghl6 authored
-
- 11 Feb, 2026 2 commits
- 06 Feb, 2026 2 commits
- 28 Jan, 2026 2 commits
- 27 Jan, 2026 2 commits
- 26 Jan, 2026 1 commit
-
-
zhuwenwen authored
-
- 23 Jan, 2026 1 commit
-
-
zhuwenwen authored
-
- 17 Jan, 2026 1 commit
-
-
zhuwenwen authored
-
- 16 Jan, 2026 1 commit
-
-
zhuwenwen authored
add VLLM_USE_FUSED_CACHE_QUANT_BMM_MLA to use fused rmsnorm + contiguous + rope(for dpsk-v3) + concat_and_cache_mla + q quant, control bmm(todo) + cat +mla (fp8)
-
- 26 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 23 Dec, 2025 1 commit
-
-
zhuwenwen authored
update VLLM_USE_LIGHTOP_RMS_ROPE_CONCAT impl
-
- 19 Dec, 2025 2 commits
- 18 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 17 Dec, 2025 4 commits
- 16 Dec, 2025 2 commits
- 05 Dec, 2025 1 commit
-
-
zhuwenwen authored
set VLLM_REJECT_SAMPLE_OPT=1 for dpsk-v3
-
- 01 Dec, 2025 2 commits
- 27 Nov, 2025 4 commits
- 23 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 10 Nov, 2025 2 commits
- 08 Nov, 2025 2 commits
- 06 Nov, 2025 2 commits
- 13 Oct, 2025 1 commit
-
-
zhuwenwen authored
-