- 22 Jan, 2026 1 commit
-
-
王敏 authored
-
- 16 Jan, 2026 1 commit
-
-
zhuwenwen authored
add VLLM_USE_FUSED_CACHE_QUANT_BMM_MLA to use fused rmsnorm + contiguous + rope(for dpsk-v3) + concat_and_cache_mla + q quant, control bmm(todo) + cat +mla (fp8)
-
- 15 Jan, 2026 1 commit
-
-
yangql authored
-
- 12 Jan, 2026 2 commits
- 08 Jan, 2026 1 commit
-
-
wanglong3 authored
-
- 07 Jan, 2026 1 commit
-
-
wujl5 authored
-
- 05 Jan, 2026 1 commit
-
-
wanglong3 authored
-
- 04 Jan, 2026 1 commit
-
-
wujl5 authored
-
- 22 Dec, 2025 3 commits
- 18 Dec, 2025 2 commits
- 17 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 16 Dec, 2025 2 commits
- 15 Dec, 2025 1 commit
-
-
王敏 authored
-
- 10 Dec, 2025 1 commit
-
-
王敏 authored
-
- 08 Dec, 2025 1 commit
-
-
王敏 authored
-
- 04 Dec, 2025 1 commit
-
-
zhuwenwen authored
add VLLM_USE_LIGHTOP_RMS_ROPE_CONCAT when use USE_FUSED_RMS_QUANT and USE_FUSED_CUSTOM_ALL_REDUCE_RMS_QUANT
-
- 02 Dec, 2025 2 commits
- 01 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 26 Nov, 2025 1 commit
-
-
wujl5 authored
-
- 20 Nov, 2025 2 commits
- 18 Nov, 2025 1 commit
-
-
wujl5 authored
-
- 07 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 04 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 01 Nov, 2025 1 commit
-
-
王敏 authored
-
- 29 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 27 Oct, 2025 1 commit
-
-
王敏 authored
-
- 24 Oct, 2025 2 commits
- 15 Oct, 2025 4 commits
- 13 Oct, 2025 1 commit
-
-
王敏 authored
-