- 16 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 14 Dec, 2025 1 commit
-
-
laibao authored
- 新增rms_rotary_embedding_fuse自定义操作 - 添加内核配置文件E=160,N=320 - 通过VLLM_USE_FUSED_RMS_ROPE环境变量控制融合路径
-
- 04 Dec, 2025 1 commit
-
-
zhuwenwen authored
add VLLM_USE_LIGHTOP_RMS_ROPE_CONCAT when use USE_FUSED_RMS_QUANT and USE_FUSED_CUSTOM_ALL_REDUCE_RMS_QUANT
-
- 02 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 01 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 26 Nov, 2025 1 commit
-
-
wujl5 authored
-
- 20 Nov, 2025 2 commits
- 18 Nov, 2025 1 commit
-
-
wujl5 authored
-
- 13 Nov, 2025 2 commits
- 12 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 04 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 24 Oct, 2025 3 commits
- 23 Oct, 2025 1 commit
-
-
zhuwenwen authored
fix the error in test_moe caused by moe align not supporting 511 multi-modal switching to torch implementation on z100l&k100
-
- 15 Oct, 2025 1 commit
-
-
yangql authored
-
- 13 Oct, 2025 2 commits
- 25 Sep, 2025 1 commit
-
-
zhuwenwen authored
[kernels] update moe_align_block_size and moe_sum interface
-
- 24 Sep, 2025 1 commit
-
-
zhuwenwen authored
[FIX] 修复mtp和VLLM_USE_TRITON_CAT不能一起开的bug
-
- 22 Sep, 2025 1 commit
-
-
wujl5 authored
-
- 14 Sep, 2025 1 commit
-
-
wujl5 authored
-
- 10 Sep, 2025 1 commit
-
-
zhuwenwen authored
-
- 09 Sep, 2025 2 commits
- 01 Sep, 2025 2 commits
- 29 Aug, 2025 1 commit
-
-
yangql authored
-
- 15 Aug, 2025 1 commit
-
-
王敏 authored
-
- 07 Aug, 2025 1 commit
-
-
王敏 authored
-
- 06 Aug, 2025 1 commit
-
-
zhuwenwen authored
This reverts merge request !169
-
- 05 Aug, 2025 1 commit
-
-
王敏 authored
-
- 04 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 01 Aug, 2025 3 commits
- 31 Jul, 2025 1 commit
-
-
zhuwenwen authored
-
- 30 Jul, 2025 1 commit
-
-
yangql authored
-