- 10 Feb, 2026 1 commit
-
-
zhuwenwen authored
[opt] 优化epsp代码, 零消耗添加epsp update VLLM_USE_FUSED_RMS_ROPE=0 (default). for qwen3, VLLM_USE_FUSED_RMS_ROPE=1 (default) feat(moe/marlin): Marlin W16A16 MoE 自动探测并预打包(去掉手动开关) perf(qwen3): 融合 q/k RMSNorm + RoPE fused_moe_fp8接入lmslim
-
- 04 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 02 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 12 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 09 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 07 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 04 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 03 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 31 Oct, 2025 2 commits
- 24 Oct, 2025 1 commit
-
-
zhuwenwen authored
support prefix cache on kme fix the error in test_moe caused by moe align not supporting 511 and 211 multi-modal switching to torch implementation on z100l&k100
-
- 23 Oct, 2025 1 commit
-
-
zhuwenwen authored
fix the error in test_moe caused by moe align not supporting 511 multi-modal switching to torch implementation on z100l&k100
-
- 13 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 28 Sep, 2025 1 commit
-
-
yangql authored
-
- 21 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 11 Aug, 2025 1 commit
-
-
xiabo authored
-
- 09 Aug, 2025 1 commit
-
- 07 Aug, 2025 1 commit
-
-
xiabo authored
-
- 29 Jul, 2025 1 commit
-
-
jujl1 authored
-
- 28 Jul, 2025 1 commit
-
-
gaoqiong authored
-
- 04 Jul, 2025 1 commit
-
-
Michael Goin authored
-
- 03 Jul, 2025 1 commit
-
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
- 02 Jul, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-