- 10 Feb, 2026 1 commit
zhuwenwen authored
[opt] Optimize the epsp code; add epsp with zero overhead
update: VLLM_USE_FUSED_RMS_ROPE=0 (default); for qwen3, VLLM_USE_FUSED_RMS_ROPE=1 (default)
feat(moe/marlin): Marlin W16A16 MoE auto-detection and pre-packing (manual switch removed)
perf(qwen3): fuse q/k RMSNorm + RoPE
fused_moe_fp8: integrate with lmslim
-
- 13 Oct, 2025 1 commit
zhuwenwen authored
-
- 28 Sep, 2025 2 commits
- 16 Jun, 2025 1 commit
Lu Fang authored
Signed-off-by: Lu Fang <lufang@fb.com>
-
- 12 Jun, 2025 1 commit
zhuwenwen authored
-
- 14 May, 2025 1 commit
xiabo authored
-
- 31 Mar, 2025 1 commit
zhuwenwen authored
-
- 15 Mar, 2025 1 commit
Lu Fang authored
Signed-off-by: Lu Fang <lufang@fb.com>
-
- 14 Mar, 2025 1 commit
Jeff Daily authored
Signed-off-by: Jeff Daily <jeff.daily@amd.com>
-
- 11 Mar, 2025 1 commit
Jeff Daily authored
Signed-off-by: Jeff Daily <jeff.daily@amd.com>
-
- 27 Feb, 2025 1 commit
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by: Hollow Man <hollowman@opensuse.org>
-
- 25 Feb, 2025 1 commit
Gregory Shtrasberg authored
-
- 30 Jul, 2024 1 commit
Tyler Michael Smith authored
-
- 22 May, 2024 1 commit
Michael Goin authored
-
- 10 May, 2024 1 commit
Cody Yu authored
-
- 03 Apr, 2024 1 commit
Adrian Abeyta authored
Co-authored-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
Co-authored-by: HaiShaw <hixiao@gmail.com>
Co-authored-by: AdrianAbeyta <Adrian.Abeyta@amd.com>
Co-authored-by: Matthew Wong <Matthew.Wong2@amd.com>
Co-authored-by: root <root@gt-pla-u18-08.pla.dcgpu>
Co-authored-by: mawong-amd <156021403+mawong-amd@users.noreply.github.com>
Co-authored-by: ttbachyinsda <ttbachyinsda@outlook.com>
Co-authored-by: guofangze <guofangze@kuaishou.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: jacobthebanana <50071502+jacobthebanana@users.noreply.github.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-