1. 22 Apr, 2026 1 commit
  2. 08 Apr, 2026 1 commit
  3. 02 Apr, 2026 1 commit
  4. 19 Mar, 2026 1 commit
  5. 18 Mar, 2026 1 commit
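      feat(moe): add LightOP moe_sum+mul+add fusion and plumb its parameters through · 0639678c
      laibao authored
        Add the environment variable VLLM_USE_LIGHTOP_MOE_SUM_MUL_ADD to toggle
        the fused sum+mul+add path.
        In DeepseekV2MoE, add a fused path that precomputes shared_output and passes iqis and routed_scaling_factor down.
        Extend the FusedMoE/SharedFusedMoE interfaces and the related custom ops to pass i_q/i_s/shared_output/routed_scaling_factor through uniformly.
        Adapt the Triton, Marlin W16A16, SlimQuant W4A8, CompressedTensors W8A8, and other implementations so that sum+mul+add can be completed inside the kernel.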
  6. 12 Mar, 2026 2 commits
  7. 07 Mar, 2026 1 commit
  8. 06 Mar, 2026 1 commit
  9. 05 Mar, 2026 1 commit
  10. 03 Mar, 2026 1 commit
  11. 02 Mar, 2026 1 commit
  12. 06 Feb, 2026 2 commits
  13. 03 Feb, 2026 1 commit
  14. 26 Jan, 2026 1 commit
  15. 21 Jan, 2026 1 commit
  16. 20 Jan, 2026 1 commit
  17. 16 Jan, 2026 1 commit
  18. 09 Jan, 2026 1 commit
  19. 07 Jan, 2026 4 commits
  20. 06 Jan, 2026 2 commits
  21. 24 Dec, 2025 2 commits
  22. 19 Dec, 2025 2 commits
  23. 18 Dec, 2025 1 commit
  24. 17 Dec, 2025 2 commits
  25. 12 Dec, 2025 1 commit
  26. 11 Dec, 2025 1 commit
  27. 08 Dec, 2025 1 commit
  28. 02 Dec, 2025 3 commits
  29. 30 Nov, 2025 1 commit
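
For readers unfamiliar with the fusion named in commit 0639678c (18 Mar, 2026), here is a minimal pure-Python reference sketch of what a sum+mul+add step could compute. It assumes the fused operation sums each token's top-k expert outputs, scales the sum by routed_scaling_factor, and adds shared_output. That reading is inferred from the commit message, not taken from the LightOP kernel. The helper names `fused_path_enabled` and `moe_sum_mul_add` are hypothetical, and so is the rule that the value "1" enables the flag.

```python
import os


def fused_path_enabled(env=None):
    """Return True when the fused path is requested.

    Assumption: the flag is enabled by the string "1"; the real parsing
    rule is not stated in the commit message.
    """
    env = os.environ if env is None else env
    return env.get("VLLM_USE_LIGHTOP_MOE_SUM_MUL_ADD", "0") == "1"


def moe_sum_mul_add(expert_out, routed_scaling_factor, shared_output):
    """Reference semantics (assumed): sum over top-k, scale, add shared output.

    expert_out:    nested list shaped [num_tokens][top_k][hidden]
    shared_output: nested list shaped [num_tokens][hidden]
    """
    result = []
    for token_experts, shared_row in zip(expert_out, shared_output):
        # Sum the top-k expert outputs element-wise along the hidden dimension.
        summed = [sum(column) for column in zip(*token_experts)]
        # Scale the routed sum and add the precomputed shared-expert output.
        result.append(
            [s * routed_scaling_factor + sh for s, sh in zip(summed, shared_row)]
        )
    return result
```

Doing the three steps in one kernel would avoid writing the intermediate sum and the scaled result back to memory, which is presumably why the commit passes shared_output and routed_scaling_factor down to the kernel implementations.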