- 06 Jan, 2026 10 commits
- 05 Jan, 2026 4 commits
- 04 Jan, 2026 2 commits
- 25 Dec, 2025 2 commits
- 23 Dec, 2025 2 commits
- 22 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 20 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 19 Dec, 2025 2 commits
- 18 Dec, 2025 11 commits
-
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> (cherry picked from commit 19c58339)
-
Isotr0py authored
Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> (cherry picked from commit d2dc5dfc)
-
sarathc-cerebras authored
Signed-off-by:
sarathc-cerebras <sarath.chandran@cerebras.net> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> (cherry picked from commit 28d15ab5)
-
Yifan Qiao authored
Signed-off-by:
Yifan Qiao <yifanqiao@berkeley.edu> (cherry picked from commit 11a89cf9)
-
zhuwenwen authored
-
zhuwenwen authored
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> (cherry picked from commit 30bb19a7)
-
zhuwenwen authored
-
Varun Sundar Rabindranath authored
(cherry picked from commit e3fc374a)
-
Nicolò Lucchesi authored
Signed-off-by:
NickLucche <nlucches@redhat.com> (cherry picked from commit 9ca8cb38)
-
- 17 Dec, 2025 5 commits
-
-
Yan Ma authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> (cherry picked from commit 4f735bab)
-
Li, Jiang authored
Signed-off-by:
jiang1.li <jiang1.li@intel.com> Signed-off-by:
Li, Jiang <bigpyj64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> (cherry picked from commit 0cd53536)
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk> (cherry picked from commit 00a8d762)
-
zhuwenwen authored
修复CompressedTensorsLinearMethod中的w4a16的冲突问题 feat(moe): add Marlin W16A16 fused MoE behind VLLM_USE_MARLIN_W16A16_MOE replace the fp8_mqa_logits and fp8_paged_mqa_logits interfaces in deepgemm with mqa_logits and paged_mqa_logits from lightop
-
TJian authored
Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> (cherry picked from commit 2410132b)
-