- 26 Mar, 2026 1 commit
-
-
Jacob Platin authored
Signed-off-by:Jacob Platin <jacobplatin@google.com>
-
- 25 Mar, 2026 5 commits
-
-
Andreas Karatzas authored
Signed-off-by:
Andreas Karatzas <akaratza@amd.com> Signed-off-by:
Matthew Wong <Matthew.Wong2@amd.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
Matthew Wong <Matthew.Wong2@amd.com>
-
Yongye Zhu authored
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
Chauncey authored
[Revert] Remove CUDA torch fallbacks for fp8_mqa_logits/fp8_paged_mqa_logits_torch function (#37968) Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 24 Mar, 2026 3 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 23 Mar, 2026 9 commits
-
-
Ranran authored
Signed-off-by:
Ranran <1012869439@qq.com> Signed-off-by:
Ranran <hzz5361@psu.edu> Signed-off-by:
ran <hzz5361@psu.edu> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
yzong-rh authored
[Bug][MoE] Strengthen _supports_current_device() checks in the TRTLLM FP8, NVFP4, and FlashInfer CuteDSL MoE experts (#36728) Signed-off-by:Yifan Zong <yzong@redhat.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
Kunshang Ji authored
Signed-off-by:Zhu, Zufang <zufang.zhu@intel.com>
-
Chuan (Richard) Li authored
Signed-off-by:Li <chuali@amd.com>
-
Artem Perevedentsev authored
Signed-off-by:Artem Perevedentsev <aperevedents@nvidia.com>
-
Matthias Gehre authored
Signed-off-by:
Matthias Gehre <matthias.gehre@amd.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 22 Mar, 2026 3 commits
-
-
Yongye Zhu authored
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Robert Shaw authored
-
- 21 Mar, 2026 6 commits
-
-
Robert Shaw authored
Signed-off-by:Robert Shaw <robertgshaw2@gmail.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Robert Shaw authored
-
Robert Shaw authored
Merge awq.py and awq_marlin.py into a single file, eliminating the circular import between them. awq.py becomes a backward-compat shim. Follows the same structure as gptq_marlin.py. Co-authored-by: Claude Signed-off-by:Robert Shaw <robertgshaw2@gmail.com>
-
Chaitanya Sri Krishna Lolla authored
Signed-off-by:
Tej Kiran <vpolamre@amd.com> Co-authored-by:
Tej Kiran <vpolamre@amd.com>
-
Yongye Zhu authored
Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
- 20 Mar, 2026 6 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
L.B.R. authored
Signed-off-by:
L.B.R. <lbr@mmonad.com> Co-authored-by:
L.B.R. <lbr@mmonad.com>
-
xuebwang-amd authored
Signed-off-by:xuebwang-amd <xuebwang@amd.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
- 19 Mar, 2026 4 commits
-
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Wei Zhao authored
Signed-off-by:wzhao18 <wzhao18.sz@gmail.com>
-
Duyi-Wang authored
Signed-off-by:Duyi-Wang <duyi.wang@amd.com>
-
- 18 Mar, 2026 3 commits
-
-
Michael Goin authored
Signed-off-by:Michael Goin <mgoin64@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-