- 23 Mar, 2026 21 commits
-
-
yzong-rh authored
[Bug][MoE] Strengthen _supports_current_device() checks in the TRTLLM FP8, NVFP4, and FlashInfer CuteDSL MoE experts (#36728) Signed-off-by:Yifan Zong <yzong@redhat.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Matthew Bonanni authored
Signed-off-by:
zhuhaoran <zhuhaoran.zhr@alibaba-inc.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
zhuhaoran <zhuhaoran.zhr@alibaba-inc.com> Co-authored-by:
zhrrr <43847754+izhuhaoran@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk@inferact.ai> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
Angela Yi authored
Signed-off-by:angelayi <yiangela7@gmail.com>
-
Yufeng He authored
-
yanghui1-arch authored
Signed-off-by:dass90 <3053034939@qq.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
DorBernsohn authored
Signed-off-by:DorBernsohn <dor.bernsohn@gmail.com>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
Andrew Xia authored
Signed-off-by:Andrew Xia <axia@meta.com>
-
Kunshang Ji authored
Signed-off-by:Zhu, Zufang <zufang.zhu@intel.com>
-
Chuan (Richard) Li authored
Signed-off-by:Li <chuali@amd.com>
-
Artem Perevedentsev authored
Signed-off-by:Artem Perevedentsev <aperevedents@nvidia.com>
-
Hojin Yang authored
Signed-off-by:
effortprogrammer <yhjhoward7@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
r266-tech authored
Co-authored-by:r266-tech <r266-tech@users.noreply.github.com>
-
Matthias Gehre authored
Signed-off-by:
Matthias Gehre <matthias.gehre@amd.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Baorun (Lauren) Mu authored
Signed-off-by:Baorun Mu <bmu@nvidia.com>
-
Lasha Koroshinadze authored
Signed-off-by:Lasha <26011196+lashahub@users.noreply.github.com>
-
- 22 Mar, 2026 11 commits
-
-
zhanqiuhu authored
Signed-off-by:
Zhanqiu Hu <zh338@cornell.edu> Signed-off-by:
Woosuk Kwon <woosuk@inferact.ai> Co-authored-by:
Woosuk Kwon <woosuk@inferact.ai>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk@inferact.ai>
-
Yongye Zhu authored
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk@inferact.ai>
-
Netanel Haber authored
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Giancarlo Delfin authored
Signed-off-by:
Giancarlo Delfin <gdelfin@inferact.ai> Signed-off-by:
Woosuk Kwon <woosuk@inferact.ai> Co-authored-by:
Woosuk Kwon <woosuk@inferact.ai>
-
Robert Shaw authored
-
Yang Liu authored
Signed-off-by:Yang <lymailforjob@gmail.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 21 Mar, 2026 8 commits
-
-
Robert Shaw authored
Signed-off-by:Robert Shaw <robertgshaw2@gmail.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Robert Shaw authored
-
Robert Shaw authored
Merge awq.py and awq_marlin.py into a single file, eliminating the circular import between them. awq.py becomes a backward-compat shim. Follows the same structure as gptq_marlin.py. Co-authored-by: Claude Signed-off-by:Robert Shaw <robertgshaw2@gmail.com>
-
Brandon Pelfrey authored
Signed-off-by:
Brandon Pelfrey <bpelfrey@nvidia.com> Signed-off-by:
Brandon Pelfrey <brandonpelfrey@gmail.com> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
Mohammad Miadh Angkad authored
Signed-off-by:Mohammad Miadh Angkad <176301910+mmangkad@users.noreply.github.com>
-
Mohammad Miadh Angkad authored
Signed-off-by:Mohammad Miadh Angkad <176301910+mmangkad@users.noreply.github.com>
-
Francesco Fusco authored
Signed-off-by:Francesco Fusco <ffu@zurich.ibm.com>
-