- 01 Jul, 2025 5 commits
-
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
Alex Kogan authored
[Feature] A calibration-free RTN-based quantization for accurate and accelerated INT4/INT8 inference (#18768) Signed-off-by:
Alex Kogan <alex.kogan@oracle.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
fyuan1316 authored
Signed-off-by:Yuan Fang <yuanfang@alauda.io>
-
Luka Govedič authored
Signed-off-by:luka <luka@neuralmagic.com>
-
- 30 Jun, 2025 4 commits
-
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
redmoe-moutain authored
Signed-off-by:redmoe-moutain <agiredmoe@gmail.com>
-
- 29 Jun, 2025 2 commits
-
-
Dipika Sikka authored
Signed-off-by:
Dipika Sikka <dipikasikka1@gmail.com> Signed-off-by:
Dipika <dipikasikka1@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 28 Jun, 2025 5 commits
-
-
Wentao Ye authored
[Refactor] Create a function util and cache the results for `has_deepgemm`, `has_deepep`, `has_pplx` (#20187) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Stan Wozniak authored
Signed-off-by:Stanislaw Wozniak <stw@zurich.ibm.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Chales Xu authored
Signed-off-by:n2ptr <xuzhanchaomail@163.com>
-
Michael Goin authored
[CI Fix] Pin tests/models/registry.py MiniMaxText01ForCausalLM to revision due to model changes (#20199) Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 27 Jun, 2025 8 commits
-
-
Luka Govedič authored
Signed-off-by:luka <luka@neuralmagic.com>
-
Robert Shaw authored
Signed-off-by:
rshaw@neuralmagic.com <robertgshaw2@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.me> Co-authored-by:
Roger Wang <hey@rogerw.me>
-
Yazan Sharaya authored
Signed-off-by:Yazan-Sharaya <yazan.sharaya.yes@gmail.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <noooop@126.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Yang Wang authored
Signed-off-by:
Yang Wang <elainewy@meta.com> Signed-off-by:
Yida Wu <yidawu@alumni.cmu.edu> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Concurrensee <yida.wu@amd.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
li haoyang authored
Signed-off-by:
ilmarkov <imarkov@redhat.com> Signed-off-by:
Haoyang Li <Haoyang.Li@amd.com> Co-authored-by:
ilmarkov <imarkov@redhat.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 26 Jun, 2025 8 commits
-
-
Bowen Wang authored
Signed-off-by:Bowen Wang <abmfy@icloud.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Seiji Eicher authored
Signed-off-by:Seiji Eicher <seiji@anyscale.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 25 Jun, 2025 7 commits
-
-
Chenyaaang authored
Signed-off-by:Chenyaaang <chenyangli@google.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
lkchen authored
Signed-off-by:Linkun <github@lkchen.net>
-
Nicolò Lucchesi authored
Signed-off-by:
Roger Wang <ywang@roblox.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
bnellnm authored
[Kernels][Bugfix] Use torch op for all kernels in FusedMoE forward. Add additional testing for cudagraphs. (#19717) Signed-off-by:Bill Nell <bnell@redhat.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 24 Jun, 2025 1 commit
-
-
Boyuan Feng authored
Signed-off-by:Boyuan Feng <boyuan@meta.com>
-