- 01 Jul, 2025 2 commits
-
-
czhu-cohere authored
Signed-off-by: czhu-cohere <conway.zhu@cohere.com>
-
Alex Kogan authored
[Feature] A calibration-free RTN-based quantization for accurate and accelerated INT4/INT8 inference (#18768)
Signed-off-by: Alex Kogan <alex.kogan@oracle.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
- 30 Jun, 2025 2 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
li haoyang authored
Signed-off-by: Haoyang Li <Haoyang.Li@amd.com>
Co-authored-by: Haoyang Li <307790822@qq.com>
-
- 29 Jun, 2025 1 commit
-
-
Dipika Sikka authored
Signed-off-by: Dipika Sikka <dipikasikka1@gmail.com>
Signed-off-by: Dipika <dipikasikka1@gmail.com>
-
- 28 Jun, 2025 1 commit
-
-
Wentao Ye authored
[Refactor] Create a function util and cache the results for `has_deepgemm`, `has_deepep`, `has_pplx` (#20187)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 27 Jun, 2025 2 commits
-
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Robert Shaw authored
Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
Signed-off-by: Roger Wang <hey@rogerw.me>
Co-authored-by: Roger Wang <hey@rogerw.me>
-
- 26 Jun, 2025 3 commits
-
-
Bowen Wang authored
Signed-off-by: Bowen Wang <abmfy@icloud.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
- 25 Jun, 2025 4 commits
-
-
Dipika Sikka authored
Signed-off-by: Dipika Sikka <dipikasikka1@gmail.com>
Signed-off-by: Dipika <dipikasikka1@gmail.com>
-
bnellnm authored
[Kernels][Bugfix] Use torch op for all kernels in FusedMoE forward. Add additional testing for cudagraphs. (#19717)
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 24 Jun, 2025 3 commits
-
-
Boyuan Feng authored
Signed-off-by: Boyuan Feng <boyuan@meta.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Vadim Gimpelson authored
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@centml.ai>
-
- 23 Jun, 2025 3 commits
-
-
Jun-Howie authored
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
Tyler Michael Smith authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Signed-off-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
- 22 Jun, 2025 1 commit
-
-
Ye (Charlotte) Qi authored
Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>
-
- 20 Jun, 2025 1 commit
-
-
qli88 authored
Signed-off-by: Qiang Li <qiang.li2@amd.com>
-
- 19 Jun, 2025 2 commits
-
-
TJian authored
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
Maximilien de Bayser authored
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <maxdebayser@gmail.com>
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
Co-authored-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
- 18 Jun, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
- 17 Jun, 2025 2 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
Di Liu authored
Signed-off-by: Di Liu <liu-di@sjtu.edu.cn>
-
- 16 Jun, 2025 4 commits
-
-
Dipika Sikka authored
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Szymon Ożóg authored
Signed-off-by: SzymonOzog <szymon.ozog@gmail.com>
-
wang.yuqi authored
-
- 13 Jun, 2025 2 commits
-
-
Boyuan Feng authored
Signed-off-by: Boyuan Feng <boyuan@meta.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
- 12 Jun, 2025 6 commits
-
-
Varun Sundar Rabindranath authored
-
mobicham authored
Signed-off-by: mobicham <hicham@mobiuslabs.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
rasmith authored
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
Brayden Zhong authored
Signed-off-by: Brayden Zhong <b8zhong@uwaterloo.ca>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-