- 01 Jul, 2025 20 commits
-
-
Yuxuan Zhang authored
Signed-off-by:
zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Kyle Sayers authored
-
Lionel Villard authored
Signed-off-by:Lionel Villard <villard@us.ibm.com>
-
Reid authored
Signed-off-by:reidliu41 <reid201711@gmail.com>
-
TY-AMD authored
Signed-off-by:Tianyuan Wu <Tianyuan.Wu@amd.com>
-
Kebe authored
Signed-off-by:Kebe <mail@kebe7jun.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
Prashant Gupta authored
Signed-off-by:Prashant Gupta <prashantgupta@us.ibm.com>
-
Richard Barnes authored
Co-authored-by:mgoin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Alex Kogan authored
[Feature] A calibration-free RTN-based quantization for accurate and accelerated INT4/INT8 inference (#18768) Signed-off-by:
Alex Kogan <alex.kogan@oracle.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Ernest Wong authored
Signed-off-by:Ernest Wong <chwong719@gmail.com>
-
Chendi.Xue authored
Signed-off-by:Chendi Xue <chendi.xue@intel.com>
-
Kuntai Du authored
Signed-off-by:KuntaiDu <kuntai@uchicago.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
fyuan1316 authored
Signed-off-by:Yuan Fang <yuanfang@alauda.io>
-
Luka Govedič authored
Signed-off-by:luka <luka@neuralmagic.com>
-
- 30 Jun, 2025 16 commits
-
-
Zhonghua Deng authored
Signed-off-by:Abatom <abzhonghua@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
li haoyang authored
Signed-off-by:
Haoyang Li <Haoyang.Li@amd.com> Co-authored-by:
Haoyang Li <307790822@qq.com>
-
Michael Yao authored
Signed-off-by:windsonsea <haifeng.yao@daocloud.io>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
noiji authored
Signed-off-by: noiji <>
-
Reid authored
Signed-off-by:reidliu41 <reid201711@gmail.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Chendi.Xue authored
Signed-off-by:Chendi Xue <chendi.xue@intel.com>
-
redmoe-moutain authored
Signed-off-by:redmoe-moutain <agiredmoe@gmail.com>
-
- 29 Jun, 2025 3 commits
-
-
Huy Do authored
Signed-off-by:Huy Do <huydhn@gmail.com>
-
Dipika Sikka authored
Signed-off-by:
Dipika Sikka <dipikasikka1@gmail.com> Signed-off-by:
Dipika <dipikasikka1@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 28 Jun, 2025 1 commit
-
-
Wentao Ye authored
[Refactor] Create a function util and cache the results for `has_deepgemm`, `has_deepep`, `has_pplx` (#20187) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-