- 12 Jun, 2025 23 commits
-
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Aaron Pham authored
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Varun Sundar Rabindranath authored
-
Ekagra Ranjan authored
[Spec Decode][Benchmark] Generalize spec decode offline benchmark to more methods and datasets (#18847)
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Luka Govedič authored
Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
Sage Moore <sage@neuralmagic.com>
-
mobicham authored
Signed-off-by:mobicham <hicham@mobiuslabs.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
jmswen authored
Signed-off-by:Jon Swenson <jmswen@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Russell Bryant authored
Signed-off-by:
Russell Bryant <rbryant@redhat.com> Co-authored-by:
Aaron Pham <Aaronpham0103@gmail.com>
-
niu_he authored
Signed-off-by:2niuhe <carlton2tang@gmail.com>
-
rasmith authored
Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
wonjun Jang authored
Signed-off-by:strutive07 <strutive07@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
Brayden Zhong authored
Signed-off-by:Brayden Zhong <b8zhong@uwaterloo.ca>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
- 11 Jun, 2025 17 commits
-
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
Robert Shaw authored
Signed-off-by:rshaw@neuralmagic.com <robertgshaw2@gmail.com>
-
rasmith authored
[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331) Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
David Xia authored
Signed-off-by:David Xia <david@davidxia.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
runzhen authored
Signed-off-by:Runzhen Wang <wangrunzhen@gmail.com>
-
Louie Tsai authored
Signed-off-by:Tsai, Louie <louie.tsai@intel.com>
-
Ximingwang-09 authored
Signed-off-by:
ximing.wxm <ximing.wxm@antgroup.com> Co-authored-by:
ximing.wxm <ximing.wxm@antgroup.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Lu Fang authored
Signed-off-by:Lu Fang <lufang@fb.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-