- 12 Jun, 2025 16 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Varun Sundar Rabindranath authored
-
Ekagra Ranjan authored
[Spec Decode][Benchmark] Generalize spec decode offline benchmark to more methods and datasets (#18847)
-
Luka Govedič authored
Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
Sage Moore <sage@neuralmagic.com>
-
mobicham authored
Signed-off-by:mobicham <hicham@mobiuslabs.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
jmswen authored
Signed-off-by:Jon Swenson <jmswen@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
rasmith authored
Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
wonjun Jang authored
Signed-off-by:strutive07 <strutive07@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
Brayden Zhong authored
Signed-off-by:Brayden Zhong <b8zhong@uwaterloo.ca>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
- 11 Jun, 2025 13 commits
-
-
Robert Shaw authored
Signed-off-by:rshaw@neuralmagic.com <robertgshaw2@gmail.com>
-
rasmith authored
[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331) Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Ximingwang-09 authored
Signed-off-by:
ximing.wxm <ximing.wxm@antgroup.com> Co-authored-by:
ximing.wxm <ximing.wxm@antgroup.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Lu Fang authored
Signed-off-by:Lu Fang <lufang@fb.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
artetaout authored
Signed-off-by:artetaout <lulala341@gmail.com>
-
Junhao Li authored
Signed-off-by:Junhao Li <junhao@ubicloud.com>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
wang.yuqi authored
-
- 10 Jun, 2025 11 commits
-
-
Richard Zou authored
Signed-off-by:rzou <zou3519@gmail.com>
-
Xu Wenqing authored
Signed-off-by:许文卿 <xwq391974@alibaba-inc.com>
-
py-andy-c authored
Signed-off-by:py-andy-c <pychen1017@gmail.com>
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Jee Jee Li authored
Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Rachel Guo authored
[BugFix][FlashInfer] Fix attention backend interface mismatch with unexpected keyword `use_irope` (#19134) Signed-off-by:Yunqiu Guo <guorachel@meta.com>
-
Isotr0py authored
-
Louie Tsai authored
Signed-off-by:
Tsai, Louie <louie.tsai@intel.com> Co-authored-by:
Li, Jiang <bigpyj64@gmail.com>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Li Wang authored
Signed-off-by:
wangli <wangli858794774@gmail.com> Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-