- 16 May, 2025 1 commit
-
-
kliuae authored
Signed-off-by:kf <kuanfu.liu@embeddedllm.com>
-
- 15 May, 2025 18 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
TJian authored
[Bugfix] [ROCm]: Remove assertion logic when using AITER fused moe in unquantizedMethod to reenable LLama4 BF16 (#18205) Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
Zhonghua Deng authored
Signed-off-by:Abatom <abzhonghua@gmail.com>
-
Sebastian Schoennenbeck authored
Signed-off-by:Sebastian Schönnenbeck <sebastian.schoennenbeck@comma-soft.com>
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
hustxiayang authored
Signed-off-by:yangxia <yangxiast@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
omahs authored
Signed-off-by:omahs <73983677+omahs@users.noreply.github.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Mengqing Cao authored
Signed-off-by:Mengqing Cao <cmq0113@163.com>
-
inkcherry authored
Signed-off-by:inkcherry <mingzhi.liu@intel.com>
-
Chenheli Hua authored
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Luka Govedič authored
Signed-off-by:Luka Govedič <lgovedic@redhat.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
David Xia authored
-
- 14 May, 2025 21 commits
-
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Jerry Zhang authored
Signed-off-by:Jerry Zhang <jerryzh168@gmail.com>
-
Aaron Pham authored
Signed-off-by:
Aaron Pham <contact@aarnphm.xyz> Co-authored-by:
Russell Bryant <rbryant@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
David Xia authored
Co-authored-by:Aaron Pham <Aaronpham0103@gmail.com>
-
bnellnm authored
-
Ekagra Ranjan authored
[V1][Spec Decode] Share input embedding of target model with EAGLE draft model to free ~1GB for llama 3 model (#17326) Co-authored-by:
root <root@ekagra-8xh100.us-east5-a.c.serving-efficiency-poc.internal> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Nick Hill authored
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
TJian authored
Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Andrzej Kotłowski authored
Signed-off-by:Andrzej Kotłowski <akotlowski@habana.ai>
-
rongfu.leng authored
Signed-off-by:rongfu.leng <rongfu.leng@daocloud.io>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
wang.yuqi authored
-
lkchen authored
Signed-off-by:Linkun <github@lkchen.net>
-
qli88 authored
Signed-off-by:Qiang Li <qiang.li2@amd.com>
-
Charlie Fu authored
Signed-off-by:charlifu <charlifu@amd.com>
-