- 16 May, 2025 1 commit
-
-
Lucia Fang authored
Signed-off-by: Lucia Fang <fanglu@fb.com>
-
- 15 May, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
omahs authored
Signed-off-by: omahs <73983677+omahs@users.noreply.github.com>
-
- 14 May, 2025 3 commits
-
-
bnellnm authored
-
Ekagra Ranjan authored
[V1][Spec Decode] Share input embedding of target model with EAGLE draft model to free ~1GB for llama 3 model (#17326)
Co-authored-by: root <root@ekagra-8xh100.us-east5-a.c.serving-efficiency-poc.internal>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Ecthlion_zyy authored
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
- 12 May, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
- 09 May, 2025 1 commit
-
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
- 08 May, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 07 May, 2025 3 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Satyajith Chilappagari authored
Signed-off-by: Satyajith Chilappagari <satchill@amazon.com>
Co-authored-by: Aaron Dou <yzdou@amazon.com>
Co-authored-by: Shashwat Srijan <sssrijan@amazon.com>
Co-authored-by: Chongming Ni <chongmni@amazon.com>
Co-authored-by: Amulya Ballakur <amulyaab@amazon.com>
Co-authored-by: Patrick Lange <patlange@amazon.com>
Co-authored-by: Elaine Zhao <elaineyz@amazon.com>
Co-authored-by: Lin Lin Pan <tailinpa@amazon.com>
Co-authored-by: Navyadhara Gogineni <navyadha@amazon.com>
Co-authored-by: Yishan McNabb <yishanm@amazon.com>
Co-authored-by: Mrinal Shukla <181322398+mrinalks@users.noreply.github.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 06 May, 2025 2 commits
-
-
Jevin Jiang authored
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 04 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 02 May, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 01 May, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
- 30 Apr, 2025 1 commit
-
-
Marco authored
Signed-off-by: Marco <121761685+mlinmg@users.noreply.github.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
Signed-off-by: isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 29 Apr, 2025 1 commit
-
-
Bryan Lu authored
Signed-off-by: Bryan Lu <yuzhelu@amazon.com>
-
- 28 Apr, 2025 1 commit
-
-
Alex Brooks authored
Signed-off-by: Alex-Brooks <Alex.brooks@ibm.com>
Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>
-
- 26 Apr, 2025 2 commits
-
-
Isotr0py authored
-
Yihua Cheng authored
-
- 25 Apr, 2025 1 commit
-
-
Benjamin Chislett authored
Signed-off-by: Bryan Lu <yuzhelu@amazon.com>
Signed-off-by: Benjamin Chislett <benjamin.chislett@centml.ai>
Co-authored-by: Bryan Lu <yuzhelu@amazon.com>
-
- 19 Apr, 2025 3 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Yang Fan authored
Signed-off-by: fyabc <suyang.fy@alibaba-inc.com>
Signed-off-by: Roger Wang <ywang@roblox.com>
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
Co-authored-by: Xiong Wang <wangxiongts@163.com>
-
- 18 Apr, 2025 3 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
- 17 Apr, 2025 4 commits
-
-
Yihua Cheng authored
Signed-off-by: ApostaC <yihua98@uchicago.edu>
Signed-off-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
Signed-off-by: remi <remi@mistral.ai>
Co-authored-by: rshaw@neuralmagic.com <robertgshaw2@gmail.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Rémi Delacourt <54138269+Flechman@users.noreply.github.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
-
Reid authored
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
-
Richard Liaw authored
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
- 16 Apr, 2025 1 commit
-
-
Reid authored
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
-
- 15 Apr, 2025 1 commit
-
-
Reid authored
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
-
- 14 Apr, 2025 2 commits
-
-
courage17340 authored
Signed-off-by: courage17340 <courage17340@163.com>
-
Reid authored
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
-