- 16 May, 2025 1 commit
-
-
Lucia Fang authored
Signed-off-by:Lucia Fang <fanglu@fb.com>
-
- 15 May, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
omahs authored
Signed-off-by:omahs <73983677+omahs@users.noreply.github.com>
-
- 14 May, 2025 4 commits
-
-
bnellnm authored
-
Ekagra Ranjan authored
[V1][Spec Decode] Share input embedding of target model with EAGLE draft model to free ~1GB for llama 3 model (#17326) Co-authored-by:
root <root@ekagra-8xh100.us-east5-a.c.serving-efficiency-poc.internal> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
majianpeng authored
Signed-off-by:Ma, Jianpeng <jianpeng.ma@intel.com>
-
Ecthlion_zyy authored
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
- 12 May, 2025 3 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Xu Wenqing authored
Signed-off-by:
许文卿 <xwq391974@alibaba-inc.com> Signed-off-by:
Xu Wenqing <xuwq1993@qq.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
- 11 May, 2025 1 commit
-
-
Frieda Huang authored
Signed-off-by:Frieda (Jingying) Huang <jingyingfhuang@gmail.com>
-
- 10 May, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 09 May, 2025 2 commits
-
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Rui Qiao authored
Signed-off-by:Rui Qiao <ruisearch42@gmail.com>
-
- 08 May, 2025 4 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Rick Yuan authored
Signed-off-by:
Rick Yuan <yuan821120@gmail.com> Signed-off-by:
RIck Yuan <yuan821120@gmail.com> Co-authored-by:
Aaron Pham <Aaronpham0103@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Aaron Pham authored
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
-
- 07 May, 2025 4 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Satyajith Chilappagari authored
Signed-off-by:
Satyajith Chilappagari <satchill@amazon.com> Co-authored-by:
Aaron Dou <yzdou@amazon.com> Co-authored-by:
Shashwat Srijan <sssrijan@amazon.com> Co-authored-by:
Chongming Ni <chongmni@amazon.com> Co-authored-by:
Amulya Ballakur <amulyaab@amazon.com> Co-authored-by:
Patrick Lange <patlange@amazon.com> Co-authored-by:
Elaine Zhao <elaineyz@amazon.com> Co-authored-by:
Lin Lin Pan <tailinpa@amazon.com> Co-authored-by:
Navyadhara Gogineni <navyadha@amazon.com> Co-authored-by:
Yishan McNabb <yishanm@amazon.com> Co-authored-by:
Mrinal Shukla <181322398+mrinalks@users.noreply.github.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 06 May, 2025 3 commits
-
-
Jevin Jiang authored
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 04 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 02 May, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 01 May, 2025 4 commits
-
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 30 Apr, 2025 4 commits
-
-
zh Wang authored
Signed-off-by:zh Wang <rekind133@outlook.com>
-
Alec authored
Signed-off-by:
alec-flowers <aflowers@nvidia.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Mark McLoughlin <markmc@redhat.com>
-
Marco authored
Signed-off-by:
Marco <121761685+mlinmg@users.noreply.github.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Signed-off-by:
isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Huy Do authored
-
- 29 Apr, 2025 3 commits
-
-
Bryan Lu authored
Signed-off-by:Bryan Lu <yuzhelu@amazon.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-