- 26 May, 2025 3 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
AlexZhao authored
Signed-off-by:
zhaohaidao <zhaohaidao2008@hotmail.com> Signed-off-by:
zhaohaiyuan <zhaohaiyuan@xiaohongshu.com> Co-authored-by:
zhaohaiyuan <zhaohaiyuan@xiaohongshu.com>
-
- 25 May, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 23 May, 2025 3 commits
-
-
Feng XiaoLong authored
Signed-off-by:
Crucifixion-Fxl <xmufxl@gmail.com> Co-authored-by:
Crucifixion-Fxl <xmufxl@gmail.com>
-
Chenheli Hua authored
Signed-off-by:Chenheli Hua <huachenheli@outlook.com>
-
Sanger Steel authored
[Frontend] [Core] Add Tensorizer support for V1, LoRA adapter serialization and deserialization (#17926) Signed-off-by:Sanger Steel <sangersteel@gmail.com>
-
- 22 May, 2025 4 commits
-
-
Kai Wu authored
Signed-off-by:Kai Wu <kaiwu@meta.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Calvin Chen authored
Signed-off-by:calvin chen <120380290@qq.com>
-
CYJiang authored
Signed-off-by:googs1025 <googs1025@gmail.com>
-
- 21 May, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 20 May, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 19 May, 2025 2 commits
-
-
Gong Shufan authored
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 16 May, 2025 2 commits
-
-
David Xia authored
Signed-off-by:David Xia <david@davidxia.com>
-
Lucia Fang authored
Signed-off-by:Lucia Fang <fanglu@fb.com>
-
- 15 May, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
omahs authored
Signed-off-by:omahs <73983677+omahs@users.noreply.github.com>
-
- 14 May, 2025 4 commits
-
-
bnellnm authored
-
Ekagra Ranjan authored
[V1][Spec Decode] Share input embedding of target model with EAGLE draft model to free ~1GB for llama 3 model (#17326) Co-authored-by:
root <root@ekagra-8xh100.us-east5-a.c.serving-efficiency-poc.internal> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
majianpeng authored
Signed-off-by:Ma, Jianpeng <jianpeng.ma@intel.com>
-
Ecthlion_zyy authored
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
- 12 May, 2025 3 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Xu Wenqing authored
Signed-off-by:
许文卿 <xwq391974@alibaba-inc.com> Signed-off-by:
Xu Wenqing <xuwq1993@qq.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
- 11 May, 2025 1 commit
-
-
Frieda Huang authored
Signed-off-by:Frieda (Jingying) Huang <jingyingfhuang@gmail.com>
-
- 10 May, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 09 May, 2025 2 commits
-
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Rui Qiao authored
Signed-off-by:Rui Qiao <ruisearch42@gmail.com>
-
- 08 May, 2025 4 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Rick Yuan authored
Signed-off-by:
Rick Yuan <yuan821120@gmail.com> Signed-off-by:
RIck Yuan <yuan821120@gmail.com> Co-authored-by:
Aaron Pham <Aaronpham0103@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Aaron Pham authored
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
-
- 07 May, 2025 4 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Satyajith Chilappagari authored
Signed-off-by:
Satyajith Chilappagari <satchill@amazon.com> Co-authored-by:
Aaron Dou <yzdou@amazon.com> Co-authored-by:
Shashwat Srijan <sssrijan@amazon.com> Co-authored-by:
Chongming Ni <chongmni@amazon.com> Co-authored-by:
Amulya Ballakur <amulyaab@amazon.com> Co-authored-by:
Patrick Lange <patlange@amazon.com> Co-authored-by:
Elaine Zhao <elaineyz@amazon.com> Co-authored-by:
Lin Lin Pan <tailinpa@amazon.com> Co-authored-by:
Navyadhara Gogineni <navyadha@amazon.com> Co-authored-by:
Yishan McNabb <yishanm@amazon.com> Co-authored-by:
Mrinal Shukla <181322398+mrinalks@users.noreply.github.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 06 May, 2025 1 commit
-
-
Jevin Jiang authored
-