- 30 Jun, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 28 Jun, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 26 Jun, 2025 1 commit
-
-
Ekagra Ranjan authored
-
- 24 Jun, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 23 Jun, 2025 1 commit
-
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
- 21 Jun, 2025 1 commit
-
-
汪志鹏 authored
Signed-off-by:汪志鹏 <wangzhipeng628@gmail.com>
-
- 19 Jun, 2025 1 commit
-
-
Maximilien de Bayser authored
Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Signed-off-by:
Max de Bayser <maxdebayser@gmail.com> Signed-off-by:
22quinn <33176974+22quinn@users.noreply.github.com> Co-authored-by:
22quinn <33176974+22quinn@users.noreply.github.com>
-
- 17 Jun, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
- 12 Jun, 2025 2 commits
-
-
Ekagra Ranjan authored
[Spec Decode][Benchmark] Generalize spec decode offline benchmark to more methods and datasets (#18847)
-
niu_he authored
Signed-off-by:2niuhe <carlton2tang@gmail.com>
-
- 11 Jun, 2025 1 commit
-
-
wang.yuqi authored
-
- 10 Jun, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 07 Jun, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 04 Jun, 2025 1 commit
-
-
汪志鹏 authored
Signed-off-by:汪志鹏 <wangzhipeng628@gmail.com>
-
- 03 Jun, 2025 3 commits
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
汪志鹏 authored
Signed-off-by:汪志鹏 <wangzhipeng628@gmail.com>
-
Siyuan Liu authored
Signed-off-by:
Siyuan Liu <lsiyuan@google.com> Co-authored-by:
Hossein Sarshar <hossein.sarshar@gmail.com> Co-authored-by:
Chengji Yao <chengjiyao@google.com>
-
- 02 Jun, 2025 1 commit
-
-
Calvin Chen authored
Signed-off-by:calvin chen <120380290@qq.com>
-
- 31 May, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Yizhou Liu <liu_yizhou@outlook.com>
-
Satyajith Chilappagari authored
Signed-off-by:
Satyajith Chilappagari <satchill@amazon.com> Co-authored-by:
Ashraf Mahgoub <ashymahg@amazon.com> Co-authored-by:
Rohith Nallamaddi <nalrohit@amazon.com> Co-authored-by:
FeliciaLuo <luof@amazon.com> Co-authored-by:
Elaine Zhao <elaineyz@amazon.com>
-
- 28 May, 2025 2 commits
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
wang.yuqi authored
-
- 27 May, 2025 3 commits
-
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 26 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 25 May, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 23 May, 2025 2 commits
-
-
Feng XiaoLong authored
Signed-off-by:
Crucifixion-Fxl <xmufxl@gmail.com> Co-authored-by:
Crucifixion-Fxl <xmufxl@gmail.com>
-
Chenheli Hua authored
Signed-off-by:Chenheli Hua <huachenheli@outlook.com>
-
- 22 May, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 21 May, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 20 May, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 19 May, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 16 May, 2025 1 commit
-
-
Lucia Fang authored
Signed-off-by:Lucia Fang <fanglu@fb.com>
-
- 15 May, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
omahs authored
Signed-off-by:omahs <73983677+omahs@users.noreply.github.com>
-
- 14 May, 2025 3 commits
-
-
bnellnm authored
-
Ekagra Ranjan authored
[V1][Spec Decode] Share input embedding of target model with EAGLE draft model to free ~1GB for llama 3 model (#17326) Co-authored-by:
root <root@ekagra-8xh100.us-east5-a.c.serving-efficiency-poc.internal> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Ecthlion_zyy authored
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-