- 09 Sep, 2025 1 commit
-
-
Zebing Lin authored
Signed-off-by:linzebing <linzebing1995@gmail.com>
-
- 02 Sep, 2025 1 commit
-
-
Didier Durand authored
Signed-off-by:Didier Durand <durand.didier@gmail.com>
-
- 30 Aug, 2025 1 commit
-
-
Ning Xie authored
-
- 29 Aug, 2025 1 commit
-
-
Flora Feng authored
Signed-off-by:sfeng33 <4florafeng@gmail.com>
-
- 28 Aug, 2025 1 commit
-
-
Hanchenli authored
Signed-off-by:Hanchenli <lihanc2002@gmail.com>
-
- 26 Aug, 2025 1 commit
-
-
Roger Wang authored
Signed-off-by:
Roger Wang <hey@rogerw.me> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.me> Co-authored-by:
knlnguyen1802 <knlnguyen1802@gmail.com>
-
- 25 Aug, 2025 1 commit
-
-
Chenguang Zheng authored
[Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711) Signed-off-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
- 19 Aug, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 16 Aug, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 15 Aug, 2025 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 13 Aug, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 05 Aug, 2025 2 commits
-
-
Woosuk Kwon authored
-
PiteXChen authored
Signed-off-by:CLFutureX <775523362@qq.com>
-
- 30 Jul, 2025 1 commit
-
-
Ruixiang Tan authored
Signed-off-by:
tanruixiang <tanruixiang0104@gmail.com> Signed-off-by:
Ruixiang Tan <819464715@qq.com> Signed-off-by:
GitHub <noreply@github.com>
-
- 29 Jul, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 28 Jul, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 24 Jul, 2025 1 commit
-
-
Zhou Fang authored
Signed-off-by:Zhou Fang <fang.github@gmail.com>
-
- 23 Jul, 2025 1 commit
-
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
- 22 Jul, 2025 1 commit
-
-
Jialin Ouyang authored
[Core] Introduce popleft_n and append_n in FreeKVCacheBlockQueue to further optimize block_pool (#21222) Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
- 21 Jul, 2025 1 commit
-
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
- 19 Jul, 2025 1 commit
-
-
Lucia Fang authored
Signed-off-by:
Lucia Fang <fanglu@fb.com> Signed-off-by:
Lu Fang <fanglu@meta.com> Signed-off-by:
Lu Fang <fanglu@fb.com> Co-authored-by:
Lu Fang <fanglu@meta.com>
-
- 18 Jul, 2025 1 commit
-
-
JialinOuyang-Meta authored
Signed-off-by:Jialin Ouyang <jialino@meta.com>
-
- 15 Jul, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 14 Jul, 2025 1 commit
-
-
Maroon Ayoub authored
Signed-off-by:Maroon Ayoub <maroon.ayoub@ibm.com>
-
- 12 Jul, 2025 1 commit
-
-
Woosuk Kwon authored
-
- 04 Jul, 2025 1 commit
-
-
Aaron Pham authored
Signed-off-by:
Aaron Pham <contact@aarnphm.xyz> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 30 Jun, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 23 Jun, 2025 1 commit
-
-
amit authored
Signed-off-by:
amit <amit.man@gmail.com> Co-authored-by:
Roger Wang <Rogerw0108@gmail.com>
-
- 19 Jun, 2025 1 commit
-
-
Maximilien de Bayser authored
Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Signed-off-by:
Max de Bayser <maxdebayser@gmail.com> Signed-off-by:
22quinn <33176974+22quinn@users.noreply.github.com> Co-authored-by:
22quinn <33176974+22quinn@users.noreply.github.com>
-
- 12 Jun, 2025 1 commit
-
-
jmswen authored
Signed-off-by:Jon Swenson <jmswen@gmail.com>
-
- 10 Jun, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 06 Jun, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 04 Jun, 2025 1 commit
-
-
Chen Zhang authored
[Bugfix] Max concurrency estimation and check_enough_kv_cache_memory for models with sliding window layers (#19029) Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 03 Jun, 2025 3 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 30 May, 2025 1 commit
-
-
Nick Hill authored
-
- 23 May, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Chauncey authored
Co-authored-by:simon-mo <xmo@berkeley.edu>
-
- 21 May, 2025 1 commit
-
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-