- 25 Aug, 2025 2 commits
-
-
Chenguang Zheng authored
[Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711) Signed-off-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Noam Gat authored
Signed-off-by:
Noam Gat <noamgat@gmail.com> Co-authored-by:
Russell Bryant <rbryant@redhat.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 22 Aug, 2025 6 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Flora Feng authored
Signed-off-by:sfeng33 <4florafeng@gmail.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Arjun Reddy authored
Signed-off-by:
Arjun Reddy <189282188+arjunbreddy22@users.noreply.github.com> Co-authored-by:
Arjun Reddy <189282188+arjunbreddy22@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 21 Aug, 2025 2 commits
-
-
Kebe authored
Signed-off-by:Kebe <mail@kebe7jun.com>
-
22quinn authored
Signed-off-by:
22quinn <33176974+22quinn@users.noreply.github.com> Co-authored-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 20 Aug, 2025 3 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
- 19 Aug, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 18 Aug, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 17 Aug, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 16 Aug, 2025 3 commits
-
-
afeldman-nm authored
Signed-off-by:
Andrew Feldman <afeldman@redhat.com> Signed-off-by:
Andrew Feldman <afeld2012@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Andrew Feldman <afeld2012@gmail.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 15 Aug, 2025 7 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Yinghai Lu <yinghai@thinkingmachines.ai>
-
fhl2000 authored
[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059) Signed-off-by:
fhl <2410591650@qq.com> Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
Thomas Parnell authored
Signed-off-by:
Daniel Afrimi <danielafrimi8@gmail.com> Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Daniel Afrimi <danielafrimi8@gmail.com> Co-authored-by:
Burkhard Ringlein <ngl@zurich.ibm.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Asaf Joseph Gardin authored
Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 14 Aug, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 13 Aug, 2025 7 commits
-
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by:
Woosuk Kwon <woosuk@thinkingmachines.ai>
-
- 12 Aug, 2025 4 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
phantomlei authored
Signed-off-by:phantomlei <phantomlei3@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 11 Aug, 2025 1 commit
-
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-