- 26 Aug, 2025 6 commits
-
-
Russell Bryant authored
Signed-off-by: Russell Bryant <rbryant@redhat.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.me>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Roger Wang <hey@rogerw.me>
Co-authored-by: knlnguyen1802 <knlnguyen1802@gmail.com>
-
Zijing Liu authored
[Disagg][Perf] Use CUDA event sync instead of blocking `tolist` to avoid unintentional copy ops blocking across different CUDA streams, improving disagg TTIT/TTFT (#22760)
Signed-off-by: Zijing Liu <liuzijing2014@gmail.com>
Signed-off-by: Zijing Liu <liuzijing2014@users.noreply.github.com>
-
weiliang authored
Signed-off-by: Siyuan Fu <siyuanf@nvidia.com>
Signed-off-by: siyuanf <siyuanf@nvidia.com>
Signed-off-by: Weiliang Liu <weiliangl@nvidia.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Siyuan Fu <siyuanf@nvidia.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 25 Aug, 2025 8 commits
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>
-
Chaojun Zhang authored
Signed-off-by: chzhang <chaojun.zhang@intel.com>
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Driss Guessous authored
Signed-off-by: drisspg <drisspguessous@gmail.com>
-
Ayush Satyam authored
Signed-off-by: Ayush Satyam <ayushsatyam146@gmail.com>
-
Chenguang Zheng authored
[Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711)
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
Noam Gat authored
Signed-off-by: Noam Gat <noamgat@gmail.com>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
- 24 Aug, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
汪志鹏 authored
Signed-off-by: 汪志鹏 <wangzhipeng628@gmail.com>
-
- 23 Aug, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
- 22 Aug, 2025 8 commits
-
-
elvischenv authored
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Didier Durand authored
Signed-off-by: Didier Durand <durand.didier@gmail.com>
-
Ning Xie authored
Signed-off-by: Andy Xie <andy.xning@gmail.com>
-
Russell Bryant authored
Signed-off-by: Russell Bryant <rbryant@redhat.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni001@gmail.com>
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
- 21 Aug, 2025 11 commits
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Pavani Majety authored
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Ming Yang authored
Signed-off-by: Ming Yang <minos.future@gmail.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
wang.yuqi authored
Signed-off-by: wang.yuqi <noooop@126.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Paul Pak authored
Signed-off-by: Paul Pak <paulpak58@gmail.com>
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Asaf Joseph Gardin authored
Signed-off-by: asafg <asafg@ai21.com>
Signed-off-by: asafg <39553475+Josephasafg@users.noreply.github.com>
Co-authored-by: asafg <asafg@ai21.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>
-
- 20 Aug, 2025 3 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni001@gmail.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni001@gmail.com>
-
rongfu.leng authored
Signed-off-by: rongfu.leng <rongfu.leng@daocloud.io>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-