- 30 Aug, 2025 1 commit
-
-
Andy Lo authored
Signed-off-by:
Andy Lo <andy@mistral.ai> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
- 29 Aug, 2025 1 commit
-
-
wang.yuqi authored
Signed-off-by:wang.yuqi <noooop@126.com>
-
- 28 Aug, 2025 1 commit
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
- 27 Aug, 2025 5 commits
-
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Hyogeun Oh (오효근) authored
Signed-off-by:
Zerohertz <ohg3417@gmail.com> Signed-off-by:
Hyogeun Oh (오효근) <ohg3417@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 26 Aug, 2025 3 commits
-
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Zijing Liu authored
[Disagg][Perf] Use CUDA event sync instead of blocking `tolist` to avoid unintentional copy ops blocking across different CUDA streams, improving disagg TTIT/TTFT (#22760) Signed-off-by:
Zijing Liu <liuzijing2014@gmail.com> Signed-off-by:
Zijing Liu <liuzijing2014@users.noreply.github.com>
-
- 25 Aug, 2025 3 commits
-
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Ayush Satyam authored
Signed-off-by:Ayush Satyam <ayushsatyam146@gmail.com>
-
Chenguang Zheng authored
[Core][Multimodal] Track encode cache entries by mm_hash and enable embedding sharing between requests (#22711) Signed-off-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
- 24 Aug, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 23 Aug, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 22 Aug, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 21 Aug, 2025 4 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <noooop@126.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk@thinkingmachines.ai>
-
- 20 Aug, 2025 2 commits
-
-
rongfu.leng authored
Signed-off-by:
rongfu.leng <rongfu.leng@daocloud.io> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
rongfu.leng <lenronfu@gmail.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 19 Aug, 2025 3 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 18 Aug, 2025 3 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 17 Aug, 2025 1 commit
-
-
Calvin Chen authored
Signed-off-by:calvin chen <wen.chen@dynamia.ai>
-
- 16 Aug, 2025 2 commits
-
-
afeldman-nm authored
Signed-off-by:
Andrew Feldman <afeldman@redhat.com> Signed-off-by:
Andrew Feldman <afeld2012@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Andrew Feldman <afeld2012@gmail.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 15 Aug, 2025 6 commits
-
-
rishitdholakia13 authored
[Structured Outputs] [Bug] Fix misalignment in apply_grammar_bitmask causing unintended masking and NaN logits (#22963) Signed-off-by:rishitdholakia13 <rishit+github@cohere.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
JartX authored
Signed-off-by:JartX <sagformas@epdcenter.es>
-
fhl2000 authored
[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059) Signed-off-by:
fhl <2410591650@qq.com> Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
Thomas Parnell authored
Signed-off-by:
Daniel Afrimi <danielafrimi8@gmail.com> Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Daniel Afrimi <danielafrimi8@gmail.com> Co-authored-by:
Burkhard Ringlein <ngl@zurich.ibm.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Roger Wang authored
Signed-off-by:
Roger Wang <hey@rogerw.me> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 14 Aug, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-