- 20 Oct, 2024 4 commits
-
-
Andy Dai authored
-
Chen Zhang authored
-
Michael Goin authored
-
Chen Zhang authored
-
- 19 Oct, 2024 10 commits
-
-
Michael Goin authored
-
Cyrus Leung authored
-
Yue Zhang authored
-
Russell Bryant authored
Signed-off-by:Russell Bryant <russell.bryant@gmail.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
Nick Hill authored
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
sasha0552 authored
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Chih-Chieh Yang <chih.chieh.yang@ibm.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-
- 18 Oct, 2024 15 commits
-
-
Cody Yu authored
-
Kunjan authored
Co-authored-by:Kunjan Patel <kunjanp_google_com@vllm.us-central1-a.c.kunjanp-gke-dev-2.internal>
-
Michael Goin authored
-
Russell Bryant authored
-
Cyrus Leung authored
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Tyler Michael Smith authored
-
Cyrus Leung authored
-
Nick Hill authored
-
tomeras91 authored
-
Nick Hill authored
-
Russell Bryant authored
-
Haoyu Wang authored
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
Dipika Sikka authored
-
- 17 Oct, 2024 11 commits
-
-
Robert Shaw authored
Signed-off-by:
Max de Bayser <maxdebayser@gmail.com> Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Co-authored-by:
Andrew Feldman <afeldman@neuralmagic.com> Co-authored-by:
afeldman-nm <156691304+afeldman-nm@users.noreply.github.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
laishzh <laishengzhang@gmail.com> Co-authored-by:
Max de Bayser <maxdebayser@gmail.com> Co-authored-by:
Max de Bayser <mbayser@br.ibm.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Shashwat Srijan authored
-
sasha0552 authored
-
Kai Wu authored
Co-authored-by:Isotr0py <2037008807@qq.com>
-
bnellnm authored
-
Luka Govedič authored
-
Cyrus Leung authored
-
Daniele authored
-
Kuntai Du authored
Removing the block manager v1. This is the initial piece of prefix-caching-centric design. In order to achieve prefix-caching-centric design, we need to simplify the code path so that we only use v2 block manager (which has much higher performance on prefix caching).
-
Li, Jiang authored
-
Woosuk Kwon authored
-