- 13 Sep, 2024 7 commits
-
Alexander Matveev authored
-
Cyrus Leung authored
-
Cyrus Leung authored
-
shangmingc authored
-
Cyrus Leung authored
-
Dipika Sikka authored
-
Roger Wang authored
-
- 12 Sep, 2024 22 commits
-
Wenxiang authored
-
Patrick von Platen authored
-
Roger Wang authored
-
Roger Wang authored
[Hotfix][Core][VLM] Disable chunked prefill by default and prefix caching for multimodal models (#8425)
-
Alexander Matveev authored
-
Nick Hill authored
-
William Lin authored
-
Joe Runde authored
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
-
Luis Vega authored
-
WANGWEI authored
-
Alex Brooks authored
Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Isotr0py authored
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Roger Wang authored
-
youkaichao authored
-
Woosuk Kwon authored
-
Kevin Lin authored
-
Blueyo0 authored
-
tomeras91 authored
-
Michael Goin authored
-
Woosuk Kwon authored
-
youkaichao authored
-
Cody Yu authored
-
- 11 Sep, 2024 11 commits
-
Simon Mo authored
-
Patrick von Platen authored
Co-authored-by: Roger Wang <ywang@roblox.com>
-
Lily Liu authored
Co-authored-by: youkaichao <youkaichao@126.com>
-
Aarni Koskela authored
-
bnellnm authored
Co-authored-by: Sage Moore <sage@neuralmagic.com>
-
Cyrus Leung authored
-
Alexey Kondratiev(AMD) authored
-
Li, Jiang authored
-
Yang Fan authored
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Pooya Davoodi authored
-
Yangshen⚡Deng authored
Co-authored-by: Roger Wang <ywang@roblox.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-