- 07 Nov, 2024 1 commit
-
-
Flávia Béo authored
Signed-off-by:
Flavia Beo <flavia.beo@ibm.com> Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Co-authored-by:
Max de Bayser <mbayser@br.ibm.com>
-
- 06 Nov, 2024 1 commit
-
-
Konrad Zawora authored
Signed-off-by:
yuwenzho <yuwen.zhou@intel.com> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Signed-off-by:
Bob Zhu <bob.zhu@intel.com> Signed-off-by:
zehao-intel <zehao.huang@intel.com> Signed-off-by:
Konrad Zawora <kzawora@habana.ai> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com> Co-authored-by:
Sanju C Sudhakaran <scsudhakaran@habana.ai> Co-authored-by:
Michal Adamczyk <madamczyk@habana.ai> Co-authored-by:
Marceli Fylcek <mfylcek@habana.ai> Co-authored-by:
Himangshu Lahkar <49579433+hlahkar@users.noreply.github.com> Co-authored-by:
Vivek Goel <vgoel@habana.ai> Co-authored-by:
yuwenzho <yuwen.zhou@intel.com> Co-authored-by:
Dominika Olszewska <dolszewska@habana.ai> Co-authored-by:
barak goldberg <149692267+bgoldberg-habana@users.noreply.github.com> Co-authored-by:
Michal Szutenberg <37601244+szutenberg@users.noreply.github.com> Co-authored-by:
Jan Kaniecki <jkaniecki@habana.ai> Co-authored-by: Agata Dobrzynie...
-
- 04 Nov, 2024 1 commit
-
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 02 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 31 Oct, 2024 1 commit
-
-
Roger Wang authored
Signed-off-by:Roger Wang <ywang@roblox.com>
-
- 30 Oct, 2024 1 commit
-
-
Went-Liang authored
Signed-off-by:Went-Liang <wenteng_liang@163.com>
-
- 27 Oct, 2024 1 commit
-
-
madt2709 authored
-
- 24 Oct, 2024 1 commit
-
-
Vinay R Damodaran authored
Signed-off-by:Vinay Damodaran <vrdn@hey.com>
-
- 22 Oct, 2024 1 commit
-
-
Travis Johnson authored
Signed-off-by:
Travis Johnson <tsjohnso@us.ibm.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 19 Oct, 2024 1 commit
-
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
- 18 Oct, 2024 1 commit
-
-
Cyrus Leung authored
-
- 17 Oct, 2024 1 commit
-
-
Kuntai Du authored
Removing the block manager v1. This is the initial piece of prefix-caching-centric design. In order to achieve prefix-caching-centric design, we need to simplify the code path so that we only use v2 block manager (which has much higher performance on prefix caching).
-
- 16 Oct, 2024 2 commits
-
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Cyrus Leung authored
-
- 11 Oct, 2024 3 commits
-
-
Wallas Henrique authored
Signed-off-by:Wallas Santos <wallashss@ibm.com>
-
Tyler Michael Smith authored
-
Cyrus Leung authored
-
- 07 Oct, 2024 1 commit
-
-
sroy745 authored
-
- 05 Oct, 2024 1 commit
-
-
Chen Zhang authored
Co-authored-by:Roger Wang <ywang@roblox.com>
-
- 04 Oct, 2024 2 commits
-
-
Roger Wang authored
Co-authored-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Michael Goin authored
-
- 03 Oct, 2024 2 commits
-
-
xendo authored
Co-authored-by:Jerzy Zagorski <jzagorsk@amazon.com>
-
sroy745 authored
-
- 02 Oct, 2024 1 commit
-
-
afeldman-nm authored
Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Andrew Feldman <afeld2012@gmail.com>
-
- 01 Oct, 2024 1 commit
-
-
Lily Liu authored
-
- 30 Sep, 2024 1 commit
-
-
Sebastian Schoennenbeck authored
-
- 27 Sep, 2024 1 commit
-
-
Varun Sundar Rabindranath authored
Co-authored-by:Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 23 Sep, 2024 2 commits
-
-
Alexander Matveev authored
-
Alex Brooks authored
Signed-off-by:Alex-Brooks <Alex.Brooks@ibm.com>
-
- 17 Sep, 2024 2 commits
-
-
sroy745 authored
-
Alex Brooks authored
Signed-off-by:Alex-Brooks <Alex.Brooks@ibm.com>
-
- 12 Sep, 2024 2 commits
-
-
Roger Wang authored
[Hotfix][Core][VLM] Disable chunked prefill by default and prefix caching for multimodal models (#8425)
-
youkaichao authored
-
- 11 Sep, 2024 1 commit
-
-
Aarni Koskela authored
-
- 10 Sep, 2024 1 commit
-
-
Cody Yu authored
[MISC] Keep chunked prefill enabled by default with long context when prefix caching is enabled (#8342)
-
- 07 Sep, 2024 1 commit
-
-
Cyrus Leung authored
-
- 06 Sep, 2024 1 commit
-
-
Patrick von Platen authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
-
- 04 Sep, 2024 1 commit
-
-
Harsha vardhan manoj Bikki authored
Co-authored-by:Harsha Bikki <harbikh@amazon.com>
-
- 02 Sep, 2024 1 commit
-
-
Isotr0py authored
-
- 30 Aug, 2024 1 commit
-
-
Cyrus Leung authored
-