"csrc/vscode:/vscode.git/clone" did not exist on "e38074b1e6ad0975acbfa15d858c4bd7cd005e99"
- 28 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 22 Jan, 2025 1 commit
-
-
Cody Yu authored
-
- 05 Jan, 2025 1 commit
-
-
Lancer authored
Co-authored-by:Lancer <maruixiang6688@gmail.com>
-
- 13 Dec, 2024 1 commit
-
-
Sungjae Lee authored
[Core] support LoRA and prompt adapter in content-based hashing for Block Manager v2 prefix caching (#8240)
-
- 23 Nov, 2024 1 commit
-
-
Ricky Xu authored
Signed-off-by:rickyx <rickyx@anyscale.com>
-
- 06 Nov, 2024 1 commit
-
-
Konrad Zawora authored
Signed-off-by:
yuwenzho <yuwen.zhou@intel.com> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Signed-off-by:
Bob Zhu <bob.zhu@intel.com> Signed-off-by:
zehao-intel <zehao.huang@intel.com> Signed-off-by:
Konrad Zawora <kzawora@habana.ai> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com> Co-authored-by:
Sanju C Sudhakaran <scsudhakaran@habana.ai> Co-authored-by:
Michal Adamczyk <madamczyk@habana.ai> Co-authored-by:
Marceli Fylcek <mfylcek@habana.ai> Co-authored-by:
Himangshu Lahkar <49579433+hlahkar@users.noreply.github.com> Co-authored-by:
Vivek Goel <vgoel@habana.ai> Co-authored-by:
yuwenzho <yuwen.zhou@intel.com> Co-authored-by:
Dominika Olszewska <dolszewska@habana.ai> Co-authored-by:
barak goldberg <149692267+bgoldberg-habana@users.noreply.github.com> Co-authored-by:
Michal Szutenberg <37601244+szutenberg@users.noreply.github.com> Co-authored-by:
Jan Kaniecki <jkaniecki@habana.ai> Co-authored-by:
Agata Dobrzyniewicz <160237065+adobrzyniewicz-habana@users.noreply.github.com> Co-authored-by:
Krzysztof Wisniewski <kwisniewski@habana.ai> Co-authored-by:
Dudi Lester <160421192+dudilester@users.noreply.github.com> Co-authored-by:
Ilia Taraban <tarabanil@gmail.com> Co-authored-by:
Chendi.Xue <chendi.xue@intel.com> Co-authored-by:
Michał Kuligowski <mkuligowski@habana.ai> Co-authored-by:
Jakub Maksymczuk <jmaksymczuk@habana.ai> Co-authored-by:
Tomasz Zielinski <85164140+tzielinski-habana@users.noreply.github.com> Co-authored-by:
Sun Choi <schoi@habana.ai> Co-authored-by:
Iryna Boiko <iboiko@habana.ai> Co-authored-by:
Bob Zhu <41610754+czhu15@users.noreply.github.com> Co-authored-by:
hlin99 <73271530+hlin99@users.noreply.github.com> Co-authored-by:
Zehao Huang <zehao.huang@intel.com> Co-authored-by:
Andrzej Kotłowski <Andrzej.Kotlowski@intel.com> Co-authored-by:
Yan Tomsinsky <73292515+Yantom1@users.noreply.github.com> Co-authored-by:
Nir David <ndavid@habana.ai> Co-authored-by:
Yu-Zhou <yu.zhou@intel.com> Co-authored-by:
Ruheena Suhani Shaik <rsshaik@habana.ai> Co-authored-by:
Karol Damaszke <kdamaszke@habana.ai> Co-authored-by:
Marcin Swiniarski <mswiniarski@habana.ai> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
Jacek Czaja <jacek.czaja@intel.com> Co-authored-by:
Jacek Czaja <jczaja@habana.ai> Co-authored-by:
Yuan <yuan.zhou@outlook.com>
-
- 22 Oct, 2024 1 commit
-
-
Kuntai Du authored
-
- 17 Oct, 2024 1 commit
-
-
Kuntai Du authored
Removing the block manager v1. This is the initial piece of prefix-caching-centric design. In order to achieve prefix-caching-centric design, we need to simplify the code path so that we only use v2 block manager (which has much higher performance on prefix caching).
-
- 07 Oct, 2024 1 commit
-
-
sroy745 authored
-
- 29 Sep, 2024 1 commit
-
-
sroy745 authored
-
- 27 Sep, 2024 1 commit
-
-
Varun Sundar Rabindranath authored
Co-authored-by:Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 18 Sep, 2024 1 commit
-
-
Aaron Pham authored
Signed-off-by:
Aaron Pham <contact@aarnphm.xyz> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 27 Aug, 2024 1 commit
-
-
Jonathan Berkhahn authored
-
- 26 Aug, 2024 1 commit
-
-
Cody Yu authored
-
- 19 Aug, 2024 1 commit
-
-
Cody Yu authored
-
- 09 Aug, 2024 1 commit
-
-
Cade Daniel authored
-
- 08 Aug, 2024 1 commit
-
-
Zach Zheng authored
-
- 06 Aug, 2024 1 commit
-
-
afeldman-nm authored
[Core] Subclass ModelRunner to support cross-attention & encoder sequences (towards eventual encoder/decoder model support) (#4942) Co-authored-by:
Andrew Feldman <afeld2012@gmail.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com>
-
- 19 Jul, 2024 1 commit
-
-
Antoni Baum authored
-
- 02 Jul, 2024 1 commit
-
-
Alexander Matveev authored
-
- 15 Jun, 2024 2 commits
-
-
Cyrus Leung authored
-
leiwen83 authored
Signed-off-by:
Lei Wen <wenlei03@qiyi.com> Co-authored-by:
Lei Wen <wenlei03@qiyi.com>
-
- 03 Jun, 2024 1 commit
-
-
Kaiyang Chen authored
-
- 29 May, 2024 1 commit
-
-
afeldman-nm authored
[Core] Cross-attention KV caching and memory-management (towards eventual encoder/decoder model support) (#4837)
-
- 28 May, 2024 1 commit
-
-
Michał Moskal authored
Co-authored-by:Ruth Evans <ruthevans@Ruths-MacBook-Pro.local>
-
- 24 May, 2024 1 commit
-
-
leiwen83 authored
Co-authored-by:Lei Wen <wenlei03@qiyi.com>
-
- 07 May, 2024 1 commit
-
-
youkaichao authored
-
- 02 May, 2024 2 commits
-
-
SangBin Cho authored
-
SangBin Cho authored
Co-authored-by:Cade Daniel <edacih@gmail.com>
-
- 01 May, 2024 1 commit
-
-
leiwen83 authored
Co-authored-by:
Lei Wen <wenlei03@qiyi.com> Co-authored-by:
Sage Moore <sagemoore@utexas.edu>
-
- 23 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 16 Apr, 2024 1 commit
-
-
Cade Daniel authored
-
- 12 Apr, 2024 1 commit
-
-
Michael Feil authored
Co-authored-by:Roger Wang <136131678+ywang96@users.noreply.github.com>
-
- 02 Apr, 2024 1 commit
-
-
Michael Goin authored
-
- 01 Apr, 2024 1 commit
-
-
Cade Daniel authored
-
- 28 Mar, 2024 1 commit
-
-
Cade Daniel authored
-