- 11 Nov, 2024 1 commit
-
-
Yangcheng Li authored
-
- 17 Oct, 2024 1 commit
-
-
Kuntai Du authored
Remove block manager v1. This is the first piece of the prefix-caching-centric design. To achieve that design, we need to simplify the code path so that only the v2 block manager is used (it performs much better on prefix caching).
-
- 11 Oct, 2024 1 commit
-
-
homeffjy authored
-
- 07 Oct, 2024 1 commit
-
-
sroy745 authored
-
- 29 Sep, 2024 1 commit
-
-
sroy745 authored
-
- 27 Sep, 2024 1 commit
-
-
Varun Sundar Rabindranath authored
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 18 Sep, 2024 1 commit
-
-
Aaron Pham authored
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 28 Aug, 2024 1 commit
-
-
Cody Yu authored
-
- 27 Aug, 2024 1 commit
-
-
Jonathan Berkhahn authored
-
- 26 Aug, 2024 1 commit
-
-
Cody Yu authored
-
- 19 Aug, 2024 1 commit
-
-
Cody Yu authored
-
- 02 Jul, 2024 2 commits
-
-
Murali Andoorveedu authored
Signed-off-by: Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
-
Alexander Matveev authored
-
- 15 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 03 Jun, 2024 1 commit
-
-
Kaiyang Chen authored
-
- 29 May, 2024 1 commit
-
-
afeldman-nm authored
[Core] Cross-attention KV caching and memory-management (towards eventual encoder/decoder model support) (#4837)
-
- 28 May, 2024 1 commit
-
-
Michał Moskal authored
Co-authored-by: Ruth Evans <ruthevans@Ruths-MacBook-Pro.local>
-
- 08 May, 2024 1 commit
-
-
youkaichao authored
-
- 07 May, 2024 1 commit
-
-
youkaichao authored
-
- 02 May, 2024 2 commits
-
-
SangBin Cho authored
-
SangBin Cho authored
Co-authored-by: Cade Daniel <edacih@gmail.com>
-
- 01 May, 2024 1 commit
-
-
leiwen83 authored
Co-authored-by: Lei Wen <wenlei03@qiyi.com>
Co-authored-by: Sage Moore <sagemoore@utexas.edu>
-
- 15 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 12 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 01 Apr, 2024 1 commit
-
-
Cade Daniel authored
-
- 28 Mar, 2024 1 commit
-
-
Cade Daniel authored
-