- 17 Oct, 2024 1 commit
-
-
Kuntai Du authored
Removing the block manager v1. This is the initial piece of prefix-caching-centric design. In order to achieve prefix-caching-centric design, we need to simplify the code path so that we only use v2 block manager (which has much higher performance on prefix caching).
-
- 02 Oct, 2024 1 commit
-
-
afeldman-nm authored
Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Andrew Feldman <afeld2012@gmail.com>
-
- 28 Sep, 2024 1 commit
-
-
Varun Sundar Rabindranath authored
-
- 27 Sep, 2024 1 commit
-
-
Varun Sundar Rabindranath authored
Co-authored-by:Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 18 Sep, 2024 1 commit
-
-
afeldman-nm authored
-
- 12 Sep, 2024 1 commit
-
-
William Lin authored
-
- 06 Sep, 2024 1 commit
-
-
afeldman-nm authored
-
- 03 Sep, 2024 1 commit
-
-
Alexander Matveev authored
-
- 30 Aug, 2024 1 commit
-
-
afeldman-nm authored
-
- 29 Aug, 2024 1 commit
-
-
Alexander Matveev authored
-
- 27 Aug, 2024 2 commits
-
-
Nick Hill authored
-
Megha Agarwal authored
Co-authored-by:Alexander Matveev <alexm@neuralmagic.com>
-
- 23 Aug, 2024 1 commit
-
-
Alexander Matveev authored
-
- 19 Aug, 2024 1 commit
-
-
William Lin authored
Co-authored-by:afeldman-nm <156691304+afeldman-nm@users.noreply.github.com>
-