- 06 Nov, 2024 1 commit
-
-
Sungjae Lee authored
Co-authored-by:LiuXiaoxuanPKU <lilyliupku@gmail.com>
-
- 02 Nov, 2024 1 commit
-
-
youkaichao authored
Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 28 Oct, 2024 1 commit
-
-
wangshuai09 authored
Signed-off-by:wangshuai09 <391746016@qq.com>
-
- 18 Oct, 2024 1 commit
-
-
Cody Yu authored
-
- 17 Oct, 2024 1 commit
-
-
Kuntai Du authored
Removing the block manager v1. This is the initial piece of prefix-caching-centric design. In order to achieve prefix-caching-centric design, we need to simplify the code path so that we only use v2 block manager (which has much higher performance on prefix caching).
-
- 12 Oct, 2024 1 commit
-
-
Lily Liu authored
-
- 10 Oct, 2024 1 commit
-
-
sroy745 authored
[Core] Add an environment variable which needs to be set explicitly to allow BlockSpaceManagerV1 (#9149)
-
- 03 Oct, 2024 1 commit
-
-
sroy745 authored
-
- 01 Oct, 2024 2 commits
- 25 Sep, 2024 1 commit
-
-
Travis Johnson authored
Signed-off-by:Travis Johnson <tsjohnso@us.ibm.com>
-
- 22 Sep, 2024 1 commit
-
-
Lily Liu authored
-
- 21 Sep, 2024 1 commit
-
-
Cyrus Leung authored
-
- 11 Sep, 2024 1 commit
-
-
Lily Liu authored
Co-authored-by:youkaichao <youkaichao@126.com>
-
- 02 Sep, 2024 1 commit
-
-
Lily Liu authored
-
- 30 Aug, 2024 1 commit
-
-
afeldman-nm authored
-
- 29 Aug, 2024 1 commit
-
-
Jonas M. Kübler authored
-
- 25 Aug, 2024 1 commit
-
-
Nick Hill authored
-
- 22 Aug, 2024 2 commits
-
-
Travis Johnson authored
Signed-off-by:Travis Johnson <tsjohnso@us.ibm.com>
-
Abhinav Goyal authored
-
- 20 Aug, 2024 1 commit
-
-
Abhinav Goyal authored
-
- 19 Aug, 2024 1 commit
-
-
SangBin Cho authored
-
- 16 Aug, 2024 2 commits
-
-
jon-chuang authored
-
shangmingc authored
-
- 14 Aug, 2024 1 commit
-
-
Wallas Henrique authored
Signed-off-by:
Wallas Santos <wallashss@ibm.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com> Co-authored-by:
youkaichao <youkaichao@126.com>
-
- 09 Aug, 2024 1 commit
-
-
Travis Johnson authored
Signed-off-by:Travis Johnson <tsjohnso@us.ibm.com>
-
- 05 Aug, 2024 1 commit
-
-
Cade Daniel authored
-
- 30 Jul, 2024 1 commit
-
-
Nick Hill authored
-
- 24 Jul, 2024 2 commits
- 21 Jul, 2024 1 commit
-
-
sroy745 authored
[Spec Decode] Disable Log Prob serialization to CPU for spec decoding for both draft and target models. (#6485)
-
- 19 Jul, 2024 4 commits
-
-
Thomas Parnell authored
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Woo-Yeon Lee authored
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com>
-
- 17 Jul, 2024 1 commit
-
-
Alexander Matveev authored
-
- 16 Jul, 2024 1 commit
-
-
Cody Yu authored
-
- 10 Jul, 2024 2 commits
-
-
sroy745 authored
[Speculative Decoding] Enabling bonus token in speculative decoding for KV cache based models (#5765)
-
Abhinav Goyal authored
-
- 09 Jul, 2024 1 commit
-
-
Swapnil Parekh authored
Co-authored-by:
Swapnil Parekh <swapnilp@ibm.com> Co-authored-by:
Joe G <joseph.granados@h2o.ai> Co-authored-by:
Antoni Baum <antoni.baum@protonmail.com>
-