- 07 Dec, 2024 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 05 Dec, 2024 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 08 Nov, 2024 1 commit
-
-
whyiug authored
Signed-off-by: whyiug <whyiug@hotmail.com>
-
- 17 Oct, 2024 1 commit
-
-
Kuntai Du authored
Remove block manager v1. This is the first step toward a prefix-caching-centric design: to get there, we need to simplify the code path so that only the v2 block manager is used, which performs much better on prefix caching.
-
- 15 Oct, 2024 1 commit
-
-
Michael Goin authored
-
- 05 Sep, 2024 1 commit
-
-
sroy745 authored
[Documentation][Spec Decode] Add documentation about lossless guarantees in Speculative Decoding in vLLM (#7962)
-
- 07 Aug, 2024 1 commit
-
-
Stas Bekman authored
-
- 05 Aug, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
-
- 14 Jul, 2024 1 commit
-
-
Yuan Tang authored
-
- 11 Jun, 2024 2 commits
-
-
Cade Daniel authored
-
Cade Daniel authored
-