- 05 Sep, 2024 9 commits
-
-
sroy745 authored
[Documentation][Spec Decode] Add documentation about lossless guarantees in Speculative Decoding in vLLM (#7962)
-
Michael Goin authored
-
Alex Brooks authored
Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
manikandan.tm@zucisystems.com authored
-
Cyrus Leung authored
-
Elfie Guo authored
-
Kevin H. Luu authored
Signed-off-by: kevin <kevin@anyscale.com>
-
Woosuk Kwon authored
-
William Lin authored
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
- 04 Sep, 2024 15 commits
-
-
Maureen McElaney authored
-
Simon Mo authored
-
Harsha vardhan manoj Bikki authored
Co-authored-by: Harsha Bikki <harbikh@amazon.com>
-
Cody Yu authored
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
Kyle Mistele authored
Co-authored-by: constellate <constellate@1-ai-appserver-staging.codereach.com>
Co-authored-by: Kyle Mistele <kyle@constellate.ai>
-
Woosuk Kwon authored
-
alexeykondrat authored
Co-authored-by: Simon Mo <simon.mo@hey.com>
-
Cody Yu authored
-
wnma authored
-
TimWang authored
-
Cyrus Leung authored
-
Peter Salas authored
-
Dipika Sikka authored
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
Woosuk Kwon authored
-
Nick Hill authored
-
- 03 Sep, 2024 11 commits
-
-
Dipika Sikka authored
-
Simon Mo authored
-
Woosuk Kwon authored
-
Kevin H. Luu authored
Signed-off-by: kevin <kevin@anyscale.com>
-
tomeras91 authored
-
Antoni Baum authored
-
Alexander Matveev authored
-
Kevin H. Luu authored
Signed-off-by: kevin <kevin@anyscale.com>
-
Cody Yu authored
Co-authored-by: Tao He <sighingnow@gmail.com>
Co-authored-by: Juelianqvq <Juelianqvq@noreply.github.com>
-
Isotr0py authored
-
Woosuk Kwon authored
-
- 02 Sep, 2024 5 commits
-
-
wang.yuqi authored
[Bugfix] Fix #7592 vllm 0.5.4 enable_chunked_prefill throughput is slightly lower than 0.5.3~0.5.0. (#7874)
-
Isotr0py authored
-
Isotr0py authored
-
Woosuk Kwon authored
-
Lily Liu authored
-