- 11 Apr, 2024 3 commits
-
-
SangBin Cho authored
Co-authored-by: Simon Mo <simon.mo@hey.com>
-
youkaichao authored
[Core][Model] Use torch.compile to accelerate layernorm in commandr (#3985)
-
SangBin Cho authored
-
- 10 Apr, 2024 12 commits
-
-
youkaichao authored
[WIP][Core][Refactor] move vllm/model_executor/parallel_utils into vllm/distributed and vllm/device_communicators (#3950)
-
Travis Johnson authored
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
Co-authored-by: Simon Mo <simon.mo@hey.com>
-
Frαnçois authored
-
Daniel E Marasco authored
-
youkaichao authored
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
-
James Whedbee authored
-
Woosuk Kwon authored
-
Travis Johnson authored
Signed-off-by: Travis Johnson <tsjohnso@us.ibm.com>
-
胡译文 authored
-
zhaotyer authored
Co-authored-by: tianyi_zhao <tianyi.zhao@transwarp.io>
-
Zedong Peng authored
-
Jee Li authored
-
- 09 Apr, 2024 4 commits
-
-
Juan Villamizar authored
Co-authored-by: jpvillam <jpvillam@amd.com>
Co-authored-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Junichi Sato authored
-
Cade Daniel authored
[Misc] [Core] Implement RFC "Augment BaseExecutor interfaces to enable hardware-agnostic speculative decoding" (#3837)
-
youkaichao authored
-
- 08 Apr, 2024 5 commits
-
-
Roy authored
-
Matt Wong authored
-
Kiran R authored
Co-authored-by: roy <jasonailu87@gmail.com>
-
egortolmachev authored
Co-authored-by: Egor Tolmachev <t333ga@gmail.com>
-
ywfang authored
-
- 07 Apr, 2024 3 commits
-
-
Isotr0py authored
-
youkaichao authored
-
youkaichao authored
-
- 06 Apr, 2024 1 commit
-
-
youkaichao authored
-
- 05 Apr, 2024 9 commits
-
-
Isotr0py authored
-
SangBin Cho authored
-
Thomas Parnell authored
-
Woosuk Kwon authored
-
Noam Gat authored
-
Cade Daniel authored
-
Cade Daniel authored
-
youkaichao authored
[CI/Build] fix pip cache with vllm_nccl & refactor dockerfile to build wheels (#3859)
-
Sean Gallen authored
Co-authored-by: Simon Mo <simon.mo@hey.com>
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
-
- 04 Apr, 2024 3 commits
-
-
youkaichao authored
-
Saurabh Dash authored
-
Michael Goin authored
-