- 25 Mar, 2024 6 commits
-
-
Dylan Hawk authored
Co-authored-by:Dylan Hawk <dylanwawk@gmail.com>
-
Travis Johnson authored
Signed-off-by:
Travis Johnson <tsjohnso@us.ibm.com> Co-authored-by:
Nick Hill <nickhill@us.ibm.com>
-
Swapnil Parekh authored
Co-authored-by:Swapnil Parekh <swapnilp@ibm.com>
-
SangBin Cho authored
-
Woosuk Kwon authored
-
youkaichao authored
-
- 24 Mar, 2024 2 commits
-
-
youkaichao authored
-
Nick Hill authored
-
- 22 Mar, 2024 4 commits
-
-
Antoni Baum authored
Co-authored-by:MeloYang <meloyang05@gmail.com>
-
Thomas Parnell authored
Co-authored-by:Jan van Lunteren <jvl@zurich.ibm.com>
-
Zhuohan Li authored
-
Roy authored
-
- 20 Mar, 2024 5 commits
-
-
Roy authored
-
SangBin Cho authored
-
Antoni Baum authored
Co-authored-by:Roger Wang <136131678+ywang96@users.noreply.github.com>
-
Woosuk Kwon authored
-
ElizaWszola authored
[PREFIX CACHING FOLLOW UP] A bunch of fixes to block allocator performance when automatic prefix caching is disabled (#3357) Co-authored-by:Zhuohan Li <zhuohan123@gmail.com>
-
- 18 Mar, 2024 1 commit
-
-
Robert Shaw authored
-
- 16 Mar, 2024 3 commits
- 15 Mar, 2024 1 commit
-
-
Antoni Baum authored
-
- 14 Mar, 2024 1 commit
-
-
陈序 authored
Co-authored-by:Cade Daniel <edacih@gmail.com>
-
- 13 Mar, 2024 4 commits
-
-
Terry authored
-
Or Sharir authored
Add missing kernel for CodeLlama-34B on A/H100 (no tensor parallelism) when using Multi-LoRA. (#3350)
-
Woosuk Kwon authored
-
Breno Faria authored
-
- 11 Mar, 2024 3 commits
-
-
Zhuohan Li authored
-
Zhuohan Li authored
-
Roy authored
-
- 10 Mar, 2024 1 commit
-
-
Terry authored
-
- 09 Mar, 2024 1 commit
-
-
Cade Daniel authored
-
- 08 Mar, 2024 1 commit
-
-
ElizaWszola authored
-
- 07 Mar, 2024 2 commits
-
-
jacobthebanana authored
Possible fix for conflict between Automated Prefix Caching (#2762) and multi-LoRA support (#1804) (#3263)
-
Woosuk Kwon authored
-
- 06 Mar, 2024 2 commits
-
-
Cade Daniel authored
-
SangBin Cho authored
-
- 05 Mar, 2024 1 commit
-
-
Nick Hill authored
-
- 04 Mar, 2024 2 commits
-
-
Antoni Baum authored
Co-authored-by:Zhuohan Li <zhuohan123@gmail.com>
-
Antoni Baum authored
Co-authored-by:Avnish Narayan <avnish@anyscale.com>
-