- 16 Mar, 2024 3 commits
- 15 Mar, 2024 1 commit
-
-
Antoni Baum authored
-
- 14 Mar, 2024 1 commit
-
-
陈序 authored
Co-authored-by: Cade Daniel <edacih@gmail.com>
-
- 13 Mar, 2024 4 commits
-
-
Terry authored
-
Or Sharir authored
Add missing kernel for CodeLlama-34B on A/H100 (no tensor parallelism) when using Multi-LoRA. (#3350)
-
Woosuk Kwon authored
-
Breno Faria authored
-
- 11 Mar, 2024 3 commits
-
-
Zhuohan Li authored
-
Zhuohan Li authored
-
Roy authored
-
- 10 Mar, 2024 1 commit
-
-
Terry authored
-
- 09 Mar, 2024 1 commit
-
-
Cade Daniel authored
-
- 08 Mar, 2024 1 commit
-
-
ElizaWszola authored
-
- 07 Mar, 2024 2 commits
-
-
jacobthebanana authored
Possible fix for conflict between Automated Prefix Caching (#2762) and multi-LoRA support (#1804) (#3263)
-
Woosuk Kwon authored
-
- 06 Mar, 2024 2 commits
-
-
Cade Daniel authored
-
SangBin Cho authored
-
- 05 Mar, 2024 1 commit
-
-
Nick Hill authored
-
- 04 Mar, 2024 2 commits
-
-
Antoni Baum authored
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
-
Antoni Baum authored
Co-authored-by: Avnish Narayan <avnish@anyscale.com>
-
- 02 Mar, 2024 1 commit
-
-
Sage Moore authored
Co-authored-by: ElizaWszola <eliza@neuralmagic.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
- 01 Mar, 2024 1 commit
-
-
Robert Shaw authored
Co-authored-by: Robert Shaw <114415538+rib-2@users.noreply.github.com>
Co-authored-by: alexm <alexm@neuralmagic.com>
-
- 29 Feb, 2024 2 commits
-
-
felixzhu555 authored
Co-authored-by: br3no <breno@veltefaria.de>
Co-authored-by: simon-mo <simon.mo@hey.com>
-
Seonghyeon authored
-
- 28 Feb, 2024 2 commits
-
-
Woosuk Kwon authored
-
Liangfu Chen authored
-
- 27 Feb, 2024 2 commits
-
-
Tao He authored
Signed-off-by: Tao He <sighingnow@gmail.com>
-
Dylan Hawk authored
-
- 26 Feb, 2024 1 commit
-
-
Jared Moore authored
-
- 25 Feb, 2024 1 commit
-
-
Harry Mellor authored
-
- 22 Feb, 2024 3 commits
-
-
Ronen Schaffer authored
-
Woosuk Kwon authored
-
Massimiliano Pronesti authored
-
- 21 Feb, 2024 2 commits
-
-
Nick Hill authored
-
Antoni Baum authored
-
- 20 Feb, 2024 1 commit
-
-
Zhuohan Li authored
-
- 19 Feb, 2024 2 commits
-
-
Ronen Schaffer authored
-
Isotr0py authored
-