- 16 Mar, 2024 1 commit
-
-
Robert Shaw authored
-
- 15 Mar, 2024 2 commits
-
-
Tao He authored
Signed-off-by: Tao He <sighingnow@gmail.com>
Co-authored-by: simon-mo <simon.mo@hey.com>
-
Dan Clark authored
Co-authored-by: declark1 <daniel.clark@ibm.com>
-
- 11 Mar, 2024 2 commits
-
-
Zhuohan Li authored
-
Roy authored
-
- 08 Mar, 2024 1 commit
-
-
Nick Hill authored
-
- 04 Mar, 2024 1 commit
-
-
Antoni Baum authored
Co-authored-by: Avnish Narayan <avnish@anyscale.com>
-
- 03 Mar, 2024 1 commit
-
-
Jason Cox authored
-
- 01 Mar, 2024 3 commits
-
-
Allen.Dou authored
Co-authored-by: zixiao <shunli.dsl@alibaba-inc.com>
Co-authored-by: Simon Mo <simon.mo@hey.com>
-
Huarong authored
Co-authored-by: huohuarong <huohuarong@zuoshouyisheng.com>
-
Seonghyeon authored
-
- 29 Feb, 2024 2 commits
-
-
felixzhu555 authored
Co-authored-by: br3no <breno@veltefaria.de>
Co-authored-by: simon-mo <simon.mo@hey.com>
-
Jae-Won Chung authored
-
- 27 Feb, 2024 1 commit
-
-
Dylan Hawk authored
-
- 26 Feb, 2024 1 commit
-
-
Jared Moore authored
-
- 25 Feb, 2024 1 commit
-
-
Harry Mellor authored
-
- 22 Feb, 2024 1 commit
-
-
Mustafa Eyceoz authored
-
- 21 Feb, 2024 1 commit
-
-
Nick Hill authored
-
- 17 Feb, 2024 1 commit
-
-
jvmncs authored
how to serve the loras (mimicking the [multilora inference example](https://github.com/vllm-project/vllm/blob/main/examples/multilora_inference.py)):

```terminal
$ export LORA_PATH=~/.cache/huggingface/hub/models--yard1--llama-2-7b-sql-lora-test/
$ python -m vllm.entrypoints.api_server \
    --model meta-llama/Llama-2-7b-hf \
    --enable-lora \
    --lora-modules sql-lora=$LORA_PATH sql-lora2=$LORA_PATH
```

the above server will list 3 separate values if the user queries `/models`: one for the base served model, and one each for the specified lora modules. in this case sql-lora and sql-lora2 point to the same underlying lora, but this need not be the case.

lora config values take the same values they do in EngineArgs

no work has been done here to scope client permissions to specific models
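
a rough sketch of the listing behaviour described above, for illustration only. it is not the server's code: `parse_lora_modules` and `served_model_ids` are hypothetical helpers, and the `name=path` splitting is an assumption based on how the `--lora-modules` values are written in the command. it only shows why the example command yields three entries (base model plus one per lora module).

```python
from typing import Dict, List


def parse_lora_modules(specs: List[str]) -> Dict[str, str]:
    """Split `name=path` pairs (the form used with --lora-modules) into a dict.

    Hypothetical helper: the real server may parse these differently.
    """
    modules: Dict[str, str] = {}
    for spec in specs:
        # partition splits on the first "=" only, so paths may contain "="
        name, sep, path = spec.partition("=")
        if not sep or not name or not path:
            raise ValueError(f"expected name=path, got {spec!r}")
        modules[name] = path
    return modules


def served_model_ids(base_model: str, lora_specs: List[str]) -> List[str]:
    """Return the ids a model listing would be expected to contain:
    the base model first, then one id per lora module name."""
    return [base_model, *parse_lora_modules(lora_specs)]


if __name__ == "__main__":
    lora_path = "/path/to/lora"  # placeholder, stands in for $LORA_PATH
    ids = served_model_ids(
        "meta-llama/Llama-2-7b-hf",
        [f"sql-lora={lora_path}", f"sql-lora2={lora_path}"],
    )
    for model_id in ids:
        print(model_id)
```

two module names may share one path, as in the example; the listing is keyed on the names, not the paths.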
-
- 01 Feb, 2024 1 commit
-
-
Simon Mo authored
-
- 25 Jan, 2024 1 commit
-
-
Simon Mo authored
-
- 24 Jan, 2024 1 commit
-
-
Federico Galatolo authored
-
- 23 Jan, 2024 1 commit
-
-
Erfan Al-Hossami authored
-
- 22 Jan, 2024 1 commit
-
-
Jannis Schönleber authored
-
- 19 Jan, 2024 1 commit
-
-
Simon Mo authored
-
- 17 Jan, 2024 1 commit
-
-
FlorianJoncour authored
-
- 12 Jan, 2024 1 commit
-
-
Chirag Jain authored
-
- 05 Jan, 2024 1 commit
-
-
Iskren Ivov Chernev authored
-
- 18 Dec, 2023 1 commit
-
-
Harry Mellor authored
-
- 10 Dec, 2023 2 commits
- 03 Dec, 2023 2 commits
- 01 Dec, 2023 1 commit
-
-
Adam Brusselback authored
-
- 30 Nov, 2023 1 commit
-
-
Michael McCulloch authored
Co-authored-by: Michael McCulloch <mjm.gitlab@fastmail.com>
-
- 27 Nov, 2023 1 commit
-
-
Yunmo Chen authored
-
- 20 Nov, 2023 1 commit
-
-
Simon Mo authored
-
- 18 Nov, 2023 1 commit
-
-
liuyhwangyh authored
-
- 16 Nov, 2023 1 commit
-
-
Iskren Ivov Chernev authored
-
- 01 Nov, 2023 1 commit
-
-
Fluder-Paradyne authored
-