- 09 May, 2024 1 commit
-
-
Cyrus Leung authored
-
- 03 May, 2024 1 commit
-
-
Sebastian Schoennenbeck authored
-
- 01 May, 2024 2 commits
-
-
sasha0552 authored
-
Robert Caulk authored
-
- 30 Apr, 2024 1 commit
-
-
Florian Greinacher authored
Co-authored-by: Lily Liu <lilyliupku@gmail.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
- 27 Apr, 2024 1 commit
-
-
Cyrus Leung authored
-
- 20 Apr, 2024 2 commits
-
-
nunjunj authored
-
Ayush Rautwar authored
Co-authored-by: Ubuntu <ubuntu@ip-172-31-13-147.ec2.internal>
-
- 18 Apr, 2024 1 commit
-
-
James Whedbee authored
-
- 16 Apr, 2024 1 commit
-
-
Noam Gat authored
Co-authored-by: Simon Mo <simon.mo@hey.com>
-
- 11 Apr, 2024 2 commits
-
-
Dylan Hawk authored
Co-authored-by: Dylan Hawk <dylanwawk@gmail.com>
-
SangBin Cho authored
-
- 07 Apr, 2024 1 commit
-
-
youkaichao authored
-
- 29 Mar, 2024 1 commit
-
-
Roy authored
-
- 25 Mar, 2024 2 commits
-
-
Dylan Hawk authored
Co-authored-by: Dylan Hawk <dylanwawk@gmail.com>
-
SangBin Cho authored
-
- 16 Mar, 2024 1 commit
-
-
Simon Mo authored
-
- 11 Mar, 2024 1 commit
-
-
Zhuohan Li authored
-
- 04 Mar, 2024 1 commit
-
-
Antoni Baum authored
Co-authored-by: Avnish Narayan <avnish@anyscale.com>
-
- 29 Feb, 2024 1 commit
-
-
felixzhu555 authored
Co-authored-by: br3no <breno@veltefaria.de>
Co-authored-by: simon-mo <simon.mo@hey.com>
-
- 27 Feb, 2024 1 commit
-
-
Dylan Hawk authored
-
- 26 Feb, 2024 1 commit
-
-
Jared Moore authored
-
- 17 Feb, 2024 1 commit
-
-
jvmncs authored
how to serve the loras (mimicking the [multilora inference example](https://github.com/vllm-project/vllm/blob/main/examples/multilora_inference.py)):

```terminal
$ export LORA_PATH=~/.cache/huggingface/hub/models--yard1--llama-2-7b-sql-lora-test/
$ python -m vllm.entrypoints.api_server \
    --model meta-llama/Llama-2-7b-hf \
    --enable-lora \
    --lora-modules sql-lora=$LORA_PATH sql-lora2=$LORA_PATH
```

the above server will list 3 separate values if the user queries `/models`: one for the base served model, and one each for the specified lora modules. in this case sql-lora and sql-lora2 point to the same underlying lora, but this need not be the case.

lora config values take the same values they do in EngineArgs

no work has been done here to scope client permissions to specific models
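
a rough client-side sketch of the `/models` check described above. nothing here is taken from the commit itself: the base URL `http://localhost:8000`, the helper names `list_model_ids` and `fetch_model_ids`, and the assumption that the response is an OpenAI-style listing (`{"object": "list", "data": [{"id": ...}]}`) are all illustrative, and the route may sit under a prefix such as `/v1` depending on the entrypoint.

```python
import json
from urllib.request import urlopen


def list_model_ids(payload: dict) -> list:
    """Return the `id` of every entry in a models listing.

    Assumes an OpenAI-style payload: {"object": "list", "data": [{"id": ...}, ...]}.
    """
    return [entry["id"] for entry in payload.get("data", [])]


def fetch_model_ids(base_url: str = "http://localhost:8000") -> list:
    """Query the (assumed) `/models` route of a running server and list model ids."""
    with urlopen(f"{base_url}/models") as resp:
        return list_model_ids(json.load(resp))


# Hypothetical payload for the server started above: the base model
# plus one entry per `--lora-modules` name.
sample = {
    "object": "list",
    "data": [
        {"id": "meta-llama/Llama-2-7b-hf"},
        {"id": "sql-lora"},
        {"id": "sql-lora2"},
    ],
}
ids = list_model_ids(sample)
```

with a server running, `fetch_model_ids()` would be the call to make; the sample payload only illustrates the three entries the message describes.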
-
- 25 Jan, 2024 1 commit
-
-
Simon Mo authored
-
- 19 Jan, 2024 1 commit
-
-
Simon Mo authored
-
- 17 Jan, 2024 1 commit
-
-
FlorianJoncour authored
-