"vscode:/vscode.git/clone" did not exist on "698be79e9c32331ce2dfbdc59ba6e07cf9f38bc0"
- 28 Feb, 2024 1 commit
-
-
Ganesh Jagadeesan authored
-
- 27 Feb, 2024 2 commits
-
-
Woosuk Kwon authored
-
张大成 authored
Co-authored-by:
zhangdacheng <zhangdacheng@ainirobot.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 21 Feb, 2024 1 commit
-
-
Zhuohan Li authored
-
- 19 Feb, 2024 1 commit
-
-
Isotr0py authored
-
- 17 Feb, 2024 1 commit
-
-
jvmncs authored
how to serve the loras (mimicking the [multilora inference example](https://github.com/vllm-project/vllm/blob/main/examples/multilora_inference.py)): ```terminal $ export LORA_PATH=~/.cache/huggingface/hub/models--yard1--llama-2-7b-sql-lora-test/ $ python -m vllm.entrypoints.api_server \ --model meta-llama/Llama-2-7b-hf \ --enable-lora \ --lora-modules sql-lora=$LORA_PATH sql-lora2=$LORA_PATH ``` the above server will list 3 separate values if the user queries `/models`: one for the base served model, and one each for the specified lora modules. in this case sql-lora and sql-lora2 point to the same underlying lora, but this need not be the case. lora config values take the same values they do in EngineArgs no work has been done here to scope client permissions to specific models
-
- 13 Feb, 2024 1 commit
-
-
Philipp Moritz authored
Co-authored-by:Roy <jasonailu87@gmail.com>
-
- 12 Feb, 2024 1 commit
-
-
Philipp Moritz authored
-
- 01 Feb, 2024 1 commit
-
-
Fengzhe Zhou authored
-
- 25 Jan, 2024 1 commit
-
-
Junyang Lin authored
-
- 24 Jan, 2024 1 commit
-
-
LastWhisper authored
-
- 22 Jan, 2024 1 commit
-
-
Junyang Lin authored
-
- 17 Jan, 2024 1 commit
-
-
Hyunsung Lee authored
-
- 03 Jan, 2024 1 commit
-
-
Zhuohan Li authored
-
- 20 Dec, 2023 1 commit
-
-
Ronen Schaffer authored
-
- 19 Dec, 2023 1 commit
-
-
avideci authored
-
- 17 Dec, 2023 2 commits
-
-
Suhong Moon authored
-
Woosuk Kwon authored
-
- 14 Dec, 2023 1 commit
-
-
Antoni Baum authored
-
- 13 Dec, 2023 1 commit
-
-
Woosuk Kwon authored
-
- 11 Dec, 2023 1 commit
-
-
Woosuk Kwon authored
-
- 06 Dec, 2023 1 commit
-
-
Peter Götz authored
adpated -> adapted
-
- 01 Dec, 2023 1 commit
-
-
Woosuk Kwon authored
-
- 30 Nov, 2023 1 commit
-
-
Simon Mo authored
-
- 22 Nov, 2023 1 commit
-
-
Casper authored
-
- 18 Nov, 2023 1 commit
-
-
liuyhwangyh authored
-
- 17 Nov, 2023 1 commit
-
-
Zhuohan Li authored
-
- 16 Nov, 2023 1 commit
-
-
Zhuohan Li authored
-
- 29 Sep, 2023 1 commit
-
-
Usama Ahmed authored
-
- 28 Sep, 2023 1 commit
-
-
Woosuk Kwon authored
-
- 05 Sep, 2023 1 commit
-
-
Zhuohan Li authored
-
- 31 Aug, 2023 1 commit
-
-
Woosuk Kwon authored
* Minor fix in supported models * Add another small fix for Aquila model --------- Co-authored-by:Zhuohan Li <zhuohan123@gmail.com>
-
- 22 Aug, 2023 1 commit
-
-
Zhuohan Li authored
-
- 14 Aug, 2023 1 commit
-
-
Uranus authored
-
- 02 Aug, 2023 2 commits
-
-
Zhuohan Li authored
-
Zhuohan Li authored
-
- 25 Jul, 2023 1 commit
-
-
Zhuohan Li authored
-
- 20 Jul, 2023 1 commit
-
-
Zhuohan Li authored
-
- 09 Jul, 2023 1 commit
-
-
Andre Slavescu authored
Co-authored-by:woWoosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 03 Jul, 2023 1 commit
-
-
Woosuk Kwon authored
-