- 05 Aug, 2024 1 commit
-
-
Cyrus Leung authored
-
- 03 Aug, 2024 1 commit
-
-
Robert Shaw authored
Signed-off-by:
Joe Runde <Joseph.Runde@ibm.com> Co-authored-by:
Joe Runde <Joseph.Runde@ibm.com> Co-authored-by:
Joe Runde <joe@joerun.de> Co-authored-by:
Nick Hill <nickhill@us.ibm.com> Co-authored-by:
Simon Mo <simon.mo@hey.com>
-
- 31 Jul, 2024 1 commit
-
-
Simon Mo authored
Co-authored-by:Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>
-
- 27 Jul, 2024 1 commit
-
-
Cyrus Leung authored
-
- 25 Jul, 2024 1 commit
-
-
Evan Z. Liu authored
-
- 24 Jul, 2024 1 commit
-
-
Daniele authored
-
- 22 Jul, 2024 1 commit
-
-
Cyrus Leung authored
Co-authored-by:Roger Wang <ywang@roblox.com>
-
- 19 Jul, 2024 2 commits
-
-
Daniele authored
-
Cyrus Leung authored
-
- 18 Jul, 2024 1 commit
-
-
Nick Hill authored
Co-authored-by:Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 16 Jul, 2024 1 commit
-
-
sasha0552 authored
-
- 14 Jul, 2024 1 commit
-
-
Ethan Xu authored
Co-authored-by:simon-mo <simon.mo@hey.com>
-
- 09 Jul, 2024 1 commit
-
-
Swapnil Parekh authored
Co-authored-by:
Swapnil Parekh <swapnilp@ibm.com> Co-authored-by:
Joe G <joseph.granados@h2o.ai> Co-authored-by:
Antoni Baum <antoni.baum@protonmail.com>
-
- 07 Jul, 2024 1 commit
-
-
youkaichao authored
Co-authored-by:Cody Yu <hao.yu.cody@gmail.com>
-
- 02 Jul, 2024 1 commit
-
-
xwjiang2010 authored
Signed-off-by:
Xiaowei Jiang <xwjiang2010@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
- 26 Jun, 2024 1 commit
-
-
sasha0552 authored
-
- 13 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 10 Jun, 2024 1 commit
-
-
Roger Wang authored
-
- 01 Jun, 2024 1 commit
-
-
Nadav Shmayovits authored
-
- 16 May, 2024 1 commit
-
-
Pierre Dulac authored
-
- 11 May, 2024 1 commit
-
-
Chang Su authored
-
- 09 May, 2024 1 commit
-
-
Cyrus Leung authored
-
- 06 May, 2024 1 commit
-
-
Cyrus Leung authored
-
- 03 May, 2024 1 commit
-
-
Yang, Bo authored
-
- 02 May, 2024 1 commit
-
-
youkaichao authored
-
- 01 May, 2024 1 commit
-
-
Robert Shaw authored
-
- 26 Apr, 2024 1 commit
-
-
SangBin Cho authored
Co-authored-by:Danny Guinther <dguinther@neuralmagic.com>
-
- 23 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 18 Apr, 2024 1 commit
-
-
Harry Mellor authored
Co-authored-by:Alexandre Payot <alexandrep@graphcore.ai>
-
- 02 Apr, 2024 1 commit
-
-
A-Mahla authored
Co-authored-by: A-Mahla <>
-
- 29 Mar, 2024 1 commit
-
-
yhu422 authored
-
- 25 Mar, 2024 1 commit
-
-
SangBin Cho authored
-
- 19 Mar, 2024 1 commit
-
-
Simon Mo authored
-
- 15 Mar, 2024 1 commit
-
-
Dan Clark authored
Co-authored-by:declark1 <daniel.clark@ibm.com>
-
- 11 Mar, 2024 1 commit
-
-
Zhuohan Li authored
-
- 08 Mar, 2024 1 commit
-
-
Nick Hill authored
-
- 03 Mar, 2024 1 commit
-
-
Jason Cox authored
-
- 01 Mar, 2024 1 commit
-
-
Allen.Dou authored
Co-authored-by:
zixiao <shunli.dsl@alibaba-inc.com> Co-authored-by:
Simon Mo <simon.mo@hey.com>
-
- 25 Feb, 2024 1 commit
-
-
Harry Mellor authored
-
- 17 Feb, 2024 1 commit
-
-
jvmncs authored
how to serve the loras (mimicking the [multilora inference example](https://github.com/vllm-project/vllm/blob/main/examples/multilora_inference.py)): ```terminal $ export LORA_PATH=~/.cache/huggingface/hub/models--yard1--llama-2-7b-sql-lora-test/ $ python -m vllm.entrypoints.api_server \ --model meta-llama/Llama-2-7b-hf \ --enable-lora \ --lora-modules sql-lora=$LORA_PATH sql-lora2=$LORA_PATH ``` the above server will list 3 separate values if the user queries `/models`: one for the base served model, and one each for the specified lora modules. in this case sql-lora and sql-lora2 point to the same underlying lora, but this need not be the case. lora config values take the same values they do in EngineArgs no work has been done here to scope client permissions to specific models
-