- 26 Jan, 2024 2 commits
-
-
fxmarty authored
Tested with ``` CUDA_VISIBLE_DEVICES=0 text-generation-launcher --model-id TheBloke/Llama-2-7B-Chat-GPTQ --quantize gptq EXLLAMA_VERSION=1 CUDA_VISIBLE_DEVICES=0 text-generation-launcher --model-id TheBloke/Llama-2-7B-Chat-GPTQ --quantize gptq CUDA_VISIBLE_DEVICES="0,1" text-generation-launcher --model-id TheBloke/Llama-2-7B-Chat-GPTQ --quantize gptq ``` all with good and identical results on MI210. --------- Co-authored-by:
Felix Marty <felix@hf.co> Co-authored-by:
OlivierDehaene <olivier@huggingface.co> Co-authored-by:
OlivierDehaene <23298448+OlivierDehaene@users.noreply.github.com>
-
Nicolas Patry authored
-
- 09 Jan, 2024 1 commit
-
-
OlivierDehaene authored
-
- 21 Dec, 2023 1 commit
-
-
regisss authored
-
- 04 Dec, 2023 1 commit
-
-
fxmarty authored
As per title
-
- 27 Nov, 2023 1 commit
-
-
fxmarty authored
This PR adds support for AMD Instinct MI210 & MI250 GPUs, with paged attention and FAv2 support. Remaining items to discuss, on top of possible others: * Should we have a `ghcr.io/huggingface/text-generation-inference:1.1.0+rocm` hosted image, or is it too early? * Should we set up a CI on MI210/MI250? I don't have access to the runners of TGI though. * Are we comfortable with those changes being directly in TGI, or do we need a fork? --------- Co-authored-by:
Felix Marty <felix@hf.co> Co-authored-by:
OlivierDehaene <olivier@huggingface.co> Co-authored-by:
Your Name <you@example.com>
-
- 09 Oct, 2023 1 commit
-
-
Omar Sanseviero authored
-
- 28 Sep, 2023 1 commit
-
-
OlivierDehaene authored
-
- 27 Sep, 2023 1 commit
-
-
Merve Noyan authored
Added note on serving supported models from a different folder without re-downloading them. --------- Co-authored-by:Nicolas Patry <patry.nicolas@protonmail.com>
-
- 10 Aug, 2023 1 commit
-
-
Merve Noyan authored
I added ToC for docs v1 & started setting up for doc-builder. cc @Narsil @osanseviero --------- Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by:
osanseviero <osanseviero@gmail.com> Co-authored-by:
Mishig <mishig.davaadorj@coloradocollege.edu>
-