- 04 Dec, 2023 1 commit
fxmarty authored
As per title
-
- 27 Nov, 2023 1 commit
fxmarty authored
This PR adds support for AMD Instinct MI210 & MI250 GPUs, with paged attention and FAv2 support.

Remaining items to discuss, on top of possible others:
* Should we have a `ghcr.io/huggingface/text-generation-inference:1.1.0+rocm` hosted image, or is it too early?
* Should we set up a CI on MI210/MI250? I don't have access to the runners of TGI though.
* Are we comfortable with those changes being directly in TGI, or do we need a fork?

---------

Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: OlivierDehaene <olivier@huggingface.co>
Co-authored-by: Your Name <you@example.com>
-
- 09 Oct, 2023 1 commit
Omar Sanseviero authored
-
- 28 Sep, 2023 1 commit
OlivierDehaene authored
-
- 27 Sep, 2023 1 commit
Merve Noyan authored
Added note on serving supported models from a different folder without re-downloading them.

---------

Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
-
- 10 Aug, 2023 1 commit
Merve Noyan authored
I added ToC for docs v1 & started setting up for doc-builder.

cc @Narsil @osanseviero

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: osanseviero <osanseviero@gmail.com>
Co-authored-by: Mishig <mishig.davaadorj@coloradocollege.edu>
-