- 19 Jul, 2023 1 commit
-
-
OlivierDehaene authored
-
- 13 Jul, 2023 2 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
-
- 10 Jul, 2023 1 commit
-
-
OlivierDehaene authored
Close #571
-
- 05 Jul, 2023 1 commit
-
-
OlivierDehaene authored
# What does this PR do? In title. Adds argument `--hostname` in router to support something like `--hostname ::`. Tested with ```commandline cargo run -- --port 8080 --hostname :: curl -I -X GET 'http://[::1]:8080/health ' # failed before this commit ``` Trigger CI --------- Co-authored-by:
Phil Chen <philchen2000@gmail.com>
-
- 01 Jul, 2023 1 commit
-
-
OlivierDehaene authored
-
- 30 Jun, 2023 2 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
Closes #478
-
- 28 Jun, 2023 1 commit
-
-
Robert Kimball authored
# This PR adds an http header option to disable buffering for the generate_stream endpoint response stream. Problem: If a model is run behind a proxy server such as nginx that has buffering enabled then the response stream from generate_stream gets aggregated into a single response which basically disables streaming. Instead of getting a chunked response where each token is presented over time the response presents everything all at once. Solution: This change adds the `X-Accel-Buffering` http header which disables buffering for the generate_stream response, allowing the response to stream properly.
-
- 23 Jun, 2023 1 commit
-
-
OlivierDehaene authored
-
- 16 Jun, 2023 1 commit
-
-
OlivierDehaene authored
-
- 05 Jun, 2023 1 commit
-
-
OlivierDehaene authored
-
- 02 Jun, 2023 1 commit
-
-
OlivierDehaene authored
Close #288
-
- 24 May, 2023 1 commit
-
-
OlivierDehaene authored
Closes #307 #308
-
- 23 May, 2023 1 commit
-
-
OlivierDehaene authored
@njhill FYI
-
- 16 May, 2023 1 commit
-
-
OlivierDehaene authored
Fixes #333 --------- Co-authored-by:Nicolas Patry <patry.nicolas@protonmail.com>
-
- 09 May, 2023 2 commits
-
-
OlivierDehaene authored
-
Sai Vinay G authored
-
- 04 May, 2023 1 commit
-
-
Nicolas Patry authored
-
- 02 May, 2023 1 commit
-
-
Nicolas Patry authored
-
- 27 Apr, 2023 1 commit
-
-
Ehsan M. Kermani authored
-
- 26 Apr, 2023 2 commits
-
-
Nicolas Patry authored
Co-authored-by:
OlivierDehaene <23298448+OlivierDehaene@users.noreply.github.com> Co-authored-by:
OlivierDehaene <olivier@huggingface.co>
-
Nicolas Patry authored
-
- 25 Apr, 2023 2 commits
-
-
Nicolas Patry authored
-
OlivierDehaene authored
-
- 24 Apr, 2023 1 commit
-
-
OlivierDehaene authored
Co-authored-by:Nick Hill <nickhill@us.ibm.com>
-
- 21 Apr, 2023 1 commit
-
-
OlivierDehaene authored
-
- 20 Apr, 2023 1 commit
-
-
OlivierDehaene authored
-
- 19 Apr, 2023 1 commit
-
-
OlivierDehaene authored
-
- 18 Apr, 2023 1 commit
-
-
OlivierDehaene authored
close #125
-
- 17 Apr, 2023 1 commit
-
-
OlivierDehaene authored
closes #189
-
- 09 Apr, 2023 2 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
-
- 30 Mar, 2023 2 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
-
- 29 Mar, 2023 1 commit
-
-
OlivierDehaene authored
The only difference is that now it pushes to registry.internal.huggingface.tech/api-inference/community/text-generation-inference/sagemaker:... instead of registry.internal.huggingface.tech/api-inference/community/text-generation-inference:sagemaker-... --------- Co-authored-by:Philipp Schmid <32632186+philschmid@users.noreply.github.com>
-
- 28 Mar, 2023 1 commit
-
-
OlivierDehaene authored
-
- 16 Mar, 2023 1 commit
-
-
OlivierDehaene authored
-
- 13 Mar, 2023 1 commit
-
-
OlivierDehaene authored
-
- 09 Mar, 2023 1 commit
-
-
OlivierDehaene authored
-