- 07 Feb, 2023 1 commit
-
-
OlivierDehaene authored
-
- 03 Feb, 2023 1 commit
-
-
OlivierDehaene authored
-
- 02 Feb, 2023 2 commits
-
-
OlivierDehaene authored
@njhill, @yk FYI generated_text was concatenated to the user prompt for legacy reason. We want to remove this behaviour as we don't think it is useful and even detrimonial to usability. We also remove the unused Vec.
-
OlivierDehaene authored
-
- 01 Feb, 2023 3 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
-
OlivierDehaene authored
-
- 31 Jan, 2023 7 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
-
OlivierDehaene authored
-
OlivierDehaene authored
-
OlivierDehaene authored
-
OlivierDehaene authored
Reverts huggingface/text-generation-inference#36
-
OlivierDehaene authored
Add token streaming using ServerSideEvents (SSE). The signature of the SSE events is: ```rust struct Details { finish_reason: String, generated_tokens: u32, seed: Option<u64>, } struct StreamResponse { token: Token, generated_text: Option<String>, details: Option<Details>, } struct ErrorResponse { error: String, } ```
-
- 30 Jan, 2023 1 commit
-
-
OlivierDehaene authored
Co-authored-by:Yannic Kilcher <yk@users.noreply.github.com>
-
- 26 Jan, 2023 1 commit
-
-
OlivierDehaene authored
-
- 24 Jan, 2023 1 commit
-
-
OlivierDehaene authored
-
- 20 Jan, 2023 2 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
-
- 17 Jan, 2023 1 commit
-
-
Nick Hill authored
- Fix some type hints, in particular base tokenizer class - Make use of `tensor.new_zero/empty` methods - Simplify env var string parsing in launcher
-
- 05 Jan, 2023 1 commit
-
-
OlivierDehaene authored
Co-authored-by:Nick Hill <nickhill@us.ibm.com>
-
- 03 Jan, 2023 1 commit
-
-
Nicolas Patry authored
Fixes #12 in the easiest way I could think of.
-
- 30 Dec, 2022 1 commit
-
-
Nick Hill authored
AFAIK there is no torch device type called "gpu".
-
- 16 Dec, 2022 1 commit
-
-
OlivierDehaene authored
-
- 15 Dec, 2022 1 commit
-
-
OlivierDehaene authored
-
- 12 Dec, 2022 1 commit
-
-
OlivierDehaene authored
-
- 08 Dec, 2022 2 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
-
- 05 Dec, 2022 1 commit
-
-
Nick Hill authored
- Avoid theoretical hang in batcher loop - Avoid a couple of clones in the router generate method - Keep attention mask tensors as integers - Remove num_heads attribute Co-authored-by:OlivierDehaene <Olivier.dehaene@gmail.com>
-
- 01 Dec, 2022 1 commit
-
-
OlivierDehaene authored
-
- 09 Nov, 2022 1 commit
-
-
OlivierDehaene authored
-
- 08 Nov, 2022 1 commit
-
-
OlivierDehaene authored
-
- 07 Nov, 2022 1 commit
-
-
OlivierDehaene authored
-
- 04 Nov, 2022 2 commits
-
-
OlivierDehaene authored
-
OlivierDehaene authored
-
- 03 Nov, 2022 1 commit
-
-
OlivierDehaene authored
-
- 02 Nov, 2022 1 commit
-
-
OlivierDehaene authored
-
- 28 Oct, 2022 1 commit
-
-
OlivierDehaene authored
-
- 27 Oct, 2022 1 commit
-
-
OlivierDehaene authored
-
- 22 Oct, 2022 1 commit
-
-
Nicolas Patry authored
Co-authored-by:OlivierDehaene <23298448+OlivierDehaene@users.noreply.github.com>
-