Unverified Commit b94f3021 authored by Nicolas Patry, committed by GitHub

fix(server): Use cleanup_tokenization_spaces=False for lossless decoding (#13)

Fixes #12 in the easiest way I could think of.
parent 60472f9d
@@ -354,7 +354,8 @@ class CausalLM(Model):
             if stop:
                 # Decode all tokens
                 output_text = self.tokenizer.decode(
-                    all_input_ids.squeeze(-1), skip_special_tokens=True
+                    all_input_ids.squeeze(-1), skip_special_tokens=True,
+                    cleanup_tokenization_spaces=False
                 )
                 # Slice with input_length to remove padding
                 token_ids = all_input_ids[-new_input_length:]
...
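For context, here is a minimal sketch of what disabling tokenization-space cleanup does to decoded output. It assumes a Hugging Face tokenizer and uses "gpt2" purely as an example checkpoint; note that in the transformers API the keyword is spelled clean_up_tokenization_spaces.

from transformers import AutoTokenizer

# Sketch only, not part of this commit: compare decoding with and without
# tokenization-space cleanup. "gpt2" is just an example checkpoint.
tokenizer = AutoTokenizer.from_pretrained("gpt2")

text = "Hello , world !"  # spacing that cleanup would normally rewrite
ids = tokenizer.encode(text)

# With cleanup enabled, decode typically collapses spaces before punctuation
# (e.g. "Hello, world!"), so the round trip is not character-for-character
# faithful to the original text.
cleaned = tokenizer.decode(
    ids, skip_special_tokens=True, clean_up_tokenization_spaces=True
)

# With cleanup disabled, the decoded text keeps the original spacing, which
# is the "lossless decoding" the commit message refers to.
lossless = tokenizer.decode(
    ids, skip_special_tokens=True, clean_up_tokenization_spaces=False
)

print(repr(cleaned))
print(repr(lossless))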