  1. 13 Jul, 2023 10 commits
  2. 12 Jul, 2023 9 commits
  3. 11 Jul, 2023 15 commits
  4. 10 Jul, 2023 4 commits
  5. 07 Jul, 2023 2 commits
    • 
      Enable `conversational` pipeline for `GPTSw3Tokenizer` (#24648) · abaca9f9
      Dan Saattrup Nielsen authored
      * feat: Add `_build_conversation_input_ids` to GPT-SW3 tokenizer, adjust line length
      
      * feat: Merge in PR https://github.com/huggingface/transformers/pull/24504
      
      This allows the GPT-SW3 models (and other GPT-2 based models) to be 4-bit quantised
      using `load_in_4bit` with `bitsandbytes`.
      
      * fix: F-string
      
      * fix: F-string
      
      * fix: Remove EOS token from all responses
      
      * fix: Remove redundant newlines
      
      * feat: Add `load_in_4bit` to `Pipeline`
      
      * fix: Separate turns with `\n<s>\n` rather than `<s>`
      
      * fix: Add missing newline in prompt
      
      * tests: Add unit tests for the new `_build_conversation_input_ids` method
      
      * style: Automatic style correction
      
      * tests: Compare encodings rather than decodings
      
      * fix: Remove `load_in_4bit` from pipeline arguments
      
      * docs: Add description and references of the GPT-SW3 chat format
      
      * style: Line breaks
      
      * Apply suggestions from code review
      
      Fix Conversation type hints
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * fix: Import TYPE_CHECKING
      
      * style: Run automatic fixes
      
      * tests: Remove `_build_conversation_input_ids` unit tests
      
      * tests: Remove import of `Conversation` in GPT-SW3 unit test
      
      * style: Revert formatting
      
      * style: Move TYPE_CHECKING line after all imports
      
      * style: Imports order
      
      * fix: Change prompt to ensure that `sp_model.encode` and `encode` yield the same result
      
      * docs: Add TODO comment related to the addition of whitespace during decoding
      
      * style: Automatic style checks
      
      * fix: Remove final whitespace in prompt, as prefix whitespace is used by sentencepiece
      
      ---------
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
    • 
      Whisper: fix prompted max length (#24666) · f614b6e3
      Joao Gante authored
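
The GPT-SW3 commit above (abaca9f9) mentions separating conversation turns with `\n<s>\n` and dropping trailing whitespace from the prompt. As a rough illustration only, the sketch below joins turns with that separator using plain string operations. The `build_prompt` helper, the `User:`/`Bot:` role prefixes, and the open bot turn at the end are assumptions made for this example; they are not taken from the commit and may not match what `_build_conversation_input_ids` actually does.

```python
from typing import List, Tuple

# The separator is the one named in the commit message; everything else
# in this sketch (helper name, role prefixes, open final turn) is assumed.
TURN_SEPARATOR = "\n<s>\n"


def build_prompt(
    turns: List[Tuple[bool, str]],
    user_prefix: str = "User:",  # hypothetical role label
    bot_prefix: str = "Bot:",    # hypothetical role label
) -> str:
    """Join (is_user, text) turns with the separator and leave a bot turn open."""
    lines = [
        f"{user_prefix if is_user else bot_prefix} {text.strip()}"
        for is_user, text in turns
    ]
    # End on a bare prefix with no trailing space, in the spirit of the
    # "Remove final whitespace in prompt" item in the commit message.
    lines.append(bot_prefix)
    return TURN_SEPARATOR.join(lines)


if __name__ == "__main__":
    conversation = [(True, "Hej"), (False, "Hej!"), (True, "Hur mår du?")]
    print(build_prompt(conversation))
```

Keeping the separator in a single constant makes it easy to compare the two variants the commit message contrasts (`<s>` versus `\n<s>\n`) without touching the joining logic.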