"vscode:/vscode.git/clone" did not exist on "956a483173e77ebf655ca9636a5f7b6ef010b307"
  1. 19 Apr, 2023 1 commit
  2. 18 Apr, 2023 3 commits
    • Sylvain Gugger's avatar
      Use code on the Hub from another repo (#22814) · 5f9b825c
      Sylvain Gugger authored
      * initial work
      
      * Add other classes
      
      * Refactor code
      
      * Move warning and fix dynamic pipeline
      
      * Issue warning when necessary
      
      * Add test
      
      * Do not skip auto tests
      
      * Fix failing tests
      
      * Refactor and address review comments
      
      * Address review comments
      5f9b825c
    • Joao Gante's avatar
      Generate: Add assisted generation (#22211) · 78cda46f
      Joao Gante authored
      * working mvp
      
      * remove breakpoint
      
      * fix commit
      
      * standardize outputs
      
      * tmp commit
      
      * tests almost ready
      
      * tmp commit
      
      * skip a few models
      
      * Add streaming; Docs and examples
      
      * document limitations
      
      * PR commits
      
      * Amy PR comments
      78cda46f
    • Matthijs Hollemans's avatar
      TTS fine-tuning for SpeechT5 (#21824) · ac2bc50a
      Matthijs Hollemans authored
      
      
      * wrong argument name
      
      * append eos_token_id
      
      * all tokenizers need mask and ctc_blank tokens
      
      * remove reduction factor from feature extractor
      
      * add proper TTS loss
      
      * did shifting the wrong way around
      
      * mask out padded portions
      
      * remove logits again (don't really need it)
      
      * fix unit tests
      
      * fixup
      
      * pad also returns the decoder attention mask, since that's useful to have
      
      * clean up feature extractor logic
      
      * pad can handle TTS task too
      
      * remove stop_labels from loss calculation
      
      * simplify logic
      
      * fixup
      
      * do -100 masking properly
      
      * small STFT optimization (calculate mel filterbanks only once)
      
      * replace torchaudio fbanks with audio_utils
      
      * remove torchaudio dependency
      
      * simplify & speed up the STFT
      
      * don't serialize window and mel filters
      
      * output cross attentions when generating speech
      
      * add guided attention loss
      
      * fix failing test
      
      * Update src/transformers/models/speecht5/feature_extraction_speecht5.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * Update src/transformers/models/speecht5/modeling_speecht5.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * change type annotation of attention_mask to LongTensor
      
      * extract loss into class
      
      * remove unused frame_signal_scale argument
      
      * use config object in loss class
      
      * fix type annotations in doc comments
      
      * change optional to just bool
      
      * implement missing tokenizer method
      
      * add deprecation warning
      
      * Update src/transformers/models/speecht5/feature_extraction_speecht5.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/speecht5/feature_extraction_speecht5.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * add deprecation warning for stop_labels
      
      ---------
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      ac2bc50a
  3. 17 Apr, 2023 4 commits
  4. 14 Apr, 2023 1 commit
  5. 13 Apr, 2023 3 commits
  6. 12 Apr, 2023 2 commits
    • Matt's avatar
      Fix docstrings for TF BLIP (#22618) · 50f82e12
      Matt authored
      * Fix docstrings for TFBLIP
      
      * Fix missing line in TF port!
      
      * Use values from torch tests now other bugs fixed
      
      * Use values from torch tests now other bugs fixed
      
      * Fix doctest string
      50f82e12
    • pioliverse's avatar
      add model resources for CPMAnt (new) (#20906) · 523ca4e0
      pioliverse authored
      
      
      * resolve conflicts
      
      * rebase and make style
      
      * test
      
      * test
      
      * test
      
      * rebase and make style
      
      * rebase and make style
      
      * tests
      
      * tests
      
      * rewrite some functions
      
      * rebase and make style
      
      * fix load_tf_weights_in_cpmant
      
      * reformat some unrelated files
      
      * upgrade quality
      
      * fix some bugs & docstring
      
      * add models and tests
      
      * solve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * tests
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix load_tf_weights_in_cpmant
      
      * reformat some unrelated files
      
      * upgrade quality
      
      * fix some bugs & docstring
      
      * save resolution
      
      * make style
      
      * delete redefinition code
      
      * reformat function
      
      * reformat
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * tests
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix load_tf_weights_in_cpmant
      
      * reformat some unrelated files
      
      * upgrade quality
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix load_tf_weights_in_cpmant
      
      * reformat some unrelated files
      
      * upgrade quality
      
      * resolve conflicts
      
      * make style
      
      * fix bugs and refactor
      
      * modify docstrings and make style
      
      * unify import format in __init__.py
      
      * fix import-altclp bug
      
      * fix copies to update index.md
      
      * fix unused config parameters
      
      * fix unused config parameters
      
      * fix unused config parameters
      
      * update README_ja.md
      
      * dummy commit for unit test
      
      * fix attention mask
      
      * add CPMAntTokenizer&-Fast to auto-mapping
      
      * drop redundant changes in README_ko
      
      * fix  defaults in docstring
      
      * fix use_cache and some docstring
      
      * add missing args in tokenizer
      
      * modify tester inheritance
      
      * add is_jieba_available
      
      * fix some bugs
      
      * make style and fix-copies
      
      * add doctests
      
      * skip integration tests
      
      * add is_jieba_available
      
      * fix bugs in common tests
      
      * adjust docstrings and make style
      
      * add argument docstring
      
      * adjust code to some specifications
      
      * make style and fix-copies
      
      * add fast tokenization test
      
      * dummy commit for unit test
      
      * dummy commit for unit test
      
      * dummy commit for unit test
      
      * normalize some comments and names
      
      * Bert->CPMAnt
      
      * camel names and drop redundant codes
      
      * make style and fix-coies
      
      * add CpmTokenizerFast _import_structure
      
      * drop cpmanttokenizerfast in model_doc
      
      * fix some problems
      
      * fix CPMAnt tokenization for common test
      
      * make style and fixup
      
      * fix copies and fixup
      
      * fix bugs in tokenization test
      
      * dummy commit for connection failure in unittest
      
      * fix copies
      
      * drop trailing comma
      
      * fix decorator in tests
      
      * dummy commit for connection failure in unittest
      
      ---------
      Co-authored-by: default avatarGong Baitao <gongbaitao11@gmail.com>
      523ca4e0
  7. 11 Apr, 2023 1 commit
  8. 10 Apr, 2023 2 commits
    • Sugawara's avatar
      add GPTNeoXForSequenceClassification (#22671) · 6daa9cb5
      Sugawara authored
      * add GPTNeoXForSequenceClassification
      
      * move the labels to logits.device (ref: #22561)
      
      * fix
      6daa9cb5
    • Joel Lamy-Poirier's avatar
      Add GPTBigCode model (Optimized GPT2 with MQA from Santacoder & BigCode) (#22575) · e0921c6b
      Joel Lamy-Poirier authored
      
      
      * Add model with cli tool
      
      * Remove unwanted stuff
      
      * Add new code
      
      * Remove inference runner
      
      * Style
      
      * Fix checks
      
      * Test updates
      
      * make fixup
      
      * fix docs
      
      * fix doc
      
      * fix test
      
      * hopefully fix pipeline tests
      
      * refactor
      
      * fix CIs
      
      * add comment
      
      * rename to `GPTBigCodeForCausalLM`
      
      * correct readme
      
      * make fixup + docs
      
      * make fixup
      
      * fixes
      
      * fixes
      
      * Remove pruning
      
      * Remove import
      
      * Doc updates
      
      * More pruning removal
      
      * Combine copies
      
      * Single MQA implementation, remove kv cache pre-allocation and padding
      
      * Update doc
      
      * Revert refactor to match gpt2 style
      
      * Merge back key and value caches, fix some type hints
      
      * Update doc
      
      * Fix position ids pith padding (PR 21080)
      
      * Add conversion script temporarily
      
      * Update conversion script
      
      * Remove checkpoint conversion
      
      * New model
      
      * Fix MQA test
      
      * Fix copies
      
      * try fix tests
      
      * FIX TEST!!
      
      * remove  `DoubleHeadsModel`
      
      * add MQA tests
      
      * add slow tests
      
      * clean up
      
      * add CPU checker
      
      * final fixes
      
      * fixes
      
      - fix GPU issue
      - fixed slow tests
      - skip disk offload
      
      * fix final issue
      
      * Simplify and comment baddbmm fix
      
      * Remove unnecessary code
      
      * Transpose tweaks
      
      * Use beta=1 on cpu, improve tests
      
      ---------
      Co-authored-by: default avataryounesbelkada <younesbelkada@gmail.com>
      e0921c6b
  9. 07 Apr, 2023 2 commits
  10. 06 Apr, 2023 6 commits
    • Yih-Dar's avatar
      Update tiny model summary file for recent models (#22637) · c7ec71ba
      Yih-Dar authored
      
      
      * Update tiny model summary file for recent models
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      c7ec71ba
    • Younes Belkada's avatar
      [`Blip`] Fix slow tests and doctests with correct values (#22632) · ed672864
      Younes Belkada authored
      fix slow tests and doctests
      ed672864
    • Yih-Dar's avatar
      update_pip_test_mapping (#22606) · fa01127a
      Yih-Dar authored
      
      
      * Add TFBlipForConditionalGeneration
      
      * update pipeline_model_mapping
      
      * Add import
      
      * Revert changes in GPTSanJapaneseTest
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      fa01127a
    • Yih-Dar's avatar
      Make tiny model creation + pipeline testing more robust (#22500) · 2c22bc79
      Yih-Dar authored
      
      
      * Final Tiny things
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      2c22bc79
    • amyeroberts's avatar
      Backbone add mixin tests (#22542) · 12d51db2
      amyeroberts authored
      * Add out_indices to backbones, deprecate out_features
      
      * Update - can specify both out_features and out_indices but not both
      
      * Add backbone mixin tests
      
      * Test tidy up
      
      * Add test_backbone for convnext
      
      * Remove redefinition of method
      
      * Update for Dinat and Nat backbones
      
      * Update tests
      
      * Smarter indexing
      
      * Add checks on config creation for backbone
      
      * PR comments
      12d51db2
    • Nicolas Patry's avatar
      Adding Llama FastTokenizer support. (#22264) · 1670be4b
      Nicolas Patry authored
      * Adding Llama FastTokenizer support.
      
      - Requires https://github.com/huggingface/tokenizers/pull/1183 version
      - Only support byte_fallback for llama, raise otherwise (safety net).
      - Lots of questions are special tokens
      
      How to test:
      
      ```python
      
      from transformers.convert_slow_tokenizer import convert_slow_tokenizer
      from transformers import AutoTokenizer
      from tokenizers import Tokenizer
      
      tokenizer = AutoTokenizer.from_pretrained("huggingface/llama-7b")
      
      if False:
          new_tokenizer = Tokenizer.from_file("tok.json")
      else:
          new_tokenizer = convert_slow_tokenizer(tokenizer)
          new_tokenizer.save("tok.json")
      
      strings = [
          "This is a test",
          "生活的真谛是",
          "生活的真谛是[MASK]。",
          # XXX: This one is problematic because of special tokens
          # "<s> Something something",
      ]
      
      for string in strings:
          encoded = tokenizer(string)["input_ids"]
          encoded2 = new_tokenizer.encode(string).ids
      
          assert encoded == encoded2, f"{encoded} != {encoded2}"
      
          decoded = tokenizer.decode(encoded)
          decoded2 = new_tokenizer.decode(encoded2)
      
          assert decoded.strip() == decoded2, f"{repr(decoded)} != {repr(decoded2)}"
      ```
      
      The converter + some test script.
      
      The test script.
      
      Tmp save.
      
      Adding Fast tokenizer + tests.
      
      Adding the tokenization tests.
      
      Correct combination.
      
      Small fix.
      
      Fixing tests.
      
      Fixing with latest update.
      
      Rebased.
      
      fix copies + normalized added tokens  + copies.
      
      Adding doc.
      
      TMP.
      
      Doc + split files.
      
      Doc.
      
      Versions + try import.
      
      Fix Camembert + warnings -> Error.
      
      Fix by ArthurZucker.
      
      Not a decorator.
      
      * Fixing comments.
      
      * Adding more to docstring.
      
      * Doc rewriting.
      1670be4b
  11. 05 Apr, 2023 3 commits
  12. 04 Apr, 2023 5 commits
  13. 03 Apr, 2023 6 commits
  14. 30 Mar, 2023 1 commit