1. 12 Oct, 2023 2 commits
  2. 11 Oct, 2023 4 commits
    • Patrick von Platen's avatar
      [Assistant Generation] Improve Encoder Decoder (#26701) · da69de17
      Patrick von Platen authored
      * [Assistant Generation] Improve enc dec
      
      * save more
      
      * Fix logit processor checks
      
      * Clean
      
      * make style
      
      * fix deprecation
      
      * fix generation test
      
      * Apply suggestions from code review
      
      * fix biogpt
      
      * make style
      da69de17
    • Yih-Dar's avatar
      `Copied from` for test files (#26713) · 5334796d
      Yih-Dar authored
      
      
      * copied statement for test files
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      5334796d
    • Billy Bradley's avatar
      In assisted decoding, pass model_kwargs to model's forward call (fix... · dcc49d8a
      Billy Bradley authored
      In assisted decoding, pass model_kwargs to model's forward call (fix prepare_input_for_generation in all models) (#25242)
      
      * In assisted decoding, pass model_kwargs to model's forward call
      
      Previously, assisted decoding would ignore any additional kwargs
      that it doesn't explicitly handle. This was inconsistent with other
      generation methods, which pass the model_kwargs through
      prepare_inputs_for_generation and forward the returned dict to the
      model's forward call.
      
      The prepare_inputs_for_generation method needs to be amended in all
      models, as previously it only kept the last input ID when a past_key_values
      was passed.
      
      * Improve variable names in _extend_attention_mask
      
      * Refactor extending token_type_ids into a function
      
      * Replace deepcopy with copy to optimize performance
      
      * Update new persimmon model with llama changes for assisted generation
      
      * Update new mistral model for assisted generation with prepare_inputs_for_generation
      
      * Update position_ids creation in falcon prepare_inputs_for_generation to support assisted generation
      dcc49d8a
    • Thien Tran's avatar
      Make Whisper Encoder's sinusoidal PE non-trainable by default (#26032) · 1e3c9dda
      Thien Tran authored
      
      
      * set encoder's PE as non-trainable
      
      * freeze flax
      
      * init sinusoids
      
      * add test for non-trainable embed positions
      
      * simplify TF encoder embed_pos
      
      * revert tf
      
      * clean up
      
      * add sinusoidal init for jax
      
      * make consistent sinusoidal function
      
      * fix dtype
      
      * add default dtype
      
      * use numpy for sinusoids. fix jax
      
      * add sinusoid init for TF
      
      * fix
      
      * use custom embedding
      
      * use specialized init for each impl
      
      * fix sinusoids init. add test for pytorch
      
      * fix TF dtype
      
      * simplify sinusoid init for flax and tf
      
      * add tests for TF
      
      * change default dtype to float32
      
      * add sinusoid test for flax
      
      * Update src/transformers/models/whisper/modeling_flax_whisper.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * Update src/transformers/models/whisper/modeling_tf_whisper.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * move sinusoidal init to _init_weights
      
      ---------
      Co-authored-by: default avatarsanchit-gandhi <sanchit@huggingface.co>
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      1e3c9dda
  3. 09 Oct, 2023 1 commit
  4. 06 Oct, 2023 6 commits
  5. 05 Oct, 2023 3 commits
  6. 04 Oct, 2023 3 commits
  7. 03 Oct, 2023 6 commits
  8. 02 Oct, 2023 4 commits
    • Arthur's avatar
      Code-llama-nit (#26300) · bab33319
      Arthur authored
      * fix encoding when the fill token is None
      
      * add tests and edge cases
      
      * fiuxp
      
      * Update tests/models/code_llama/test_tokenization_code_llama.py
      bab33319
    • Arthur's avatar
      Fix model integration ci (#26322) · 63864e05
      Arthur authored
      * fix wav2vec2
      
      * nit
      
      * stash
      
      * one more file to update
      
      * fix byt5
      
      * vocab size is 256, don't change that!
      
      * use other revision
      
      * test persimon in smaller size
      
      * style
      
      * tests
      
      * nits
      
      * update add tokens from pretrained
      
      * test tokenization
      
      * nits
      
      * potential fnet fix?
      
      * more nits
      
      * nits
      
      * correct test
      
      * assert close
      
      * udpate
      
      * ouch
      
      * fix it
      
      * some more nits
      
      * FINALLU
      
      * use `adept` checkpoints
      
      * more adept checkpoints
      
      * that was invlved!
      63864e05
    • Younes Belkada's avatar
      [`core`/ `auto` ] Fix bnb test with code revision + bug with code revision (#26431) · 6824461f
      Younes Belkada authored
      * fix bnb test with code revision
      
      * fix test
      
      * Apply suggestions from code review
      
      * Update src/transformers/models/auto/auto_factory.py
      
      * Update src/transformers/models/auto/auto_factory.py
      
      * Update src/transformers/models/auto/auto_factory.py
      6824461f
    • Lysandre Debut's avatar
      Revert falcon exception (#26472) · 67239f73
      Lysandre Debut authored
      * Revert "Falcon: fix revision propagation (#26006)"
      
      This reverts commit 118c676ef3124423e5d062b665f05cde55bc9a90.
      
      * Revert "Put Falcon back (#25960)"
      
      This reverts commit 22a69f1d.
      67239f73
  9. 29 Sep, 2023 2 commits
  10. 28 Sep, 2023 2 commits
  11. 27 Sep, 2023 4 commits
  12. 26 Sep, 2023 2 commits
    • sanjeevk-os's avatar
    • NielsRogge's avatar
      Add Nougat (#25942) · ace74d16
      NielsRogge authored
      
      
      * Add conversion script
      
      * Add NougatImageProcessor
      
      * Add crop margin
      
      * More improvements
      
      * Add docs, READMEs
      
      * Remove print statements
      
      * Include model_max_length
      
      * Add NougatTokenizerFast
      
      * Fix imports
      
      * Improve postprocessing
      
      * Improve image processor
      
      * Fix image processor
      
      * Improve normalize method
      
      * More improvements
      
      * More improvements
      
      * Add processor, improve docs
      
      * Simplify fast tokenizer
      
      * Remove test file
      
      * Fix docstrings
      
      * Use NougatProcessor in conversion script
      
      * Add is_levensthein_available
      
      * Add tokenizer tests
      
      * More improvements
      
      * Use numpy instead of opencv
      
      * Add is_cv2_available
      
      * Fix cv2_available
      
      * Add is_nltk_available
      
      * Add image processor tests, improve crop_margin
      
      * Add integration tests
      
      * Improve integration test
      
      * Use do_rescale instead of hacks, thanks Amy
      
      * Remove random_padding
      
      * Address comments
      
      * Address more comments
      
      * Add import
      
      * Address more comments
      
      * Address more comments
      
      * Address comment
      
      * Address comment
      
      * Set max_model_input_sizes
      
      * Add tests
      
      * Add requires_backends
      
      * Add Nougat to exotic tests
      
      * Use to_pil_image
      
      * Address comment regarding nltk
      
      * Add NLTK
      
      * Improve variable names, integration test
      
      * Add test
      
      * refactor, document, and test regexes
      
      * remove named capture groups, add comments
      
      * format
      
      * add non-markdown fixed tokenization
      
      * format
      
      * correct flakyness of args parse
      
      * add regex comments
      
      * test functionalities for crop_image, align long axis and expected output
      
      * add regex tests
      
      * remove cv2 dependency
      
      * test crop_margin equality between cv2 and python
      
      * refactor table regexes to markdown
      
      add newline
      
      * change print to log, improve doc
      
      * fix high count tables correction
      
      * address PR comments: naming, linting, asserts
      
      * Address comments
      
      * Add copied from
      
      * Update conversion script
      
      * Update conversion script to convert both small and base versions
      
      * Add inference example
      
      * Add more info
      
      * Fix style
      
      * Add require annotators to test
      
      * Define all keyword arguments explicitly
      
      * Move cv2 annotator
      
      * Add tokenizer init method
      
      * Transfer checkpoints
      
      * Add reference to Donut
      
      * Address comments
      
      * Skip test
      
      * Remove cv2 method
      
      * Add copied from statements
      
      * Use cached_property
      
      * Fix docstring
      
      * Add file to not doctested
      
      ---------
      Co-authored-by: default avatarPablo Montalvo <pablo.montalvo.leroux@gmail.com>
      ace74d16
  13. 25 Sep, 2023 1 commit