1. 02 Oct, 2023 2 commits
    • Florian Seiler's avatar
      Fix num_heads in _upad_input (#26490) · ca0379b8
      Florian Seiler authored
      
      
      * Fix num_heads in _upad_input
      
      The variable num_key_value_heads has falsely been named num_heads, which led to reshaping the query_layer using the wrong attention head count. (It would have been enough to use the correct variable self.num_heads instead of num_heads, but I renamed num_heads to num_key_value_heads for clarity)
      
      * fixed copies using make fix-copies and ran make fixup
      
      ---------
      Co-authored-by: default avatarfseiler <f.seiler@jerocom.de>
      ca0379b8
    • Lysandre Debut's avatar
      Revert falcon exception (#26472) · 67239f73
      Lysandre Debut authored
      * Revert "Falcon: fix revision propagation (#26006)"
      
      This reverts commit 118c676ef3124423e5d062b665f05cde55bc9a90.
      
      * Revert "Put Falcon back (#25960)"
      
      This reverts commit 22a69f1d.
      67239f73
  2. 29 Sep, 2023 6 commits
  3. 28 Sep, 2023 10 commits
  4. 27 Sep, 2023 12 commits
  5. 26 Sep, 2023 7 commits
    • Nathan Lambert's avatar
      Add torch `RMSProp` optimizer (#26425) · 408b2b3c
      Nathan Lambert authored
      add rmsprop
      408b2b3c
    • Matt's avatar
      [InternLM] Add support for InternLM (#26302) · 6ba63ac3
      Matt authored
      * Add config.bias to LLaMA to allow InternLM models to be ported as LLaMA checkpoints
      
      * Rename bias -> attention_bias and add docstring
      6ba63ac3
    • Hugo Laurençon's avatar
      Fix DeepSpeed issue with Idefics (#26393) · 0ac38750
      Hugo Laurençon authored
      Fix deepspeed issue with Idefics
      0ac38750
    • sanjeevk-os's avatar
    • titi's avatar
      Deleted duplicate sentence (#26394) · a8531f3b
      titi authored
      a8531f3b
    • NielsRogge's avatar
      [ViTMatte] Add resources (#26317) · a09130fe
      NielsRogge authored
      Add resource
      a09130fe
    • NielsRogge's avatar
      Add Nougat (#25942) · ace74d16
      NielsRogge authored
      
      
      * Add conversion script
      
      * Add NougatImageProcessor
      
      * Add crop margin
      
      * More improvements
      
      * Add docs, READMEs
      
      * Remove print statements
      
      * Include model_max_length
      
      * Add NougatTokenizerFast
      
      * Fix imports
      
      * Improve postprocessing
      
      * Improve image processor
      
      * Fix image processor
      
      * Improve normalize method
      
      * More improvements
      
      * More improvements
      
      * Add processor, improve docs
      
      * Simplify fast tokenizer
      
      * Remove test file
      
      * Fix docstrings
      
      * Use NougatProcessor in conversion script
      
      * Add is_levensthein_available
      
      * Add tokenizer tests
      
      * More improvements
      
      * Use numpy instead of opencv
      
      * Add is_cv2_available
      
      * Fix cv2_available
      
      * Add is_nltk_available
      
      * Add image processor tests, improve crop_margin
      
      * Add integration tests
      
      * Improve integration test
      
      * Use do_rescale instead of hacks, thanks Amy
      
      * Remove random_padding
      
      * Address comments
      
      * Address more comments
      
      * Add import
      
      * Address more comments
      
      * Address more comments
      
      * Address comment
      
      * Address comment
      
      * Set max_model_input_sizes
      
      * Add tests
      
      * Add requires_backends
      
      * Add Nougat to exotic tests
      
      * Use to_pil_image
      
      * Address comment regarding nltk
      
      * Add NLTK
      
      * Improve variable names, integration test
      
      * Add test
      
      * refactor, document, and test regexes
      
      * remove named capture groups, add comments
      
      * format
      
      * add non-markdown fixed tokenization
      
      * format
      
      * correct flakyness of args parse
      
      * add regex comments
      
      * test functionalities for crop_image, align long axis and expected output
      
      * add regex tests
      
      * remove cv2 dependency
      
      * test crop_margin equality between cv2 and python
      
      * refactor table regexes to markdown
      
      add newline
      
      * change print to log, improve doc
      
      * fix high count tables correction
      
      * address PR comments: naming, linting, asserts
      
      * Address comments
      
      * Add copied from
      
      * Update conversion script
      
      * Update conversion script to convert both small and base versions
      
      * Add inference example
      
      * Add more info
      
      * Fix style
      
      * Add require annotators to test
      
      * Define all keyword arguments explicitly
      
      * Move cv2 annotator
      
      * Add tokenizer init method
      
      * Transfer checkpoints
      
      * Add reference to Donut
      
      * Address comments
      
      * Skip test
      
      * Remove cv2 method
      
      * Add copied from statements
      
      * Use cached_property
      
      * Fix docstring
      
      * Add file to not doctested
      
      ---------
      Co-authored-by: default avatarPablo Montalvo <pablo.montalvo.leroux@gmail.com>
      ace74d16
  6. 25 Sep, 2023 3 commits