1. 27 Sep, 2023 5 commits
  2. 26 Sep, 2023 7 commits
    • Add torch `RMSProp` optimizer (#26425) · 408b2b3c
      Nathan Lambert authored
      add rmsprop
      408b2b3c
    • [InternLM] Add support for InternLM (#26302) · 6ba63ac3
      Matt authored
      * Add config.bias to LLaMA to allow InternLM models to be ported as LLaMA checkpoints
      
      * Rename bias -> attention_bias and add docstring
      6ba63ac3
    • Fix DeepSpeed issue with Idefics (#26393) · 0ac38750
      Hugo Laurençon authored
      Fix deepspeed issue with Idefics
      0ac38750
    • Deleted duplicate sentence (#26394) · a8531f3b
      titi authored
      a8531f3b
    • [ViTMatte] Add resources (#26317) · a09130fe
      NielsRogge authored
      Add resource
      a09130fe
    • Add Nougat (#25942) · ace74d16
      NielsRogge authored
      
      
      * Add conversion script
      
      * Add NougatImageProcessor
      
      * Add crop margin
      
      * More improvements
      
      * Add docs, READMEs
      
      * Remove print statements
      
      * Include model_max_length
      
      * Add NougatTokenizerFast
      
      * Fix imports
      
      * Improve postprocessing
      
      * Improve image processor
      
      * Fix image processor
      
      * Improve normalize method
      
      * More improvements
      
      * More improvements
      
      * Add processor, improve docs
      
      * Simplify fast tokenizer
      
      * Remove test file
      
      * Fix docstrings
      
      * Use NougatProcessor in conversion script
      
      * Add is_levensthein_available
      
      * Add tokenizer tests
      
      * More improvements
      
      * Use numpy instead of opencv
      
      * Add is_cv2_available
      
      * Fix cv2_available
      
      * Add is_nltk_available
      
      * Add image processor tests, improve crop_margin
      
      * Add integration tests
      
      * Improve integration test
      
      * Use do_rescale instead of hacks, thanks Amy
      
      * Remove random_padding
      
      * Address comments
      
      * Address more comments
      
      * Add import
      
      * Address more comments
      
      * Address more comments
      
      * Address comment
      
      * Address comment
      
      * Set max_model_input_sizes
      
      * Add tests
      
      * Add requires_backends
      
      * Add Nougat to exotic tests
      
      * Use to_pil_image
      
      * Address comment regarding nltk
      
      * Add NLTK
      
      * Improve variable names, integration test
      
      * Add test
      
      * refactor, document, and test regexes
      
      * remove named capture groups, add comments
      
      * format
      
      * add non-markdown fixed tokenization
      
      * format
      
      * correct flakiness of args parse
      
      * add regex comments
      
      * test functionalities for crop_image, align long axis and expected output
      
      * add regex tests
      
      * remove cv2 dependency
      
      * test crop_margin equality between cv2 and python
      
      * refactor table regexes to markdown
      
      add newline
      
      * change print to log, improve doc
      
      * fix high count tables correction
      
      * address PR comments: naming, linting, asserts
      
      * Address comments
      
      * Add copied from
      
      * Update conversion script
      
      * Update conversion script to convert both small and base versions
      
      * Add inference example
      
      * Add more info
      
      * Fix style
      
      * Add require annotators to test
      
      * Define all keyword arguments explicitly
      
      * Move cv2 annotator
      
      * Add tokenizer init method
      
      * Transfer checkpoints
      
      * Add reference to Donut
      
      * Address comments
      
      * Skip test
      
      * Remove cv2 method
      
      * Add copied from statements
      
      * Use cached_property
      
      * Fix docstring
      
      * Add file to not doctested
      
      ---------
      Co-authored-by: Pablo Montalvo <pablo.montalvo.leroux@gmail.com>
      ace74d16
  3. 25 Sep, 2023 6 commits
  4. 22 Sep, 2023 9 commits
  5. 21 Sep, 2023 5 commits
  6. 20 Sep, 2023 8 commits
    • add custom RMSNorm to `ALL_LAYERNORM_LAYERS` (#26227) · e3a4bd2b
      Shijie Wu authored
      * add LlamaRMSNorm to ALL_LAYERNORM_LAYERS
      
      * fixup
      
      * add IdeficsRMSNorm to ALL_LAYERNORM_LAYERS and fixup
      e3a4bd2b
    • [`Trainer`] Refactor trainer + bnb logic (#26248) · 0b5024ce
      Younes Belkada authored
      * refactor trainer + bnb logic
      
      * remove logger.info
      
      * oops
      0b5024ce
    • include changes from llama (#26260) · f94c9b3d
      Arthur authored
      * include changes from llama
      
      * add a test
      f94c9b3d
    • add bbox input validation (#26294) · 00247ea0
      Jinho Park authored
      00247ea0
    • fix deepspeed available detection (#26252) · 24553206
      fxmarty authored
      24553206
    • Rewrite for custom code warning messages (#26291) · f29fe745
      Matt authored
      Quick britpicking for some warning messages!
      f29fe745
    • Integrate AMD GPU in CI/CD environment (#26007) · 2d71307d
      Funtowicz Morgan authored
      
      
      * Add a Dockerfile for PyTorch + ROCm based on official AMD released artifact
      
      * Add a new artifact single-amdgpu testing on main
      
      * Attempt to test the workflow without merging.
      
      * Changed BERT to check if things are triggered
      
      * Meet the dependencies graph on workflow
      
      * Revert BERT changes
      
      * Add check_runners_amdgpu to correctly mount and check availability
      
      * Rename setup to setup_gpu for CUDA and add setup_amdgpu for AMD
      
      * Fix all the needs.setup -> needs.setup_[gpu|amdgpu] dependencies
      
      * Fix setup dependency graph to use check_runner_amdgpu
      
      * Let's do the runner status check only on AMDGPU target
      
      * Update the Dockerfile.amd to put ourselves in / rather than /var/lib
      
      * Restore the whole setup for CUDA too.
      
      * Let's redisable them
      
      * Change BERT to trigger tests
      
      * Restore BERT
      
      * Add torchaudio with rocm 5.6 to AMD Dockerfile (#26050)
      
      fix dockerfile
      Co-authored-by: Felix Marty <felix@hf.co>
      
      * Place AMD GPU tests in a separate workflow (correct branch) (#26105)
      
      AMDGPU CI lives in another workflow
      
      * Fix invalid job name in dependencies.
      
      * Remove tests multi-amdgpu for now.
      
      * Use single-amdgpu
      
      * Use --net=host for now.
      
      * Remote host networking.
      
      * Removed duplicated check_runners_amdgpu step
      
      * Let's tag machine-types with mi210 for now.
      
      * Machine type should be only mi210
      
      * Remove unnecessary push.branches item
      
      * Apply review suggestions moving from `x-amdgpu` to `x-gpu` introducing `amd-gpu` and `miXXX` labels.
      
      * Remove amdgpu from step names.
      
      * finalize
      
      * delete
      
      ---------
      Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>
      Co-authored-by: Felix Marty <felix@hf.co>
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      2d71307d
    • Update bros checkpoint (#26277) · 37c205eb
      Jinho Park authored
      * fix bros integration test
      
      * update bros checkpoint
      37c205eb