1. 14 Sep, 2022 14 commits
    • Sylvain Gugger's avatar
      4eb36f29
    • Shinya Otani's avatar
      Add support for Japanese GPT-NeoX-based model by ABEJA, Inc. (#18814) · f5f430e5
      Shinya Otani authored
      * add gpt-neox-japanese model and tokenizer as new model
      
      * Correction to PR's comment for GPT NeoX Japanese
      - Fix to be able to use gpu
      - Add comment # Copied... at the top of RotaryEmbedding
      - Implement nn.Linear instead of original linear class
      - Add generation test under @slow
      
      * fix bias treatment for gpt-neox-japanese
      
      * Modidy gpt-neox-japanese following PR
      - add doc for bias_dropout_add
      - style change following a PR comment
      
      * add document for gpt-neox-japanese
      
      * remove unused import from gpt-neox-japanese
      
      * fix README for gpt-neox-japanese
      f5f430e5
    • Yih-Dar's avatar
      Fix `DocumentQuestionAnsweringPipelineTests` (#19023) · 6a9726ec
      Yih-Dar authored
      
      
      * Fix DocumentQuestionAnsweringPipelineTests
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      6a9726ec
    • Sylvain Gugger's avatar
      Typo fix · 1207deb8
      Sylvain Gugger authored
      1207deb8
    • Sylvain Gugger's avatar
      e1224a2a
    • Sylvain Gugger's avatar
      0b567aa4
    • Yih-Dar's avatar
      Fix CI for `PegasusX` (#19025) · 77b18783
      Yih-Dar authored
      
      
      * Skip test_torchscript_output_attentions for PegasusXModelTest
      
      * fix test_inference_no_head
      
      * fix test_inference_head
      
      * fix test_seq_to_seq_generation
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      77b18783
    • Partho's avatar
      added type hints (#19015) · 77ea35b9
      Partho authored
      77ea35b9
    • NielsRogge's avatar
      [CookieCutter] Clarify questions (#18959) · fc21c9be
      NielsRogge authored
      
      
      * Clarify cookiecutter questions
      
      * Update first question
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      fc21c9be
    • Sylvain Gugger's avatar
      Make AutoProcessor a magic loading class for all modalities (#18963) · 6f8f2f6a
      Sylvain Gugger authored
      * Make AutoProcessor a magic loading class for all modalities
      
      * Quality
      6f8f2f6a
    • Sylvain Gugger's avatar
      a2a3afbc
    • Ekagra Ranjan's avatar
    • NielsRogge's avatar
      Add Deformable DETR (#17281) · 59407bbe
      NielsRogge authored
      
      
      * First draft
      
      * More improvements
      
      * Improve model, add custom CUDA code
      
      * Import torch before
      
      * Add script that imports custom layer
      
      * Add everything in new ops directory
      
      * Import custom layer in modeling file
      
      * Fix ARCHIVE_MAP typo
      
      * Creating the custom kernel on the fly.
      
      * Import custom layer in modeling file
      
      * More improvements
      
      * Fix CUDA loading
      
      * More improvements
      
      * Improve conversion script
      
      * Improve conversion script
      
      * Make it work until encoder_outputs
      
      * Make forward pass work
      
      * More improvements
      
      * Make logits match original implementation
      
      * Make implementation also support single_scale model
      
      * Add support for single_scale and dilation checkpoint
      
      * Add support for with_box_refine model
      
      * Support also two stage model
      
      * Improve tests
      
      * Fix more tests
      
      * Make more tests pass
      
      * Upload all models to the hub
      
      * Clean up some code
      
      * Improve decoder outputs
      
      * Rename intermediate hidden states and reference points
      
      * Improve model outputs
      
      * Move tests to dedicated folder
      
      * Improve model outputs
      
      * Fix retain_grad test
      
      * Improve docs
      
      * Clean up and make test_initialization pass
      
      * Improve variable names
      
      * Add copied from statements
      
      * Improve docs
      
      * Fix style
      
      * Improve docs
      
      * Improve docs, move tests to model folder
      
      * Fix rebase
      
      * Remove DetrForSegmentation from auto mapping
      
      * Apply suggestions from code review
      
      * Improve variable names and docstrings
      
      * Apply some more suggestions from code review
      
      * Apply suggestion from code review
      
      * better docs and variables names
      
      * hint to num_queries and two_stage confusion
      
      * remove asserts and code refactor
      
      * add exception if two_stage is True and with_box_refine is False
      
      * use f-strings
      
      * Improve docs and variable names
      
      * Fix code quality
      
      * Fix rebase
      
      * Add require_torch_gpu decorator
      
      * Add pip install ninja to CI jobs
      
      * Apply suggestion of @sgugger
      
      * Remove DeformableDetrForObjectDetection from auto mapping
      
      * Remove DeformableDetrModel from auto mapping
      
      * Add model to toctree
      
      * Add model back to mappings, skip model in pipeline tests
      
      * Apply @sgugger's suggestion
      
      * Fix imports in the init
      
      * Fix copies
      
      * Add CPU implementation
      
      * Comment out GPU function
      
      * Undo previous change
      
      * Apply more suggestions
      
      * Remove require_torch_gpu annotator
      
      * Fix quality
      
      * Add logger.info
      
      * Fix logger
      
      * Fix variable names
      
      * Fix initializaztion
      
      * Add missing initialization
      
      * Update checkpoint name
      
      * Add model to doc tests
      
      * Add CPU/GPU equivalence test
      
      * Add Deformable DETR to pipeline tests
      
      * Skip model for object detection pipeline
      Co-authored-by: default avatarNicolas Patry <patry.nicolas@protonmail.com>
      Co-authored-by: default avatarNouamane Tazi <nouamane98@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <Sylvain.gugger@gmail.com>
      59407bbe
    • Ahmed Elnaggar's avatar
      Add Support to Gradient Checkpointing for LongT5 (#18977) · 5a70a77b
      Ahmed Elnaggar authored
      FlaxLongT5PreTrainedModel is missing "enable_gradient_checkpointing" function. This gives an error if someone tries to enable gradient checkpointing for longt5.
      This pull request fixes it.
      5a70a77b
  2. 13 Sep, 2022 10 commits
  3. 12 Sep, 2022 14 commits
  4. 10 Sep, 2022 2 commits