1. 29 Jun, 2022 10 commits
  2. 28 Jun, 2022 10 commits
  3. 27 Jun, 2022 6 commits
  4. 25 Jun, 2022 1 commit
  5. 24 Jun, 2022 6 commits
    • kumapo's avatar
      2ef94ee0
    • willtai's avatar
      Add type hints for gptneox models (#17858) · ef28a402
      willtai authored
      * feat: Add type hints for GPTNeoxForCausalLM and GPTNeoXModel
      
      * fix: removed imported Dict type
      
      * fix: Removed unused List import
      ef28a402
    • Suraj Patil's avatar
    • rooa's avatar
      Add CodeGen model (#17443) · d6b6fb99
      rooa authored
      
      
      * Add CodeGen model
      
      * Add missing key and switch order of super()
      
      * Fix torch.ones init with uint8 instead of bool
      
      * Address comments: copy statements and doc
      
      * update tests
      
      * remove old model parallel
      
      * fix batch gen tests
      
      * fix batch gen test
      
      * update test_gpt2_sample_max_time
      
      * fix codgen test and revert gpt2 test change
      
      * Fix incorrect tie_word_embedding value, typo, URL
      
      * Fix model order in README and styling
      
      * Reorder model list alphabetically
      
      * Set tie_word_embedding to False by default
      
      * Apply suggestions from code review
      
      * Better attn mask name & remove attn masked_bias
      
      * add tokenizer for codegen
      
      * quality
      
      * doc tokenizer
      
      * fix-copies
      
      * add CodeGenTokenizer in converter
      
      * make truncation optional
      
      * add test for truncation
      
      * add copyright
      
      * fix-copies
      
      * fix fast tokenizer decode
      
      * Update src/transformers/models/codegen/tokenization_codegen.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * increase vocab_size in tests
      Co-authored-by: default avatarpatil-suraj <surajp815@gmail.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      d6b6fb99
    • NaN's avatar
      Fix Constrained beam search duplication and weird output issue (#17814) · bc7a6fdc
      NaN authored
      * fix(ConstrainedBeamSearchScorer.step_sentence_constraint): avoid hypothesis duplication between topk and advance
      
      * fix(GenerationMixin.constrained_beam_search): appropriately assign beam scores instead of token scores
      bc7a6fdc
    • NielsRogge's avatar
      Improve vision models (#17731) · 09178705
      NielsRogge authored
      
      
      * Improve vision models
      
      * Add a lot of improvements
      
      * Remove to_2tuple from swin tests
      
      * Fix TF Swin
      
      * Fix more tests
      
      * Fix copies
      
      * Improve more models
      
      * Fix ViTMAE test
      
      * Add channel check for TF models
      
      * Add proper channel check for TF models
      
      * Apply suggestion from code review
      
      * Apply suggestions from code review
      
      * Add channel check for Flax models, apply suggestion
      
      * Fix bug
      
      * Add tests for greyscale images
      
      * Add test for interpolation of pos encodigns
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      09178705
  6. 23 Jun, 2022 7 commits