1. 04 Jan, 2023 1 commit
  2. 03 Jan, 2023 2 commits
    • samuelpullely's avatar
      Enable `decoder_attention_mask` in `generate` function (#20726) · 15c68c67
      samuelpullely authored
      * Enable `decoder_attention_mask` in `generate` function
      
      * Make style corrections
      
      * Run `make repo-consistency`
      
      * Add integration test
      15c68c67
    • NielsRogge's avatar
      Add GIT (GenerativeImage2Text) (#20295) · 9c6f7485
      NielsRogge authored
      
      
      * First draft
      
      * Make model instantiation work
      
      * Fix copied from statement
      
      * More fixes
      
      * Add correct output head
      
      * Improve configuration
      
      * Add conversion script
      
      * Improve conversion script
      
      * Remove token_type_ids
      
      * Fix conversion of projection layers
      
      * Convert all weights
      
      * Use cats image
      
      * Make logits match
      
      * Generate caption on cats image
      
      * Add GITProcessor
      
      * Update conversion script
      
      * Add support for more checkpoints
      
      * Fix conversion script
      
      * Add initial tests
      
      * Remove cross-attention
      
      * More improvements
      
      * Remove is_decoder
      
      * Improve model tests
      
      * Improve tests
      
      * Improve model outputs
      
      * Fix model outputs equivalence
      
      * Fix more tests
      
      * Remove unused code
      
      * Use generate to generate text, no use of cache for now
      
      * Use generate more appropriately
      
      * Fix config tests
      
      * Fix style
      
      * Add support for use_cache
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * Fix style
      
      * Fix GIT vision encoder
      
      * Update README
      
      * Fix integration test
      
      * Set bos and eos token ids
      
      * Improve docs
      
      * Improve code
      
      * Add support for provided attention_mask
      
      * Add copied from statement
      
      * Fix gradient checkpointing test
      
      * Set model_input_names
      
      * Investigate model_input_names
      
      * Remove script
      
      * Fix model inputs
      
      * Fix docstring
      
      * Rename GIT to Git
      
      * Support more models
      
      * Add support for textvqa model
      
      * Add video support
      
      * Extend conversion script for video
      
      * Add support for large variant
      
      * Add support for more models
      
      * Fix config archive map
      
      * Update integration test
      
      * Fix README
      
      * Fix CLIP mean and std
      
      * Update processor
      
      * Fix use_cache for video, thanks @gante
      
      * Remove print statements
      
      * Remove assertion
      
      * Add processor tests
      
      * Fix model_input_names
      
      * Use Auto API for processor
      
      * Fix processor tests
      
      * Fix integration test
      
      * Fix pipeline test
      
      * Make tests faster
      
      * Update conversion script
      
      * Update conversion script
      
      * Convert more checkpoints
      
      * Update conversion script
      
      * Fix typo
      
      * Update docstrings
      
      * Improve code snippets
      
      * Fix doc tests
      
      * Add more code examples茅
      
      * Fix doc tests
      
      * Add integration tests
      
      * Fix unused variable
      
      * revert
      
      * Add GIT to Japanese README
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      9c6f7485
  3. 31 Dec, 2022 1 commit
    • Hao Wang's avatar
      update pyknp to rhoknp (#20890) · 375801d5
      Hao Wang authored
      * update pyknp to rhoknp
      
      * fix linter
      
      * fix linter
      
      * fix linter
      
      * fix linter
      
      * fix linter
      
      * support rhoknp==1.1.0, fix testcase
      375801d5
  4. 27 Dec, 2022 1 commit
  5. 23 Dec, 2022 1 commit
  6. 21 Dec, 2022 4 commits
  7. 20 Dec, 2022 1 commit
  8. 19 Dec, 2022 1 commit
  9. 16 Dec, 2022 1 commit
    • NielsRogge's avatar
      Add Swin2SR (#19784) · 26dd041c
      NielsRogge authored
      
      
      * First draft
      
      * Add more improvements
      
      * Improve forward pass
      
      * Fix layernorm
      
      * Add upscaler
      
      * More improvements
      
      * More improvements
      
      * More improvements
      
      * Improve conversion script
      
      * Add preprocessing
      
      * Make output match original implementation
      
      * Add additional attributes
      
      * Add support for more models
      
      * Support more models
      
      * Add support for real world sr
      
      * Add initial Swin2SRFeatureExtractor
      
      * Add ImageSuperResolutionOutput
      
      * Make more tests pass
      
      * Use BaseModelOutput
      
      * Fix one more test
      
      * Fix more tests
      
      * Fix another test
      
      * Fix all tests
      
      * Rename to Swin2SRImageProcessor
      
      * Fix toctree
      
      * Fix toctree
      
      * Fix rebase
      
      * Improve Swin2SRImageProcessor
      
      * Remove feature extractor file
      
      * Improve model
      
      * Improve conversion script
      
      * Fix integration test
      
      * Fix init
      
      * Fix conversion script
      
      * Address comments
      
      * Improve upsampler
      
      * Add NearestConvUpsampler
      
      * Improve pixel shuffle upsampler
      
      * Improve auxiliary upsampler
      
      * Improve conversion script
      
      * Rename conv_last to final_convolution
      
      * Fix rebase
      
      * Improve upsample module
      
      * Add padding to image processor
      
      * Fix bug
      
      * Update padding
      
      * Remove print statement and fix integration test
      
      * Improve docs
      
      * Add image processor tests
      
      * Convert all checkpoints, fix tests茅
      
      * Remove print statements
      
      * Fix import
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      26dd041c
  10. 14 Dec, 2022 4 commits
  11. 13 Dec, 2022 2 commits
  12. 12 Dec, 2022 3 commits
    • amyeroberts's avatar
      Add decorator for flaky Donut tests (#20739) · 1416b5d9
      amyeroberts authored
      * Add decorator for flaky tests
      
      * Fix up
      1416b5d9
    • Ariel Ekgren's avatar
      Add gpt-sw3 model to transformers (#20209) · 5f94855d
      Ariel Ekgren authored
      
      
      * Add templates for gpt-sw3
      
      * Add templates for gpt-sw3
      
      * Added sentencepiece tokenizer
      
      * intermediate commit with many changes
      
      * fixed conflicts
      
      * Init commit for tokenization port
      
      * Tokenization progress
      
      * Remove fast tokenizer
      
      * Clean up and rename spm.model -> spiece.model
      
      * Remove TF -> PT conversion script template, Clean up Megatron -> PT script
      
      * Optimize encode & decode performance
      
      * added new attention
      
      * added new attention
      
      * attention for gpt-sw3 working
      
      * attention good
      
      * Cache is now working
      
      * fixed attention mask so that it works with causal attention
      
      * fixed badbmm bug for cpu and caching
      
      * updated config with correct parameters
      
      * Refactor and leave optimizations as separate functions to avoid breaking expected functionality
      
      * Fix special tokens mapping for both tokenizers
      
      * cleaning up of code and comments
      
      * HF compatible attention outputs
      
      * Tokenizer now passing tests, add documentation
      
      * Update documentation
      
      * reverted back to base implementation after checking that it is identical to pretrained model
      
      * updated gpt-sw3 config
      
      * updated conversion script
      
      * aligned parameters with gpt-sw3 config
      
      * changed default scale_attn_by_inverse_layer_idx to true
      
      * removed flag from conversion script
      
      * added temporary model path
      
      * reverted back to functioning convert script
      
      * small changes to default config
      
      * updated tests for gpt-sw3
      
      * make style, make quality, minor cleanup
      
      * Change local paths to testing online repository
      
      * Change name: GptSw3 -> GPTSw3
      
      * Remove GPTSw3TokenizerFast references
      
      * Use official model repository and add more model sizes
      
      * Added reference to 6.7b model
      
      * Add GPTSw3DoubleHeadsModel to IGNORE_NON_AUTO_CONFIGURED, like GPT2DoubleHeadsModel
      
      * Remove pointers to non-existing TFGPTSw3
      
      * Add GPTSw3 to docs/_toctree.yml
      
      * Remove TF artifacts from GPTSw3 in __init__ files
      
      * Update README:s with 'make fix-copies'
      
      * Add 20b model to archive list
      
      * Add documentation for GPT-Sw3
      
      * Fix typo in documentation for GPT-Sw3
      
      * Do 'make fix-copies' again after having updated docs
      
      * Fix some typos in docs
      
      * Update src/transformers/models/gpt_sw3/configuration_gpt_sw3.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/configuration_gpt_sw3.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/__init__.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/__init__.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/convert_megatron_to_pytorch.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/modeling_gpt_sw3.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update tests/models/gpt_sw3/test_tokenization_gpt_sw3.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/modeling_gpt_sw3.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/modeling_gpt_sw3.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Resolve comments from PR feedback
      
      * Resolve more comments from PR feedback, also set use_cache=True in convert script
      
      * Add '# Copied from' comments for GPTSw3 modeling
      
      * Set 'is_parallelizable = False'
      
      * Remove '# Copied from' where code was modified and add 'with x->y' when appropriate
      
      * Remove parallelize in mdx
      
      * make style, make quality
      
      * Update GPTSw3Config default values and corresponding documentation
      
      * Update src/transformers/models/gpt_sw3/tokenization_gpt_sw3.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/__init__.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Clean up and protect GPTSw3Tokenizer imports with is_sentencepiece_available
      
      * Make style, make quality
      
      * Add dummy object for GPTSw3Tokenizer via 'make fix-copies'
      
      * make fix-copies
      
      * Remove GPTSw3 modeling classes
      
      * make style, make quality
      
      * Add GPTSw3 auto-mappings for other GPT2 heads
      
      * Update docs/source/en/model_doc/gpt-sw3.mdx
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/convert_megatron_to_pytorch.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_sw3/tokenization_gpt_sw3.py
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Remove old TODO-comment
      
      * Add example usage to GPTSw3Tokenizer docstring
      
      * make style, make quality
      
      * Add implementation details and example usage to gpt-sw3.mdx
      Co-authored-by: default avatarJoeyOhman <joeyoh@kth.se>
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      5f94855d
    • Yih-Dar's avatar
  13. 09 Dec, 2022 3 commits
  14. 08 Dec, 2022 4 commits
  15. 07 Dec, 2022 5 commits
    • Cole Howard's avatar
      Add TFBartForSequenceClassification (#20570) · fc95386e
      Cole Howard authored
      * read to load
      
      * base functionality
      
      * revert init
      
      * fix dummy data
      
      * moving right along
      
      * moving right along
      
      * finally
      
      * cleanup
      
      * pull out comment
      
      * add test
      
      * update docstring for main class
      
      * flake comments and rewriting copies from make repo-consistency`
      
      * remove irrelevant differences/accidental spaces
      
      * put copies back after space removals
      
      * mid
      
      * final test pass
      
      * stray comment
      
      * update test file
      
      * update test file
      
      * fixup
      
      * black
      
      * missed
      
      * black missed one more
      
      * sytle
      
      * add doc update
      
      * fix order of output class
      
      * comment
      
      * Revert "comment"
      
      This reverts commit 03f86b6948808461939cc8ad4ad74305dfb67700.
      
      * remove redundant function, and redundant reshape
      
      * move change out of common
      
      * style
      
      * put common spaces back
      
      * reorder kwargs in output
      
      * doc style
      fc95386e
    • Sanchit Gandhi's avatar
      [Whisper] Fix forced decoder ids (#20652) · 77382e91
      Sanchit Gandhi authored
      * [Whisper] Fix forced decoder ids
      
      * fix test
      77382e91
    • Younes Belkada's avatar
      Add `dpt-hybrid` support (#20645) · 7c5eaf9e
      Younes Belkada authored
      
      
      * add `dpt-hybrid` support
      
      * refactor
      
      * final changes, all tests pass
      
      * final cleanups
      
      * final changes
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * fix docstring
      
      * fix typo
      
      * change `vit_hybrid` to `hybrid`
      
      * replace dataclass
      
      * add docstring
      
      * move dataclasses
      
      * fix test
      
      * add `PretrainedConfig` support for `backbone_config`
      
      * fix docstring
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * remove `embedding_type` and replace it by `is_hybrid`
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      7c5eaf9e
    • NielsRogge's avatar
      Add BiT + ViT hybrid (#20550) · d151a8c5
      NielsRogge authored
      
      
      * First draft
      
      * More improvements
      
      * Add backbone, first draft of ViT hybrid
      
      * Add AutoBackbone
      
      * More improvements
      
      * Fix bug
      
      * More improvements
      
      * More improvements
      
      * Convert ViT-hybrid
      
      * More improvements
      
      * add patch bit
      
      * Fix style
      
      * Improve code
      
      * cleaned v1
      
      * more cleaning
      
      * more refactoring
      
      * Improve models, add tests
      
      * Add docs and tests
      
      * Make more tests pass
      
      * Improve default backbone config
      
      * Update model_type
      
      * Fix more tests
      
      * Add more copied from statements
      
      * More improvements
      
      * Add push to hub to conversion scripts
      
      * clean
      
      * more cleanup
      
      * clean
      
      * replace to
      
      * fix
      
      * Update src/transformers/models/bit/configuration_bit.py
      Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * fix base model prefix
      
      * more cleaning
      
      * get rid of stem
      
      * clean
      
      * replace flag
      
      * Update src/transformers/models/bit/configuration_bit.py
      Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * Update src/transformers/models/bit/configuration_bit.py
      Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * add check
      
      * another check
      
      * fix for hybrid vit
      
      * final fix
      
      * update config
      
      * fix class name
      
      * fix `make fix-copies`
      
      * remove `use_activation`
      
      * Update src/transformers/models/bit/configuration_bit.py
      
      * rm unneeded file
      
      * Add BiT image processor
      
      * rm unneeded file
      
      * add doc
      
      * Add image processor to conversion script
      
      * Add ViTHybrid image processor
      
      * Add resources
      
      * Move bit to correct position
      
      * Fix auto mapping
      
      * Rename hybrid to Hybrid
      
      * Fix name in toctree
      
      * Fix READMEs'
      
      * Improve config
      
      * Simplify GroupNormActivation layer
      
      * fix test + make style
      
      * Improve config
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * remove comment
      
      * remove comment
      
      * replace
      
      * replace
      
      * remove all conv_layer
      
      * refactor norm_layer
      
      * revert x
      
      * add copied from
      
      * last changes + integration tests
      
      * make fixup
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * fix name
      
      * fix message
      
      * remove assert and refactor
      
      * refactor + make fixup
      
      * refactor - add  + sfety checker
      
      * fix docstring + checkpoint names
      
      * fix merge issues
      
      * fix function name
      
      * fix copies
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * fix model checkpoint
      
      * fix doctest output
      
      * vit name on doc
      
      * fix name on doc
      
      * fix small nits
      
      * fixed integration tests
      
      * final changes - slow tests pass
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      Co-authored-by: default avataryounesbelkada <younesbelkada@gmail.com>
      Co-authored-by: default avatarYounes Belkada <49240599+younesbelkada@users.noreply.github.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      d151a8c5
    • NielsRogge's avatar
      [MaskFormer] Add support for ResNet backbone (#20483) · b610c47f
      NielsRogge authored
      
      
      * Add SwinBackbone
      
      * Add hidden_states_before_downsampling support
      
      * Fix Swin tests
      
      * Improve conversion script
      
      * Add id2label mappings
      
      * Add vistas mapping
      
      * Update comments
      
      * Fix backbone
      
      * Improve tests
      
      * Extend conversion script
      
      * Add Swin conversion script
      
      * Fix style
      
      * Revert config attribute
      
      * Remove SwinBackbone from main init
      
      * Remove unused attribute
      
      * Use encoder for ResNet backbone
      
      * Improve conversion script and add integration test
      
      * Apply suggestion
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      b610c47f
  16. 06 Dec, 2022 1 commit
  17. 05 Dec, 2022 5 commits
    • Sylvain Gugger's avatar
      Fix test for file not found (#20604) · 5764efe5
      Sylvain Gugger authored
      5764efe5
    • Sanchit Gandhi's avatar
      [Whisper] Fix decoder ids methods (#20599) · 74fb524e
      Sanchit Gandhi authored
      * [Whisper] Fix decoder ids methods
      
      * enum property
      74fb524e
    • Younes Belkada's avatar
      [Vision] `.to` function for ImageProcessors (#20536) · ef0f85cd
      Younes Belkada authored
      
      
      * add v1 with tests
      
      * add checker
      
      * simplified version
      
      * update docstring
      
      * better version
      
      * fix docstring + change order
      
      * make style
      
      * tests + change conditions
      
      * final tests
      
      * modify docstring
      
      * Update src/transformers/feature_extraction_utils.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * replace by `ValueError`
      
      * fix logic
      
      * apply suggestions
      
      * `dtype` is not needed
      
      * adapt suggestions
      
      * remove `_parse_args_to_device`
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      ef0f85cd
    • Kamal Raj Kanakarajan's avatar
      Add BioGPT (#20420) · 13e73668
      Kamal Raj Kanakarajan authored
      * biogpt initial commit
      
      * updated init
      
      * fix faster decoding with use_cache
      
      * 1. fix input_ids and input_embeds with correct device
      2. added _keys_to_ignore_on_load_missing
      3. updated prepare_inputs_for_generation
      
      * add activation_dropout and scale_embedding
      
      * replace fsmt attention with bart attention
      
      * added test
      
      * run make fix-copies
      
      * doc init and fix build
      
      * updated README with proper information
      
      * 1. added tips to docs
      2. updated BioGptTokenizer func
      
      * 1. added tokenizer test
      2. refactor tokenizer
      
      * make fixup
      
      * add biogpt fairseq to hf converter
      
      * updated layer names more
      similar to original checkpoints
      
      * config update doc string and set defaults
      
      * added "#copied" from bart model and
      updated doc strings
      
      * enable model_input_names in tokenizer
      
      * 1.  positionalembedding depending on attention_mask
      2. added attention mask to prepare for generation
      
      * added test to verify past and generation
      
      * BioGptLMHeadModel -> BioGptForCausalLM
      
      * fix typo
      
      * tokenization and test
      Copyright and updated assertion
      
      * updated Copyright and
      one func at time in line
      
      * Copyright updates and
      minor doc fix
      
      * replace assertion with ValueError
      
      * rm extra space
      
      * added code syntax
      
      * revert cmnt position change
      
      * add tokenizer to auto
      
      * updated doc string
      
      * tokenizer doc string update
      
      * biogpt hub model update to microsoft/biogpt
      
      * make fixup
      
      * rm cmnt to fix flake8 5.0.4 vs 6 error
      13e73668
    • Francisco Kurucz's avatar
      e135a6c9