1. 24 May, 2022 2 commits
    • Jason Phang's avatar
      [WIP] Adding GPT-NeoX-20B (#16659) · 71e60272
      Jason Phang authored
      
      
      * initial
      
      * first try
      
      * working 20B
      
      * 20B tokenizers
      
      * Docs
      
      * Import fixes for missing classes
      
      * Update docs, fixup
      
      * black formatting
      
      * isort
      
      * flake
      
      * dummy objects
      
      * documentation
      
      * Documentation yml
      
      * more docs
      
      * tweaks for tests
      
      * tokenization auto
      
      * fix neox tests
      
      * test
      
      * test
      
      * einsum
      
      * address PR feedback
      
      * Documentation
      
      * Update README.md
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_neox/__init__.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_neox/configuration_gpt_neox.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Remove undefined LaTeX syntax
      
      * Update to full url to avoid confusion about if that's supposed to refer to the Hub
      
      * fix auto
      
      * move tests
      
      * documentation fix
      
      * more doc fixes
      
      * test refactor
      
      * fix import
      
      * fix import
      
      * fix import
      
      * fix import
      
      * fix import
      
      * style fixes
      
      * More modeling fixes
      Co-authored-by: default avatarJason Phang <zp489@gr057.hpc.nyu.edu>
      Co-authored-by: default avatarStella Biderman <stellabiderman@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      71e60272
    • NielsRogge's avatar
      Add LayoutLMv3 (#17060) · 31ee80d5
      NielsRogge authored
      
      
      * Make forward pass work
      
      * More improvements
      
      * Remove unused imports
      
      * Remove timm dependency
      
      * Improve loss calculation of token classifier
      
      * Fix most tests
      
      * Add docs
      
      * Add model integration test
      
      * Make all tests pass
      
      * Add LayoutLMv3FeatureExtractor
      
      * Improve integration test + make fixup
      
      * Add example script
      
      * Fix style
      
      * Add LayoutLMv3Processor
      
      * Fix style
      
      * Add option to add visual labels
      
      * Make more tokenizer tests pass
      
      * Fix more tests
      
      * Make more tests pass
      
      * Fix bug and improve docs
      
      * Fix import of processors
      
      * Improve docstrings
      
      * Fix toctree and improve docs
      
      * Fix auto tokenizer
      
      * Move tests to model folder
      
      * Move tests to model folder
      
      * change default behavior add_prefix_space
      
      * add prefix space for fast
      
      * add_prefix_spcae set to True for Fast
      
      * no space before `unique_no_split` token
      
      * add test to hightligh special treatment of added tokens
      
      * fix `test_batch_encode_dynamic_overflowing` by building a long enough example
      
      * fix `test_full_tokenizer` with add_prefix_token
      
      * Fix tokenizer integration test
      
      * Make the code more readable
      
      * Add tests for LayoutLMv3Processor
      
      * Fix style
      
      * Add model to README and update init
      
      * Apply suggestions from code review
      
      * Replace asserts by value errors
      
      * Add suggestion by @ducviet00
      
      * Add model to doc tests
      
      * Simplify script
      
      * Improve README
      
      * a step ahead to fix
      
      * Update pair_input_test
      
      * Make all tokenizer tests pass - phew
      
      * Make style
      
      * Add LayoutLMv3 to CI job
      
      * Fix auto mapping
      
      * Fix CI job name
      
      * Make all processor tests pass
      
      * Make tests of LayoutLMv2 and LayoutXLM consistent
      
      * Add copied from statements to fast tokenizer
      
      * Add copied from statements to slow tokenizer
      
      * Remove add_visual_labels attribute
      
      * Fix tests
      
      * Add link to notebooks
      
      * Improve docs of LayoutLMv3Processor
      
      * Fix reference to section
      Co-authored-by: default avatarSaulLu <lucilesaul.com@gmail.com>
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      31ee80d5
  2. 23 May, 2022 3 commits
  3. 18 May, 2022 1 commit
  4. 17 May, 2022 3 commits
  5. 16 May, 2022 6 commits
  6. 13 May, 2022 4 commits
    • Ignacio Talavera's avatar
      Guide to create custom models in Spanish (#17158) · ee393c00
      Ignacio Talavera authored
      
      
      * file copied and toctree updated
      
      * Intro and configuration translated
      
      * model section translated
      
      * enter hotfix
      
      * Translation over, correction pending
      
      * Typos and corrections
      
      * Update docs/source/es/create_a_model.mdx
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      * Update docs/source/es/create_a_model.mdx
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      * Update docs/source/es/create_a_model.mdx
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      * Update docs/source/es/create_a_model.mdx
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      ee393c00
    • Gerardo Huerta Robles's avatar
      Translated version of model_sharing.mdx doc to spanish (#16184) · 16be4229
      Gerardo Huerta Robles authored
      
      
      * Translated version of model_sharing to spanish
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Addind model sharing to _toctree.yml
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      16be4229
    • Fellip Silva Alves's avatar
      [ fast_tokenizers.mdx ] - Added translation to portuguese to tutorial (#17076) · f9024814
      Fellip Silva Alves authored
      
      
      * [ fast_tokenizers.mdx ] - Added translation to portuguese to tutorial
      
      * Delete docs/source/pt-br directory
      
      * [ fast_tokenizers.mdx ] - Continuing work on file
      
      * [ fast_tokenizers.mdx ] - Continuing work on file
      
      * Add fast tokenizers to _toctree.yml
      
      * Eliminated config and toctree.yml
      
      * Nits in fast_tokenizers.mdx
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      f9024814
    • Rafael Zimmer's avatar
      Added translation of installation.mdx to Portuguese Issue #16824 (#16979) · 85fc4559
      Rafael Zimmer authored
      
      
      * Added translation of installation.mdx to Portuguese, as well
      as default templates of _toctree.yml and _config.py
      
      * [ build_documentation.yml ] - Updated doc_builder to build
      documentation in Portuguese.
      [ pipeline_tutorial.mdx ] - Created translation for the pipeline_tutorial.mdx.
      
      * [ build_pr_documentation.yml ] - Added pt language to pr_documentation builder.
      
      [ pipeline_tutorial.mdx ] - Grammar changes.
      
      * [ accelerate.mdx ] - Translated to Portuguese the acceleration tutorial.
      
      * [ multilingual.mdx ] - Added portuguese translation for multilingual tutorial.
      
      [ training.mdx ] - Added portuguese translation for training tutorial.
      
      * [ preprocessing.mdx ] - WIP
      
      * Update _toctree.yml
      
      * Adding Pré-processamento to _toctree.yml
      
      * Update accelerate.mdx
      
      * Nits and eliminate preprocessing file while it is ready
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      85fc4559
  7. 12 May, 2022 8 commits
  8. 11 May, 2022 2 commits
    • Amanpreet Singh's avatar
      [feat] Add FLAVA model (#16654) · a10f6183
      Amanpreet Singh authored
      * [WIP] Add FLAVA model
      
      This PR aims to add [FLAVA](ihttps://arxiv.org/abs/2112.04482) model to the transformers repo.
      
      Following checklist delineates the list of things to be done for this PR
      to be complete:
      
      [x] Flava init
      [x] Flava base models
      [x] Flava layers
      [x] Flava Configs
      [x] Flava encoders
      [x] Flava pretraining models
      [ ] Flava classification/retrieval models (To be added in a separate PR)
      [x] Documentation updates 
      [x] Imports updates 
      [x] Argstring updates
      [x] Flava pretrained checkpoints 
      [x] Flava tests
      [x] Flava processors 
      [x] Sanity check
      [x] Lint
      a10f6183
    • hasan salim kanmaz's avatar
      [WIP] Enable reproducibility for distributed trainings (#16907) · c33f6046
      hasan salim kanmaz authored
      
      
      * add seed worker and set_deterministic_seed_for_cuda function to enforce reproducability
      
      * change function name to enable determinism, add docstrings, reproducability support for tf
      
      * change function name to enable_determinism_for_distributed_training
      
      * revert changes in set_seed and call set_seed within enable_full_determinism
      
      * add one position argument for seed_worker function
      
      * add full_determinism flag in training args and call enable_full_determinism when it is true
      
      * add enable_full_determinism to documentation
      
      * apply make fixup after the last commit
      
      * Update src/transformers/training_args.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      c33f6046
  9. 10 May, 2022 2 commits
  10. 09 May, 2022 4 commits
  11. 06 May, 2022 1 commit
  12. 05 May, 2022 1 commit
  13. 04 May, 2022 3 commits