1. 07 Dec, 2021 2 commits
    • [deepspeed] fix --load_best_model_at_end (#14652) · b66c5ab2
      Stas Bekman authored
      * [deepspeed] fix load_best_model_at_end
      
      * try with pull_request_target
      
      * revert: try with pull_request_target
      
      * style
      
      * add test
      
      * cleanup
    • Add mLUKE (#14640) · 30646a0a
      Ryokan RI authored
      * implement MLukeTokenizer and LukeForMaskedLM
      
      * update tests
      
      * update docs
      
      * add LukeForMaskedLM to check_repo.py
      
      * update README
      
      * fix test and specify the entity pad id in tokenization_(m)luke
      
      * fix EntityPredictionHeadTransform
  2. 06 Dec, 2021 7 commits
  3. 03 Dec, 2021 2 commits
  4. 02 Dec, 2021 3 commits
  5. 01 Dec, 2021 5 commits
    • Fix mask token handling (#14364) · 934e2799
      Li-Huai (Allan) Lin authored
      * Fix mask token handling
      
      * Revert "Fix mask token handling"
      
      This reverts commit daaa3f5291b1f71e5bc3604ca281c000000c4648.
      
      * Fix FNet mask token tokenization
    • Doc new front (#14590) · 4df7d05a
      Sylvain Gugger authored
      
      
      * Convert PretrainedConfig doc to Markdown
      
      * Use syntax
      
      * Add necessary doc files (#14496)
      
      * Doc fixes (#14499)
      
      * Fixes for the new front
      
      * Convert DETR file for table
      
      * Title is needed
      
      * Simplify a bit
      
      * Even simpler
      
      * Remove imports
      
      * Fix typo in toctree (#14516)
      
      * Fix checkpoints badge
      
      * Update versions.yml format (#14517)
      
      * Doc new front github actions (#14512)
      
      * Doc new front github actions
      
      * Fix docstring
      
      * Fix feature extraction utils import (#14515)
      
      * Address Julien's comments
      
      * Push to doc-builder
      
      * Ready for merge
      
      * Remove old build and deploy
      
      * Doc misc fixes (#14583)
      
      * Rm versions.yml from doc
      
      * Fix converting.rst
      
      * Rm pretrained_models from toctree
      
      * Fix index links (#14567)
      
      * Fix links in README
      
      * Localized READMEs
      
      * Fix copy script
      
      * Fix find doc script
      
      * Update README_ko.md
      Co-authored-by: Julien Chaumond <julien@huggingface.co>
      
      * Adapt build command to new CLI tools (#14578)
      
      * Fix typo
      
      * Fix doc interlinks (#14589)
      
      * Convert PretrainedConfig doc to Markdown
      
      * Use syntax
      
      * Rm pattern <[a-z]+(.html).*>
      
      * Rm huggingface.co/transformers/master
      
      * Rm .html
      
      * Rm .html from index.mdx
      
      * Rm .html from model_summary.rst
      
      * Update index.mdx rm html
      
      * Update remove .html
      
      * Fix inner doc links
      
      * Fix interlink in preprocessing.rst
      
      * Update pr_checks
      Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Styling
      Co-authored-by: Mishig Davaadorj <mishig.davaadorj@coloradocollege.edu>
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      Co-authored-by: Julien Chaumond <julien@huggingface.co>
    • fix autocast for older pytorch · 14cc50d0
      Stas Bekman authored
    • FlaxGPTJ (#14396) · 4c0dd199
      Suraj Patil authored
      * add flax gptj
      
      * no bias in attention dense
      
      * no wpe
      
      * fix rotary embeddings
      
      * fix rotary embeds
      
      * fix rotary embeds
      
      * quality
      
      * doc and quality
      
      * fix equivalence tests
    • WIP: Support for Training with BF16 (#13207) · 70996a54
      Jamie DeAntonis authored
      
      
      * started bf16 integration
      
      * minor changes
      
      * code now runs
      
      * style
      
      * lay foundation for bf16 testing
      
      * lay foundation for bf16 testing
      
      * start the tests
      
      * better bf16 check
      
      * style
      
      * 2 separate checkers - one for bf16 support, another for bf16+autocast
      
      * Update src/transformers/training_args.py
      Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
      
      * a couple of comment resolutions
      
      * more comment resolutions
      
      * resolved a small bug
      
      * just some print statements
      
      * added todo marking
      
      * added a todo
      
      * adjust for API change s/fast_dtype/dtype/
      
      * fix style
      
      * merge 2 bf16 util functions
      
      * bf16 now does scaling too
      
      * Add support for bfloat16
      
      * Revert T5 layernorm to float32
      
      This is based on the comment at https://github.com/huggingface/transformers/pull/14448/files#r752660929 and the PyTorch PR https://github.com/pytorch/pytorch/pull/66920.
      
      * Add comment about conversion to float32 before returning the numpy data
      
      * Add comment about AMP-bfloat16 incompatibility
      
      * Fix formatting
      
      * typo
      
      * reformer / bf16
      
      * cleanup
      
      * require at least pt-1.10
      
      * fix
      
      * will deal with deepspeed separately
      
      * cleanup
      
      * revert
      
      * cleanup
      
      * fp16_full_eval and bf16_full_eval are separate modes
      
      * proper deprecation
      
      * cleanup
      
      * test and fixes
      
      * spelling
      
      * cleanup
      
      * add a note that this API is experimental
      Co-authored-by: jamie <jamie@cortx.com>
      Co-authored-by: Stas Bekman <stas@stason.org>
      Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
      Co-authored-by: suriya <suriya@cortx.com>
      Co-authored-by: Manuel R. Ciosici <manuelrciosici@gmail.com>
  6. 30 Nov, 2021 5 commits
    • VisionTextDualEncoder (#13511) · fc1d97f2
      Suraj Patil authored
      
      
      * init vision_text_dual_encoder
      
      * fix merge
      
      * remove extra heads
      
      * fix tests
      
      * remove VISION_TEXT_DUAL_ENCODER_PRETRAINED_CONFIG_ARCHIVE_MAP
      
      * remove archive map
      
      * fix imports
      
      * fix more imports
      
      * fix init
      
      * delete tokenizers
      
      * fix imports
      
      * clean
      
      * support clip's vision model
      
      * handle None config
      
      * begin tests
      
      * more test and few fixes
      
      * warn about newly init weights
      
      * more tests
      
      * add loss to model
      
      * remove extra classes from doc
      
      * add processor
      
      * doc and small fixes
      
      * add start docstr
      
      * update flax model
      
      * flax tests
      
      * more flax tests
      
      * doc
      
      * quality
      
      * doc and quality
      
      * fix doc
      
      * doc
      
      * remove comments
      
      * update warning
      
      * quality
      
      * fix docs
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * replace asserts, fix imports
      
      * update imports
      
      * fix import
      
      * address some review comments
      
      * fix check
      
      * reduce tolerance
      
      * fix test
      
      * add flax integration test
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * address Sylvain's comments
      
      * fix style
      
      * add pt_flax_equivalence test in PT tests
      
      * add pt integration test
      
      * update test
      
      * use pre-trained checkpoint in examples
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    • use functional interface for softmax in attention (#14198) · 6ed9882d
      Thomas Viehmann authored
      * use functional interface instead of instantiating module and immediately calling it
      
      * fix torch.nn.functional to nn.functional. Thank you Stas!
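The change described in this commit can be illustrated with a small, library-free sketch. The `Softmax` class and `softmax` function below are hypothetical toy stand-ins written for this note, not the PyTorch objects; in PyTorch the corresponding spellings would be `nn.Softmax(dim=-1)(x)` and `nn.functional.softmax(x, dim=-1)`.

```python
import math


def softmax(scores):
    """Functional form: a plain function with no state to construct."""
    peak = max(scores)
    # Subtract the maximum before exponentiating for numerical stability.
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class Softmax:
    """Module-style form: an object that must be instantiated before use."""

    def __call__(self, scores):
        return softmax(scores)


scores = [1.0, 2.0, 3.0]

# Before: build a throwaway object and call it immediately.
probs_module = Softmax()(scores)

# After: call the function directly; same result, no temporary object.
probs_functional = softmax(scores)
```

Both spellings compute the same values; the functional one simply avoids constructing an object that is discarded right after the call.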
    • Add documentation for multi-label classification (#14168) · 4176bc16
      giacomo snidero authored
      * update example docstring multilabel example
      
      * update example docstring multilabel example
    • [Flax] Add FlaxBlenderbot (#13633) · faacd747
      Daniel Stancl authored
      
      
      * Init Flax implementation for Blenderbot
      
      * Add a majority of stuff except for tests
      
      * make style quality
      
      * Add tests and fix some bugs
      
      * Add tests
      
      * Clean source code and fix some bugs
      
      * Fix copies and docs
      
      * Fix jax device condition for tests
      
      * Fix layer norm in the encoder
      
      * Fix a few typos in the test file
      
      * make fix-copies
      
      * make fix-copies
      
      * fix layer norm
      
      * Fix Flax params dtype (#13090)
      
      * Fix PR reference (#13098)
      
      * make fix-copies
      
      * Update tests/test_modeling_flax_blenderbot.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
    • Tapas tf (#13393) · c468a87a
      Kamal Raj authored
      * TF Tapas first commit
      
      * updated docs
      
      * updated logger message
      
      * updated pytorch weight conversion
      script to support scalar array
      
      * added use_cache to tapas model config to
      work properly with tf input_processing
      
      * 1. rm embeddings_sum
      2. added # Copied
      3. + TFTapasMLMHead
      4. and lot other small fixes
      
      * updated docs
      
      * + test for tapas
      
      * updated testing_utils to check
      is_tensorflow_probability_available
      
      * converted model logits post processing using
      numpy to work with both PT and TF models
      
      * + TFAutoModelForTableQuestionAnswering
      
      * added TF support
      
      * added test for
      TFAutoModelForTableQuestionAnswering
      
      * added test for
      TFAutoModelForTableQuestionAnswering pipeline
      
      * updated auto model docs
      
      * fixed typo in import
      
      * added tensorflow_probability to run tests
      
      * updated MLM head
      
      * updated tapas.rst with TF  model docs
      
      * fixed optimizer import in docs
      
      * updated convert to np
      data from pt model is not
      `transformers.tokenization_utils_base.BatchEncoding`
      after pipeline upgrade
      
      * updated pipeline:
      1. with torch.no_grad removed, pipeline forward handles
      2. token_type_ids converted to numpy
      
      * updated docs.
      
      * removed `use_cache` from config
      
      * removed floats_tensor
      
      * updated code comment
      
      * updated Copyright Year and
      logits_aggregation Optional
      
      * updated docs and comments
      
      * updated docstring
      
      * fixed model weight loading
      
      * make fixup
      
      * fix indentation
      
      * added tf slow pipeline test
      
      * pip upgrade
      
      * upgrade python to 3.7
      
      * removed from_pt from tests
      
      * revert commit f18cfa9
  7. 29 Nov, 2021 4 commits
  8. 26 Nov, 2021 1 commit
  9. 24 Nov, 2021 2 commits
    • Lysandre Debut's avatar
      f2e90bcb
    • add cache_dir for tokenizer verification loading (#14508) · 6c4d688f
      Vladimir Maryasin authored
      When loading a pretrained tokenizer, a verification is done to ensure
      that the actual tokenizer class matches the class it was called from.
      If the tokenizer is absent, its config file is loaded from the repo.
      
      However, the cache_dir for downloading is not provided, so the
      user-specified cache_dir is ignored, files are stored in several
      places, and incorrect warnings may appear when the default
      cache_dir is unreachable.
      
      This commit fixes that.
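The failure mode this commit message describes can be sketched in a few lines. Everything below is a hypothetical illustration: `fetch_config`, `load_tokenizer_buggy`, `load_tokenizer_fixed`, and `DEFAULT_CACHE_DIR` are made-up names, not the library's functions, and nothing is actually downloaded.

```python
DEFAULT_CACHE_DIR = "/default/cache"  # hypothetical default location


def fetch_config(repo_id, cache_dir=None):
    """Hypothetical downloader: returns the directory it would store files in."""
    return cache_dir or DEFAULT_CACHE_DIR


def load_tokenizer_buggy(repo_id, cache_dir=None):
    # Bug pattern: the verification step does not forward cache_dir,
    # so the config file lands in the default location.
    config_location = fetch_config(repo_id)
    tokenizer_location = fetch_config(repo_id, cache_dir=cache_dir)
    return config_location, tokenizer_location


def load_tokenizer_fixed(repo_id, cache_dir=None):
    # Fix pattern: pass cache_dir through to every download call.
    config_location = fetch_config(repo_id, cache_dir=cache_dir)
    tokenizer_location = fetch_config(repo_id, cache_dir=cache_dir)
    return config_location, tokenizer_location


buggy = load_tokenizer_buggy("some/repo", cache_dir="/user/cache")
fixed = load_tokenizer_fixed("some/repo", cache_dir="/user/cache")
```

In the buggy variant the two files end up in different directories; in the fixed variant both honor the user-specified `cache_dir`.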
  10. 23 Nov, 2021 1 commit
  11. 22 Nov, 2021 3 commits
  12. 21 Nov, 2021 2 commits
  13. 19 Nov, 2021 3 commits