1. 06 Dec, 2021 14 commits
  2. 05 Dec, 2021 1 commit
  3. 03 Dec, 2021 6 commits
  4. 02 Dec, 2021 8 commits
  5. 01 Dec, 2021 6 commits
    • [doc] bf16/tf32 guide (#14579) · fbe278c7
      Stas Bekman authored
      
      
      * [doc] bf16/tf32 guide
      
      * expand
      
      * expand
      
      * Update docs/source/performance.md
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    • Fix mask token handling (#14364) · 934e2799
      Li-Huai (Allan) Lin authored
      * Fix mask token handling
      
      * Revert "Fix mask token handling"
      
      This reverts commit daaa3f5291b1f71e5bc3604ca281c000000c4648.
      
      * Fix FNet mask token tokenization
    • Doc new front (#14590) · 4df7d05a
      Sylvain Gugger authored
      
      
      * Convert PretrainedConfig doc to Markdown
      
      * Use syntax
      
      * Add necessary doc files (#14496)
      
      * Doc fixes (#14499)
      
      * Fixes for the new front
      
      * Convert DETR file for table
      
      * Title is needed
      
      * Simplify a bit
      
      * Even simpler
      
      * Remove imports
      
      * Fix typo in toctree (#14516)
      
      * Fix checkpoints badge
      
      * Update versions.yml format (#14517)
      
      * Doc new front github actions (#14512)
      
      * Doc new front github actions
      
      * Fix docstring
      
      * Fix feature extraction utils import (#14515)
      
      * Address Julien's comments
      
      * Push to doc-builder
      
      * Ready for merge
      
      * Remove old build and deploy
      
      * Doc misc fixes (#14583)
      
      * Rm versions.yml from doc
      
      * Fix converting.rst
      
      * Rm pretrained_models from toctree
      
      * Fix index links (#14567)
      
      * Fix links in README
      
      * Localized READMEs
      
      * Fix copy script
      
      * Fix find doc script
      
      * Update README_ko.md
      Co-authored-by: Julien Chaumond <julien@huggingface.co>
      
      * Adapt build command to new CLI tools (#14578)
      
      * Fix typo
      
      * Fix doc interlinks (#14589)
      
      * Convert PretrainedConfig doc to Markdown
      
      * Use syntax
      
      * Rm pattern <[a-z]+(.html).*>
      
      * Rm huggingface.co/transformers/master
      
      * Rm .html
      
      * Rm .html from index.mdx
      
      * Rm .html from model_summary.rst
      
      * Update index.mdx rm html
      
      * Update remove .html
      
      * Fix inner doc links
      
      * Fix interlink in preprocessing.rst
      
      * Update pr_checks
      Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Convert PretrainedConfig doc to Markdown
      
      * Use syntax
      
      * Add necessary doc files (#14496)
      
      * Doc fixes (#14499)
      
      * Fixes for the new front
      
      * Convert DETR file for table
      
      * Title is needed
      
      * Simplify a bit
      
      * Even simpler
      
      * Remove imports
      
      * Fix checkpoints badge
      
      * Fix typo in toctree (#14516)
      
      * Update versions.yml format (#14517)
      
      * Doc new front github actions (#14512)
      
      * Doc new front github actions
      
      * Fix docstring
      
      * Fix feature extraction utils import (#14515)
      
      * Address Julien's comments
      
      * Push to doc-builder
      
      * Ready for merge
      
      * Remove old build and deploy
      
      * Doc misc fixes (#14583)
      
      * Rm versions.yml from doc
      
      * Fix converting.rst
      
      * Rm pretrained_models from toctree
      
      * Fix index links (#14567)
      
      * Fix links in README
      
      * Localized READMEs
      
      * Fix copy script
      
      * Fix find doc script
      
      * Update README_ko.md
      Co-authored-by: Julien Chaumond <julien@huggingface.co>
      
      * Adapt build command to new CLI tools (#14578)
      
      * Fix typo
      
      * Fix doc interlinks (#14589)
      
      * Convert PretrainedConfig doc to Markdown
      
      * Use syntax
      
      * Rm pattern <[a-z]+(.html).*>
      
      * Rm huggingface.co/transformers/master
      
      * Rm .html
      
      * Rm .html from index.mdx
      
      * Rm .html from model_summary.rst
      
      * Update index.mdx rm html
      
      * Update remove .html
      
      * Fix inner doc links
      
      * Fix interlink in preprocessing.rst
      
      * Update pr_checks
      Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Styling
      Co-authored-by: Mishig Davaadorj <mishig.davaadorj@coloradocollege.edu>
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      Co-authored-by: Julien Chaumond <julien@huggingface.co>
    • fix autocast for older pytorch · 14cc50d0
      Stas Bekman authored
    • FlaxGPTJ (#14396) · 4c0dd199
      Suraj Patil authored
      * add flax gptj
      
      * no bias in attention dense
      
      * no wpe
      
      * fix rotary embeddings
      
      * fix rotary embeds
      
      * fix rotary embeds
      
      * quality
      
      * doc and quality
      
      * fix equivalence tests
    • WIP: Support for Training with BF16 (#13207) · 70996a54
      Jamie DeAntonis authored
      
      
      * started bf16 integration
      
      * minor changes
      
      * code now runs
      
      * style
      
      * lay foundation for bf16 testing
      
      * lay foundation for bf16 testing
      
      * start the tests
      
      * better bf16 check
      
      * style
      
      * 2 separate checkers - one for bf16 support, another for bf16+autocast
      
      * Update src/transformers/training_args.py
      Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
      
      * a couple of comment resolutions
      
      * more comment resolutions
      
      * resolved a small bug
      
      * just some print statements
      
      * added todo marking
      
      * added a todo
      
      * adjust for API change s/fast_dtype/dtype/
      
      * fix style
      
      * merge 2 bf16 util functions
      
      * bf16 now does scaling too
      
      * Add support for bfloat16
      
      * Revert T5 layernorm to float32
      
      This is based on the comment at https://github.com/huggingface/transformers/pull/14448/files#r752660929 and the PyTorch PR https://github.com/pytorch/pytorch/pull/66920 .
      
      * Add comment about conversion to float32 before returning the numpy data
      
      * Add comment about AMP-bfloat16 incompatibility
      
      * Fix formatting
      
      * typo
      
      * reformer / bf16
      
      * cleanup
      
      * require at least pt-1.10
      
      * fix
      
      * will deal with deepspeed separately
      
      * cleanup
      
      * revert
      
      * cleanup
      
      * fp16_full_eval and bf16_full_eval are separate modes
      
      * proper deprecation
      
      * cleanup
      
      * test and fixes
      
      * spelling
      
      * cleanup
      
      * add a note that this API is experimental
      Co-authored-by: jamie <jamie@cortx.com>
      Co-authored-by: Stas Bekman <stas@stason.org>
      Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
      Co-authored-by: suriya <suriya@cortx.com>
      Co-authored-by: Manuel R. Ciosici <manuelrciosici@gmail.com>
  6. 30 Nov, 2021 5 commits