1. 06 Feb, 2024 1 commit
  2. 02 Feb, 2024 1 commit
    • Klaus Hipp's avatar
      [Docs] Fix spelling and grammar mistakes (#28825) · 721ee783
      Klaus Hipp authored
      * Fix typos and grammar mistakes in docs and examples
      
      * Fix typos in docstrings and comments
      
      * Fix spelling of `tokenizer` in model tests
      
      * Remove erroneous spaces in decorators
      
      * Remove extra spaces in Markdown link texts
      721ee783
  3. 11 Jan, 2024 1 commit
  4. 04 Jan, 2024 1 commit
  5. 27 Nov, 2023 1 commit
  6. 17 Nov, 2023 1 commit
  7. 02 Oct, 2023 1 commit
  8. 27 Sep, 2023 1 commit
  9. 16 Aug, 2023 1 commit
  10. 27 Jul, 2023 1 commit
    • Sanchit Gandhi's avatar
      Add bloom flax (#25094) · e9310363
      Sanchit Gandhi authored
      
      
      * First commit
      
      * step 1 working
      
      * add alibi
      
      * placeholder for `scan`
      
      * add matrix mult alibi
      
      * beta scaling factor for bmm
      
      * working v1 - simple forward pass
      
      * move layer_number from attribute to arg in call
      
      * partial functioning scan
      
      * hacky working scan
      
      * add more modifs
      
      * add test
      
      * update scan for new kwarg order
      
      * fix position_ids problem
      
      * fix bug in attention layer
      
      * small fix
      
      - do the alibi broadcasting only once
      
      * prelim refactor
      
      * finish refactor
      
      * alibi shifting
      
      * incorporate dropout_add to attention module
      
      * make style
      
      * make padding work again
      
      * update
      
      * remove bogus file
      
      * up
      
      * get generation to work
      
      * clean code a bit
      
      * added small tests
      
      * adding albii test
      
      * make CI tests pass:
      
      - change init weight
      - add correct tuple for output attention
      - add scan test
      - make CI tests work
      
      * fix few nits
      
      * fix nit onnx
      
      * fix onnx nit
      
      * add missing dtype args to nn.Modules
      
      * remove debugging statements
      
      * fix scan generate
      
      * Update modeling_flax_bloom.py
      
      * Update test_modeling_flax_bloom.py
      
      * Update test_modeling_flax_bloom.py
      
      * Update test_modeling_flax_bloom.py
      
      * fix small test issue + make style
      
      * clean up
      
      * Update tests/models/bloom/test_modeling_flax_bloom.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * fix function name
      
      * small fix test
      
      * forward contrib credits from PR17761
      
      * Fix failing test
      
      * fix small typo documentation
      
      * fix non passing test
      
      - remove device from build alibi
      
      * refactor call
      
      - refactor `FlaxBloomBlockCollection` module
      
      * make style
      
      * upcast to fp32
      
      * cleaner way to upcast
      
      * remove unused args
      
      * remove layer number
      
      * fix scan test
      
      * make style
      
      * fix i4 casting
      
      * fix slow test
      
      * Update src/transformers/models/bloom/modeling_flax_bloom.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * remove `layer_past`
      
      * refactor a bit
      
      * fix `scan` slow test
      
      * remove useless import
      
      * major changes
      
      - remove unused code
      - refactor a bit
      - revert import `torch`
      
      * major refactoring
      
      - change build alibi
      
      * remove scan
      
      * fix tests
      
      * make style
      
      * clean-up alibi
      
      * add integration tests
      
      * up
      
      * fix batch norm conversion
      
      * style
      
      * style
      
      * update pt-fx cross tests
      
      * update copyright
      
      * Update src/transformers/modeling_flax_pytorch_utils.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * per-weight check
      
      * style
      
      * line formats
      
      ---------
      Co-authored-by: default avataryounesbelkada <younesbelkada@gmail.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarYounes Belkada <49240599+younesbelkada@users.noreply.github.com>
      Co-authored-by: default avatarhaileyschoelkopf <haileyschoelkopf@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      e9310363
  11. 03 Jul, 2023 1 commit
    • Arthur's avatar
      [`Umt5`] Add google's umt5 to `transformers` (#24477) · 799df10a
      Arthur authored
      
      
      * add tokenization template
      
      * update conversion script
      
      * update modeling code
      
      * update
      
      * update convert checkpoint
      
      * update modeling
      
      * revert changes on convert script
      
      * new conversion script for new format
      
      * correct position bias
      
      * cleaning a bit
      
      * Credit co authors
      Co-authored-by: default avataragemagician <ahmed.elnaggar@tum.de>
      
      Co-authored-by: stefan-it
      <>
      
      * styling
      
      * Add docq
      
      * fix copies
      
      * add co author
      
      * Other Author
      
      * Merge branch 'main' of https://github.com/huggingface/transformers
      
       into add-umt5
      
      * add testing
      
      * nit
      
      * Update docs/source/en/model_doc/umt5.mdx
      Co-authored-by: default avatarStefan Schweter <stefan@schweter.it>
      
      * fix t5
      
      * actual fix?
      
      * revert wrong changes
      
      * remove
      
      * update test
      
      * more fixes
      
      * revert some changes
      
      * add SPIECE_UNDERLINE
      
      * add a commone xample
      
      * upfate
      
      * fix copies
      
      * revert changes on t5 conversion script
      
      * revert bytefallback changes since there was no addition yet
      
      * fixup
      
      * fixup
      
      * ingore umt5 cutom testing folder
      
      * fix readmes
      
      * revertT5 changes
      
      * same outputs
      
      * fixup
      
      * update example
      
      * Apply suggestions from code review
      
      * style
      
      * draft addition of all new files
      
      * current update
      
      * fix attention and stuff
      
      * finish refactoring
      
      * auto config
      
      * fixup
      
      * more nits
      
      * add umt5 to init
      
      * use md format
      
      * Update README.md
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * revert changes on mt5
      
      * revert mt4 changes
      
      * update test
      
      * more fixes
      
      * add to mapping
      
      * fix-copies
      
      * fix copies
      
      * foix retain grad
      
      * fix some tests
      
      * nits
      
      * done
      
      * Update src/transformers/models/umt5/modeling_umt5.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update docs/source/en/model_doc/umt5.md
      
      * Update src/transformers/models/umt5/__init__.py
      
      * Update docs/source/en/model_doc/umt5.md
      Co-authored-by: default avatarStefan Schweter <stefan@schweter.it>
      
      * Update src/transformers/models/umt5/modeling_umt5.py
      
      * update conversion script + use google checkpoints
      
      * nits
      
      * update test and modelling
      
      * stash slow convert
      
      * update fixupd
      
      * don't change slow
      
      ---------
      
      Co-authored-by: stefan-it <>
      Co-authored-by: default avatarStefan Schweter <stefan@schweter.it>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      799df10a
  12. 29 Jun, 2023 1 commit
    • amyeroberts's avatar
      Update old existing feature extractor references (#24552) · ae454f41
      amyeroberts authored
      * Update old existing feature extractor references
      
      * Typo
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      * Address comments from review - update 'feature extractor'
      Co-authored by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
      ae454f41
  13. 20 Jun, 2023 1 commit
  14. 04 Apr, 2023 1 commit
    • Shubhamai's avatar
      Flax Regnet (#21867) · 90067748
      Shubhamai authored
      * initial commit
      
      * review changes
      
      * post model PR merge
      
      * updating doc
      90067748
  15. 24 Mar, 2023 1 commit
    • Shubhamai's avatar
      Resnet flax (#21472) · a0cbbba3
      Shubhamai authored
      
      
      * [WIP] flax resnet
      
      * added pretrained flax models, results reproducible
      
      * Added pretrained flax models, results reproducible
      
      * working on tests
      
      * no real code change, just some comments
      
      * [flax] adding support for batch norm layers
      
      * fixing bugs related to pt+flax integration
      
      * removing loss from modeling flax output class
      
      * fixing classifier tests
      
      * fixing comments, model output
      
      * cleaning comments
      
      * review changes
      
      * review changes
      
      * Apply suggestions from code review
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * renaming Flax to PyTorch
      
      ---------
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      a0cbbba3
  16. 14 Mar, 2023 1 commit
  17. 01 Mar, 2023 1 commit
    • Alara Dirik's avatar
      Add ALIGN to transformers (#21741) · 269b0549
      Alara Dirik authored
      Adds the ALIGN model to transformers. ALIGN is introduced in "Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision" by Chao Jia, Yinfei Yang, Ye Xia, Yi-Ting Chen, Zarana Parekh, Hieu Pham, Quoc V. Le, Yunhsuan Sung, Zhen Li, Tom Duerig.
      269b0549
  18. 20 Feb, 2023 2 commits
    • Alara Dirik's avatar
      Add EfficientNet (#21563) · 49ab1623
      Alara Dirik authored
      * Add EfficientNet to transformers
      49ab1623
    • tanreinama's avatar
      add GPTSAN model (reopen) (#21291) · f56174ac
      tanreinama authored
      * add GPTSAN-Japanese
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN (update for review)
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * fix typo in comment text
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * add GPTSAN
      
      * fix document and comments
      
      * fix class name GPTSAN->GPTSan
      
      * fix import and test for tokenizer
      f56174ac
  19. 07 Feb, 2023 1 commit
    • Stefan Schweter's avatar
      Add XLM-V to Model Doc (#21498) · 7e51a441
      Stefan Schweter authored
      * doc: introduce new section for XLM-V model
      
      * doc: mention more details for XLM-V integration
      
      * docs: paper abstract in italics, model identifier for base model added
      
      * doc: mention new XLM-V support
      
      * auto: add XLM-V mapping
      
      * doc: run make fix-copies ;)
      7e51a441
  20. 06 Feb, 2023 1 commit
  21. 19 Jan, 2023 1 commit
    • Jitesh Jain's avatar
      Add OneFormer Model (#20577) · 5b949623
      Jitesh Jain authored
      * Add Oneformer Model
      
      * Add OneFormer Tests
      
      * Add UNIVERSAL_SEGMENTATION_MAPPING
      
      * Fix config
      
      * 馃悰 Fix error encountered while writing tests
      
      * 馃敤 Fix instance segmentation post processing
      
      * Format Files and Add Documentation
      
      * Add Documentation mdx file
      
      * Run make fixup
      
      * Run make fix-copies
      
      * Remove unnecessary code
      
      * Format modeling_oneformer.py
      
      * Add OneFormer to ImageSegmentationPipeline
      
      * Format files
      
      * Add Demo link to Readme
      
      * Fix fomatting errors
      
      * Fix test failures
      
      * Update Table in index.mdx
      
      * Fix version
      
      * Fix style
      
      * Remove OneFormer from TF
      
      * Fix Imports
      
      * Fix dummy objects
      
      * Fix tests
      
      * Add newline
      
      * Remove OneFormerFeatureExtractor
      
      * Remove CUDA Kernels
      
      * Use AutoBackbone for Swin
      
      * Fix description
      
      * Use Image Processor
      
      * Fix copies
      
      * Fix formatting
      
      * Fix import order
      
      * Fix flake8 errors
      
      * Fix doc errors
      
      * Add Hindi Readme entry
      
      * Update supported backbones
      
      * Update supported backbones
      
      * Undo Changes
      
      * Fix type of config
      
      * Fix isort
      
      * Fix auto.mdx
      
      * Fix swin config
      
      * Replace DinatBackbone with AutoBackbone
      
      * Use SwinBackbone
      
      * Use SwinBackbone
      
      * Fix conversion script
      
      * Fix arguments
      
      * Add argument description
      
      * Fix style
      
      * Add OneFormerProcessor
      
      * Fix OneFormerProcessor Tests
      
      * Fix mapping
      
      * Fix imports
      
      * Fix inits
      
      * Fix style
      
      * Fix comment
      
      * Fix docstring
      
      * Move OneFormer to MultiModal
      
      * Fix Copies
      
      * Remove size divisor
      
      * Fix check_repo.py
      
      * Fix copies
      
      * Add Processor for Testing Pipeline
      
      * Fix padding for tokens
      
      * Fix variables
      
      * Fix formatting with correct black version
      
      * Add Image Processor Test
      
      * Apply suggestions
      
      * Revert common modeling
      
      * Add check for task
      
      * Fix conversion script
      
      * Fix initialization order
      
      * Fix tests
      
      * Undo Pipeline Changes
      
      * Fix layers in MLP
      
      * Fix copies
      
      * Update image paths
      
      * Fix copies
      
      * Apply suggestions
      5b949623
  22. 16 Jan, 2023 1 commit
  23. 12 Dec, 2022 1 commit
  24. 21 Nov, 2022 1 commit
  25. 15 Nov, 2022 1 commit
  26. 09 Nov, 2022 1 commit
  27. 23 Sep, 2022 1 commit
  28. 20 Sep, 2022 1 commit
  29. 16 Sep, 2022 1 commit
  30. 08 Sep, 2022 1 commit
  31. 01 Sep, 2022 1 commit
  32. 11 Aug, 2022 1 commit
    • flozi00's avatar
      german docs translation (#18544) · 5d3f0374
      flozi00 authored
      * Create _config.py
      
      * Create _toctree.yml
      
      * Create index.mdx
      
      not sure about "du / ihr" oder "sie"
      
      * Create quicktour.mdx
      
      * Update _toctree.yml
      
      * Update build_documentation.yml
      
      * Update build_pr_documentation.yml
      
      * fix build
      
      * Update index.mdx
      
      * Update quicktour.mdx
      
      * Create installation.mdx
      
      * Update _toctree.yml
      5d3f0374