1. 19 Jan, 2023 1 commit
    • Clémentine Fourrier's avatar
      Graphormer model for Graph Classification (#20968) · 87208a05
      Clémentine Fourrier authored
      
      
      * [FT] First commit for graphormer architecture.
      
      The model has no tokenizer, as it uses a collator and preprocessing function for its input management.
      Architecture to be tested against original one.
      The arch might need to be changed to fit the checkpoint, but a revert to the original arch will make the code less nice to read.
      TODO: doc
      
      * [FIX] removed test model
      
      * [FIX] import error
      
      * [FIX] black and flake
      
      * [DOC] added paper refs
      
      * [FIX] [DOC]
      
      * [FIX] black
      
      * [DOC] Updated READMEs
      
      * [FIX] Order of imports + rm Tokenizer calls
      
      * [FIX] Moved assert in class to prevent doc build failure
      
      * [FIX] make fix-copies
      
      * [Doc] update from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * [FIX] Removed Graphormer from Sequence classification model list
      
      * [DOC] Added HF copyright to Cython file
      
      * [DOC] Fixed comments
      
      * [FIX] typos in class doc + removed config classes.
      
      Todo: update doc from paper definitions
      
      * [FIX] Removed dependency to fairseq, and replaced all asserts with Exception management
      
      * [FIX] Homogeneized initialization of weights to pretrained constructor
      
      * [FIX] [CP] Updated multi_hop parameter to get same results as in original implementation
      
      * [DOC] Relevant parameter description in the configuration file
      
      * [DOC] Updated doc and comments in main graphormer file
      
      * [FIX] make style and quality checks
      
      * [DOC] Fix doc format
      
      * [FIX] [WIP] Updated part of the tests, though still a wip
      
      * [FIX] [WIP]
      
      * [FIX] repo consistency
      
      * [FIX] Changed input names for more understandability
      
      * [FIX] [BUG] updated num_classes params for propagation in the model
      
      * simplified collator
      
      * [FIX] Updated tests to follow new naming pattern
      
      * [TESTS] Updated test suite along with model
      
      * |FIX] rm tokenizer import
      
      * [DOC] add link to graphormerdoc
      
      * Changed section in doc from text model to graph model
      
      * Apply suggestions from code review
      
      Spacing, inits
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * [DOC] Explain algos_graphormer functions
      
      * Cython soft import protection
      
      * Rm call to Callable in configuration graphormer
      
      * [FIX] replaced asserts with Exceptions
      
      * Add org to graphormer checkpoints
      
      * Prefixed classes with Graphormer
      
      * Management of init functions
      
      * format
      
      * fixes
      
      * fix length file
      
      * update indent
      
      * relaunching ci
      
      * Errors for missing cython imports
      
      * fix style
      
      * fix style doc
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      87208a05
  2. 18 Jan, 2023 1 commit
    • jeffhataws's avatar
      Add AWS Neuron torchrun support (#20806) · c59d71b2
      jeffhataws authored
      * Add XLA torchrun support
      
      * Clarify that currently DDP doesn't work with torch.distributed XLA backend yet
      
      * Enable DDP with torchrun and XLA (now available in PT-XLA 1.13)
      
      * Add check for AWS Neuron availability and AWS Neuron specific compiler flag
      
      * Change the new test's name to TestTrainerDistributedNeuronCore
      
      * Remove "assert" and replace raised exception
      
      * Remove compiler flag as it is optional. If needed, will be another PR.
      
      * Use TORCHELASTIC_RUN_ID to determine whether torchrun is used
      c59d71b2
  3. 08 Dec, 2022 1 commit
    • Nathan Raw's avatar
      Add video classification pipeline (#20151) · 9e56aff5
      Nathan Raw authored
      * 🚧 wip video classification pipeline
      
      * 🚧 wip - add is_decord_available check
      
      * 🐛 add missing import
      
      *  add tests
      
      * 🔧 add decord to setup extras
      
      * 🚧 add is_decord_available
      
      *  add video-classification pipeline
      
      * 📝 add video classification pipe to docs
      
      * 🐛 add missing VideoClassificationPipeline import
      
      * 📌 add decord install in test runner
      
      *  fix url inputs to video-classification pipeline
      
      *  updates from review
      
      * 📝 add video cls pipeline to docs
      
      * 📝 add docstring
      
      * 🔥 remove unused import
      
      * 🔥 remove some code
      
      * 📝 docfix
      9e56aff5
  4. 30 Nov, 2022 1 commit
  5. 18 Nov, 2022 1 commit
    • Ali Hassani's avatar
      Add Neighborhood Attention Transformer (NAT) and Dilated NAT (DiNAT) models (#20219) · fc4a993e
      Ali Hassani authored
      * Add DiNAT
      
      * Adds DiNAT + tests
      
      * Minor fixes
      
      * Added HF model
      
      * Add natten to dependencies.
      
      * Cleanup
      
      * Minor fixup
      
      * Reformat
      
      * Optional NATTEN import.
      
      * Reformat & add doc to _toctree
      
      * Reformat (finally)
      
      * Dummy objects for DiNAT
      
      * Add NAT + minor changes
      
      Adds NAT as its own independent model + docs, tests
      Adds NATTEN to ext deps to ensure ci picks it up.
      
      * Remove natten from `all` and `dev-torch` deps, add manual pip install to ci tests
      
      * Minor fixes.
      
      * Fix READMEs.
      
      * Requested changes to docs + minor fixes.
      
      * Requested changes.
      
      * Add NAT/DiNAT tests to layoutlm_job
      
      * Correction to Dinat doc.
      
      * Requested changes.
      fc4a993e
  6. 15 Nov, 2022 1 commit
  7. 14 Nov, 2022 1 commit
  8. 21 Oct, 2022 1 commit
  9. 18 Oct, 2022 1 commit
  10. 12 Oct, 2022 1 commit
  11. 07 Oct, 2022 1 commit
    • Sylvain Gugger's avatar
      Rework pipeline tests (#19366) · 9ac586b3
      Sylvain Gugger authored
      * Rework pipeline tests
      
      * Try to fix Flax tests
      
      * Try to put it before
      
      * Use a new decorator instead
      
      * Remove ignore marker since it doesn't work
      
      * Filter pipeline tests
      
      * Woopsie
      
      * Use the fitlered list
      
      * Clean up and fake modif
      
      * Remove init
      
      * Revert fake modif
      9ac586b3
  12. 05 Oct, 2022 1 commit
    • r-terada's avatar
      Add sudachi and jumanpp tokenizers for bert_japanese (#19043) · 2f53ab57
      r-terada authored
      * add sudachipy and jumanpp tokenizers for bert_japanese
      
      * use ImportError instead of ModuleNotFoundError in SudachiTokenizer and JumanppTokenizer
      
      * put test cases of test_tokenization_bert_japanese in one line
      
      * add require_sudachi and require_jumanpp decorator for testing
      
      * add sudachi and pyknp(jumanpp) to dependencies
      
      * remove sudachi_dict_small and sudachi_dict_full from dependencies
      
      * empty commit for ci
      2f53ab57
  13. 30 Sep, 2022 2 commits
    • Sylvain Gugger's avatar
      Poc to use safetensors (#19175) · 3e2dd7f9
      Sylvain Gugger authored
      
      
      * Poc to use safetensors
      
      * Typo
      
      * Final version
      
      * Add tests
      
      * Save with the right name!
      
      * Update tests/test_modeling_common.py
      Co-authored-by: default avatarJulien Chaumond <julien@huggingface.co>
      
      * Support for sharded checkpoints
      
      * Test from Hub part 1
      
      * Test from hub part 2
      
      * Fix regular checkpoint sharding
      
      * Bump for fixes
      Co-authored-by: default avatarJulien Chaumond <julien@huggingface.co>
      3e2dd7f9
    • NielsRogge's avatar
      Add MarkupLM (#19198) · f3d2f7a6
      NielsRogge authored
      
      
      * First draft
      
      * Make basic test work
      
      * Fix most tokenizer tests
      
      * More improvements
      
      * Make more tests pass
      
      * Fix more tests
      
      * Fix some code quality
      
      * Improve truncation
      
      * Implement feature extractor
      
      * Improve feature extractor and add tests
      
      * Improve feature extractor tests
      
      * Fix pair_input test partly
      
      * Add fast tokenizer
      
      * Improve implementation
      
      * Fix rebase
      
      * Fix rebase
      
      * Fix most of the tokenizer tests.
      
      * propose solution for fast
      
      * add: integration test for fasttokenizer, warning for decode, fix template in slow tokenizer
      
      * add: modify markuplmconverter
      
      * add: some modify on converter and tokenizerfast
      
      * Fix style, copies
      
      * Make fixup
      
      * Update tokenization_markuplm.py
      
      * Update test_tokenization_markuplm.py
      
      * Update markuplm related
      
      * Improve processor, add integration test
      
      * Add processor test file
      
      * Improve processor
      
      * Improve processor tests
      
      * Fix more processor tests
      
      * Fix processor tests
      
      * Update docstrings
      
      * Add Copied from statements
      
      * Add more Copied from statements
      
      * Add code examples
      
      * Improve code examples
      
      * Add model to doc tests
      
      * Adding dependency check
      
      * Add dummy file
      
      * Add requires_backends
      
      * Add model to toctree
      
      * Fix more things, disable dependency check for now
      
      * Apply more suggestions
      
      * Add soft dependency
      
      * Add annotators to tests
      
      * Fix style
      
      * Remove from_slow=True
      
      * Remove print statements
      
      * Add sanity check
      
      * Fix processor test
      
      * Fix processor tests, add more docs
      
      * Add doc tests for mdx file
      
      * Add more tips
      
      * Apply suggestions
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      Co-authored-by: default avatarlockon-n <45759388+lockon-n@users.noreply.github.com>
      Co-authored-by: default avatarSaulLu <lucilesaul.com@gmail.com>
      Co-authored-by: default avatarlockon-n <dd098309@126.com>
      f3d2f7a6
  14. 22 Sep, 2022 1 commit
  15. 11 Aug, 2022 1 commit
  16. 10 Aug, 2022 1 commit
  17. 25 Jul, 2022 1 commit
  18. 21 Jul, 2022 1 commit
  19. 13 Jul, 2022 1 commit
    • Wei's avatar
      Enable torchdynamo with torch_tensorrt(fx path) (#17765) · 7ea6ccc2
      Wei authored
      
      
      * enable fx2trt
      
      * Update perf_train_gpu_one.mdx
      
      * Update perf_train_gpu_one.mdx
      
      * add lib check
      
      * update
      
      * format
      
      * update
      
      * fix import check
      
      * fix isort
      
      * improve doc
      
      * refactor ctx manager
      
      * fix isort
      
      * black format
      
      * isort fix
      
      * fix format
      
      * update args
      
      * update black
      
      * cleanups
      
      * Update perf_train_gpu_one.mdx
      
      * code refactor
      
      * code refactor to init
      
      * remove redundancy
      
      * isort
      
      * replace self.args with args
      Co-authored-by: default avatarStas Bekman <stas@stason.org>
      7ea6ccc2
  20. 12 Jul, 2022 1 commit
  21. 29 Jun, 2022 1 commit
  22. 27 Jun, 2022 1 commit
    • Matt's avatar
      Add a TF in-graph tokenizer for BERT (#17701) · ee0d001d
      Matt authored
      * Add a TF in-graph tokenizer for BERT
      
      * Add from_pretrained
      
      * Add proper truncation, option handling to match other tokenizers
      
      * Add proper imports and guards
      
      * Add test, fix all the bugs exposed by said test
      
      * Fix truncation of paired texts in graph mode, more test updates
      
      * Small fixes, add a (very careful) test for savedmodel
      
      * Add tensorflow-text dependency, make fixup
      
      * Update documentation
      
      * Update documentation
      
      * make fixup
      
      * Slight changes to tests
      
      * Add some docstring examples
      
      * Update tests
      
      * Update tests and add proper lowercasing/normalization
      
      * make fixup
      
      * Add docstring for padding!
      
      * Mark slow tests
      
      * make fixup
      
      * Fall back to BertTokenizerFast if BertTokenizer is unavailable
      
      * Fall back to BertTokenizerFast if BertTokenizer is unavailable
      
      * make fixup
      
      * Properly handle tensorflow-text dummies
      ee0d001d
  23. 24 Jun, 2022 1 commit
    • NielsRogge's avatar
      Improve vision models (#17731) · 09178705
      NielsRogge authored
      
      
      * Improve vision models
      
      * Add a lot of improvements
      
      * Remove to_2tuple from swin tests
      
      * Fix TF Swin
      
      * Fix more tests
      
      * Fix copies
      
      * Improve more models
      
      * Fix ViTMAE test
      
      * Add channel check for TF models
      
      * Add proper channel check for TF models
      
      * Apply suggestion from code review
      
      * Apply suggestions from code review
      
      * Add channel check for Flax models, apply suggestion
      
      * Fix bug
      
      * Add tests for greyscale images
      
      * Add test for interpolation of pos encodigns
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      09178705
  24. 21 Jun, 2022 1 commit
  25. 20 Jun, 2022 1 commit
  26. 08 Jun, 2022 1 commit
  27. 31 May, 2022 1 commit
  28. 25 May, 2022 1 commit
  29. 09 May, 2022 1 commit
  30. 29 Apr, 2022 2 commits
  31. 19 Apr, 2022 1 commit
  32. 15 Apr, 2022 1 commit
  33. 23 Mar, 2022 1 commit
    • Sylvain Gugger's avatar
      Reorganize file utils (#16264) · 4975002d
      Sylvain Gugger authored
      * Split file_utils in several submodules
      
      * Fixes
      
      * Add back more objects
      
      * More fixes
      
      * Who exactly decided to import that from there?
      
      * Second suggestion to code with code review
      
      * Revert wront move
      
      * Fix imports
      
      * Adapt all imports
      
      * Adapt all imports everywhere
      
      * Revert this import, will fix in a separate commit
      4975002d
  34. 15 Mar, 2022 1 commit
  35. 23 Feb, 2022 1 commit
  36. 02 Feb, 2022 1 commit
    • Ayush Chaurasia's avatar
      Add W&B backend for hyperparameter sweep (#14582) · c74f3d4c
      Ayush Chaurasia authored
      # Add support for W&B hyperparameter sweep
      This PR:
      * allows using wandb for running hyperparameter search.
      * The runs are visualized on W&B sweeps dashboard
      * This supports runnning sweeps on parallel devices, all reporting to the same central dashboard.
      
      ### Usage
      **To run new a hyperparameter search:**
      ```
      trainer.hyperparameter_search(
          backend="wandb", 
          project="transformers_sweep", # name of the project
          n_trials=5,
          metric="eval/loss", # metric to be optimized, default 'eval/loss'. A warning is raised if the passed metric is not found
      )
      ```
      This outputs a sweep id. Eg. `my_project/sweep_id`
      
      **To run sweeps on parallel devices:**
      Just pass sweep id which you want to run parallel
      ```
      trainer.hyperparameter_search(
          backend="wandb", 
          sweep_id = "my_project/sweep_id"
      )
      ```
      c74f3d4c
  37. 14 Jan, 2022 1 commit
  38. 10 Jan, 2022 1 commit
    • cody-moveworks's avatar
      Make OpenAIGPTTokenizer work with SpaCy 2.x and 3.x (#15019) · a54961c5
      cody-moveworks authored
      * Make OpenAIGPTTokenizer work with SpaCy 3.x
      
      SpaCy 3.x introduced an API change to creating the tokenizer that
      breaks OpenAIGPTTokenizer. The old API for creating the tokenizer in
      SpaCy 2.x no longer works under SpaCy 3.x, but the new API for creating
      the tokenizer in SpaCy 3.x DOES work under SpaCy 2.x. Switching to the
      new API should allow OpenAIGPTTokenizer to work under both SpaCy 2.x and
      SpaCy 3.x versions.
      
      * Add is_spacy_available and is_ftfy_available methods to file utils
      
      * Add spacy and ftfy unittest decorator to testing utils
      
      * Add tests for OpenAIGPTTokenizer that require spacy and ftfy
      
      * Modify CircleCI config to run tests that require spacy and ftfy
      
      * Remove unneeded unittest decorators are reuse test code
      
      * Run make fixup
      a54961c5