1. 08 Dec, 2021 1 commit
  2. 22 Nov, 2021 1 commit
  3. 26 Oct, 2021 1 commit
  4. 21 Sep, 2021 1 commit
  5. 07 Sep, 2021 1 commit
  6. 02 Sep, 2021 1 commit
    • Nathan Raw's avatar
      Add PyTorch image classification example (#13134) · 76c4d8bf
      Nathan Raw authored
      *  add pytorch image classification example
      
      * 🔥 remove utils.py
      
      * 💄 fix flake8 style issues
      
      * 🔥 remove unnecessary line
      
      *  limit dataset sizes
      
      * 📌 update reqs
      
      * 🎨 restructure - use datasets lib
      
      * 🎨 import transforms directly
      
      * 📝 add comments
      
      * 💄 style
      
      * 🔥 remove flag
      
      * 📌 update requirement warning
      
      * 📝 add vision README.md
      
      * 📝 update README.md
      
      * 📝 update README.md
      
      * 🎨 add image-classification tag to model card
      
      * 🚚 rename vision ️ image-classification
      
      * 📝 update image-classification README.md
      76c4d8bf
  7. 09 Jun, 2021 1 commit
    • NielsRogge's avatar
      Add DETR (#11653) · d3eacbb8
      NielsRogge authored
      
      
      * Squash all commits of modeling_detr_v7 branch into one
      
      * Improve docs
      
      * Fix tests
      
      * Style
      
      * Improve docs some more and fix most tests
      
      * Fix slow tests of ViT, DeiT and DETR
      
      * Improve replacement of batch norm
      
      * Restructure timm backbone forward
      
      * Make DetrForSegmentation support any timm backbone
      
      * Fix name of output
      
      * Address most comments by @LysandreJik
      
      * Give better names for variables
      
      * Conditional imports + timm in setup.py
      
      * Address additional comments by @sgugger
      
      * Make style, add require_timm and require_vision to testsé
      
      * Remove train_backbone attribute of DetrConfig, add methods to freeze/unfreeze backbone
      
      * Add png files to fixtures
      
      * Fix type hint
      
      * Add timm to workflows
      
      * Add `BatchNorm2d` to the weight initialization
      
      * Fix retain_grad test
      
      * Replace model checkpoints by Facebook namespace
      
      * Fix name of checkpoint in test
      
      * Add user-friendly message when scipy is not available
      
      * Address most comments by @patrickvonplaten
      
      * Remove return_intermediate_layers attribute of DetrConfig and simplify Joiner
      
      * Better initialization
      
      * Scipy is necessary to get sklearn metrics
      
      * Rename TimmBackbone to DetrTimmConvEncoder and rename DetrJoiner to DetrConvModel
      
      * Make style
      
      * Improve docs and add 2 community notebooks
      Co-authored-by: default avatarLysandre <lysandre.debut@reseau.eseo.fr>
      d3eacbb8
  8. 12 May, 2021 1 commit
  9. 07 May, 2021 1 commit
  10. 21 Apr, 2021 1 commit
  11. 06 Apr, 2021 1 commit
    • Sylvain Gugger's avatar
      Auto feature extractor (#11097) · 403d530e
      Sylvain Gugger authored
      * AutoFeatureExtractor
      
      * Init and first tests
      
      * Tests
      
      * Damn you gitignore
      
      * Quality
      
      * Defensive test for when not all backends are here
      
      * Use pattern for Speech2Text models
      403d530e
  12. 13 Feb, 2021 1 commit
    • Nicolas Patry's avatar
      Conversion from slow to fast for BPE spm vocabs contained an error. (#10120) · c9837a0d
      Nicolas Patry authored
      * Conversion from slow to fast for BPE spm vocabs contained an error.
      
      - There is only 1 test currently (tokenizers + slow) that used the modified path
      and it's reformer, which does not contain any ids modification so the
      bug was silent for now.
      - The real issue is that vocab variable was overloaded by
      SentencePieceExtractor, leading to Slow specific vocab oddities to be
      completely ignored
      - The bug was reported here https://github.com/huggingface/transformers/issues/9518
      - Ran the complete tokenization test suite with slow without error
      (`RUN_SLOW=1 pytest -sv tests/test_tokenization_*`)
      
      * Remove rebase error.
      
      * Adding the fixture.
      c9837a0d
  13. 19 Jan, 2021 1 commit
  14. 18 Dec, 2020 1 commit
  15. 08 Dec, 2020 1 commit
  16. 09 Nov, 2020 1 commit
  17. 22 Oct, 2020 1 commit
  18. 18 Oct, 2020 1 commit
    • Thomas Wolf's avatar
      [Dependencies|tokenizers] Make both SentencePiece and Tokenizers optional dependencies (#7659) · ba8c4d0a
      Thomas Wolf authored
      * splitting fast and slow tokenizers [WIP]
      
      * [WIP] splitting sentencepiece and tokenizers dependencies
      
      * update dummy objects
      
      * add name_or_path to models and tokenizers
      
      * prefix added to file names
      
      * prefix
      
      * styling + quality
      
      * spliting all the tokenizer files - sorting sentencepiece based ones
      
      * update tokenizer version up to 0.9.0
      
      * remove hard dependency on sentencepiece 🎉
      
      * and removed hard dependency on tokenizers 🎉
      
      
      
      * update conversion script
      
      * update missing models
      
      * fixing tests
      
      * move test_tokenization_fast to main tokenization tests - fix bugs
      
      * bump up tokenizers
      
      * fix bert_generation
      
      * update ad fix several tokenizers
      
      * keep sentencepiece in deps for now
      
      * fix funnel and deberta tests
      
      * fix fsmt
      
      * fix marian tests
      
      * fix layoutlm
      
      * fix squeezebert and gpt2
      
      * fix T5 tokenization
      
      * fix xlnet tests
      
      * style
      
      * fix mbart
      
      * bump up tokenizers to 0.9.2
      
      * fix model tests
      
      * fix tf models
      
      * fix seq2seq examples
      
      * fix tests without sentencepiece
      
      * fix slow => fast  conversion without sentencepiece
      
      * update auto and bert generation tests
      
      * fix mbart tests
      
      * fix auto and common test without tokenizers
      
      * fix tests without tokenizers
      
      * clean up tests lighten up when tokenizers + sentencepiece are both off
      
      * style quality and tests fixing
      
      * add sentencepiece to doc/examples reqs
      
      * leave sentencepiece on for now
      
      * style quality split hebert and fix pegasus
      
      * WIP Herbert fast
      
      * add sample_text_no_unicode and fix hebert tokenization
      
      * skip FSMT example test for now
      
      * fix style
      
      * fix fsmt in example tests
      
      * update following Lysandre and Sylvain's comments
      
      * Update src/transformers/testing_utils.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/testing_utils.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/tokenization_utils_base.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/tokenization_utils_base.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      ba8c4d0a
  19. 09 Oct, 2020 1 commit
  20. 10 Sep, 2020 1 commit
    • Yu Liu's avatar
      Albert pretrain datasets/ datacollator (#6168) · 762cba3b
      Yu Liu authored
      
      
      * add dataset for albert pretrain
      
      * datacollator for albert pretrain
      
      * naming, comprehension, file reading change
      
      * data cleaning is no needed after this modification
      
      * delete prints
      
      * fix a bug
      
      * file structure change
      
      * add tests for albert datacollator
      
      * remove random seed
      
      * add back len and get item function
      
      * sample file for testing and test code added
      
      * format change for black
      
      * more format change
      
      * Style
      
      * var assignment issue resolve
      
      * add back wrongly deleted DataCollatorWithPadding in init file
      
      * Style
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      Co-authored-by: default avatarLysandre <lysandre.debut@reseau.eseo.fr>
      762cba3b
  21. 07 May, 2020 1 commit
    • Julien Chaumond's avatar
      BIG Reorganize examples (#4213) · 0ae96ff8
      Julien Chaumond authored
      * Created using Colaboratory
      
      * [examples] reorganize files
      
      * remove run_tpu_glue.py as superseded by TPU support in Trainer
      
      * Bugfix: int, not tuple
      
      * move files around
      0ae96ff8
  22. 11 Jan, 2020 1 commit
  23. 06 Jan, 2020 2 commits
  24. 22 Dec, 2019 1 commit