1. 10 Sep, 2020 2 commits
  2. 02 Sep, 2020 1 commit
  3. 31 Aug, 2020 1 commit
  4. 26 Aug, 2020 1 commit
  5. 24 Aug, 2020 1 commit
  6. 20 Aug, 2020 1 commit
    • Sylvain Gugger's avatar
      Trainer automatically drops unused columns in nlp datasets (#6449) · e5f45227
      Sylvain Gugger authored
      * Add a classmethod to easily build a Trainer from nlp dataset and metric
      
      * Fix docstrings
      
      * Split train/eval
      
      * Formatting
      
      * Log dropped columns + docs
      
      * Authorize callable activations
      
      * Poc for auto activation
      
      * Be framework-agnostic
      
      * Formatting
      
      * Remove class method
      
      * Remove unnecessary code
      e5f45227
  7. 05 Aug, 2020 1 commit
    • Sylvain Gugger's avatar
      Tf model outputs (#6247) · c67d1a02
      Sylvain Gugger authored
      * TF outputs and test on BERT
      
      * Albert to DistilBert
      
      * All remaining TF models except T5
      
      * Documentation
      
      * One file forgotten
      
      * TF outputs and test on BERT
      
      * Albert to DistilBert
      
      * All remaining TF models except T5
      
      * Documentation
      
      * One file forgotten
      
      * Add new models and fix issues
      
      * Quality improvements
      
      * Add T5
      
      * A bit of cleanup
      
      * Fix for slow tests
      
      * Style
      c67d1a02
  8. 30 Jul, 2020 1 commit
    • Sylvain Gugger's avatar
      Switch from return_tuple to return_dict (#6138) · 91cb9546
      Sylvain Gugger authored
      
      
      * Switch from return_tuple to return_dict
      
      * Fix test
      
      * [WIP] Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleC… (#5614)
      
      * Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleChoice} models and tests
      
      * AutoModels
      
      
      Tiny tweaks
      
      * Style
      
      * Final changes before merge
      
      * Re-order for simpler review
      
      * Final fixes
      
      * Addressing @sgugger's comments
      
      * Test MultipleChoice
      
      * Rework TF trainer (#6038)
      
      * Fully rework training/prediction loops
      
      * fix method name
      
      * Fix variable name
      
      * Fix property name
      
      * Fix scope
      
      * Fix method name
      
      * Fix tuple index
      
      * Fix tuple index
      
      * Fix indentation
      
      * Fix variable name
      
      * fix eval before log
      
      * Add drop remainder for test dataset
      
      * Fix step number + fix logging datetime
      
      * fix eval loss value
      
      * use global step instead of step + fix logging at step 0
      
      * Fix logging datetime
      
      * Fix global_step usage
      
      * Fix breaking loop + logging datetime
      
      * Fix step in prediction loop
      
      * Fix step breaking
      
      * Fix train/test loops
      
      * Force TF at least 2.2 for the trainer
      
      * Use assert_cardinality to facilitate the dataset size computation
      
      * Log steps per epoch
      
      * Make tfds compliant with TPU
      
      * Make tfds compliant with TPU
      
      * Use TF dataset enumerate instead of the Python one
      
      * revert previous commit
      
      * Fix data_dir
      
      * Apply style
      
      * rebase on master
      
      * Address Sylvain's comments
      
      * Address Sylvain's and Lysandre comments
      
      * Trigger CI
      
      * Remove unused import
      
      * Switch from return_tuple to return_dict
      
      * Fix test
      
      * Add recent model
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      Co-authored-by: default avatarJulien Plu <plu.julien@gmail.com>
      91cb9546
  9. 27 Jul, 2020 1 commit
  10. 21 Jul, 2020 1 commit
  11. 10 Jul, 2020 2 commits
    • Sylvain Gugger's avatar
      Document model outputs (#5673) · 7fad617d
      Sylvain Gugger authored
      
      
      * Document model outputs
      
      * Update docs/source/main_classes/output.rst
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      7fad617d
    • Sylvain Gugger's avatar
      Change model outputs types to self-document outputs (#5438) · edfd82f5
      Sylvain Gugger authored
      * [WIP] Proposal for model outputs
      
      * All Bert models
      
      * Make CI green maybe?
      
      * Fix ONNX test
      
      * Isolate ModelOutput from pt and tf
      
      * Formatting
      
      * Add Electra models
      
      * Auto-generate docstrings from outputs
      
      * Add TF outputs
      
      * Add some BERT models
      
      * Revert TF side
      
      * Remove last traces of TF changes
      
      * Fail with a clear error message
      
      * Add Albert and work through Bart
      
      * Add CTRL and DistilBert
      
      * Formatting
      
      * Progress on Bart
      
      * Renames and finish Bart
      
      * Formatting
      
      * Fix last test
      
      * Add DPR
      
      * Finish Electra and add FlauBERT
      
      * Add GPT2
      
      * Add Longformer
      
      * Add MMBT
      
      * Add MobileBert
      
      * Add GPT
      
      * Formatting
      
      * Add Reformer
      
      * Add Roberta
      
      * Add T5
      
      * Add Transformer XL
      
      * Fix test
      
      * Add XLM + fix XLMForTokenClassification
      
      * Style + XLMRoberta
      
      * Add XLNet
      
      * Formatting
      
      * Add doc of return_tuple arg
      edfd82f5
  12. 26 Jun, 2020 1 commit
  13. 25 Jun, 2020 1 commit
  14. 23 Jun, 2020 1 commit
  15. 22 Jun, 2020 1 commit
    • Patrick von Platen's avatar
      Benchmarks (#4912) · fa0be6d7
      Patrick von Platen authored
      * finish benchmark
      
      * fix isort
      
      * fix setup cfg
      
      * retab
      
      * fix time measuring of tf graph mode
      
      * fix tf cuda
      
      * clean code
      
      * better error message
      fa0be6d7
  16. 10 Jun, 2020 1 commit
  17. 09 Jun, 2020 1 commit
    • Patrick von Platen's avatar
      [Benchmark] add tpu and torchscipt for benchmark (#4850) · 2cfb947f
      Patrick von Platen authored
      
      
      * add tpu and torchscipt for benchmark
      
      * fix name in tests
      
      * "fix email"
      
      * make style
      
      * better log message for tpu
      
      * add more print and info for tpu
      
      * allow possibility to print tpu metrics
      
      * correct cpu usage
      
      * fix test for non-install
      
      * remove bugus file
      
      * include psutil in testing
      
      * run a couple of times before tracing in torchscript
      
      * do not allow tpu memory tracing for now
      
      * make style
      
      * add torchscript to env
      
      * better name for torch tpu
      Co-authored-by: default avatarPatrick von Platen <patrick@huggingface.co>
      2cfb947f
  18. 27 May, 2020 1 commit
    • Patrick von Platen's avatar
      [Benchmark] Memory benchmark utils (#4198) · 96f57c9c
      Patrick von Platen authored
      
      
      * improve memory benchmarking
      
      * correct typo
      
      * fix current memory
      
      * check torch memory allocated
      
      * better pytorch function
      
      * add total cached gpu memory
      
      * add total gpu required
      
      * improve torch gpu usage
      
      * update memory usage
      
      * finalize memory tracing
      
      * save intermediate benchmark class
      
      * fix conflict
      
      * improve benchmark
      
      * improve benchmark
      
      * finalize
      
      * make style
      
      * improve benchmarking
      
      * correct typo
      
      * make train function more flexible
      
      * fix csv save
      
      * better repr of bytes
      
      * better print
      
      * fix __repr__ bug
      
      * finish plot script
      
      * rename plot file
      
      * delete csv and small improvements
      
      * fix in plot
      
      * fix in plot
      
      * correct usage of timeit
      
      * remove redundant line
      
      * remove redundant line
      
      * fix bug
      
      * add hf parser tests
      
      * add versioning and platform info
      
      * make style
      
      * add gpu information
      
      * ensure backward compatibility
      
      * finish adding all tests
      
      * Update src/transformers/benchmark/benchmark_args.py
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      
      * Update src/transformers/benchmark/benchmark_args_utils.py
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      
      * delete csv files
      
      * fix isort ordering
      
      * add out of memory handling
      
      * add better train memory handling
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      96f57c9c
  19. 11 May, 2020 1 commit
    • Bram Vanroy's avatar
      Simplify cache vars and allow for TRANSFORMERS_CACHE env (#4226) · 61d22f9c
      Bram Vanroy authored
      * simplify cache vars and allow for TRANSFORMERS_CACHE env
      
      As it currently stands, "TRANSFORMERS_CACHE" is not an accepted variable. It seems that the these variables were not updated when moving from version pytorch_transformers to transformers. In addition, the fallback procedure could be improved. and simplified. Pathlib seems redundant here.
      
      * Update file_utils.py
      61d22f9c
  20. 29 Apr, 2020 1 commit
    • Julien Chaumond's avatar
      CDN urls (#4030) · 455c6390
      Julien Chaumond authored
      * [file_utils] use_cdn + documentation
      
      * Move to cdn. urls for weights
      
      * [urls] Hotfix for bert-base-japanese
      455c6390
  21. 27 Apr, 2020 1 commit
  22. 22 Apr, 2020 1 commit
    • Julien Chaumond's avatar
      Trainer (#3800) · dd9d483d
      Julien Chaumond authored
      * doc
      
      * [tests] Add sample files for a regression task
      
      * [HUGE] Trainer
      
      * Feedback from @sshleifer
      
      * Feedback from @thomwolf + logging tweak
      
      * [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes
      
      * [glue] Use default max_seq_length of 128 like before
      
      * [glue] move DataTrainingArguments around
      
      * [ner] Change interface of InputExample, and align run_{tf,pl}
      
      * Re-align the pl scripts a little bit
      
      * ner
      
      * [ner] Add integration test
      
      * Fix language_modeling with API tweak
      
      * [ci] Tweak loss target
      
      * Don't break console output
      
      * amp.initialize: model must be on right device before
      
      * [multiple-choice] update for Trainer
      
      * Re-align to 827d6d6e
      dd9d483d
  23. 09 Apr, 2020 1 commit
  24. 24 Feb, 2020 1 commit
  25. 06 Feb, 2020 2 commits
  26. 23 Jan, 2020 4 commits
  27. 20 Jan, 2020 3 commits
  28. 16 Jan, 2020 1 commit
  29. 13 Jan, 2020 1 commit
  30. 07 Jan, 2020 2 commits
  31. 06 Jan, 2020 1 commit