- 26 Aug, 2020 3 commits
-
Lysandre Debut authored
* Logging * Style * hf_logging > utils.logging * Address @thomwolf's comments * Update test * Update src/transformers/benchmark/benchmark_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Revert bad change Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Jay Yip authored
-
Patrick von Platen authored
* add tie_word_embeddings * correct word embeddings in modeling utils * make style * make config param only relevant for torch * make style * correct typo * delete deprecated arg in transfo-xl
-
- 25 Aug, 2020 11 commits
-
Sam Shleifer authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Tomo Lazovich authored
* [squad] make examples and dataset accessible from SquadDataset object * [squad] add support for legacy cache files
-
Funtowicz Morgan authored
-
Patrick von Platen authored
-
Funtowicz Morgan authored
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* More tests to Trainer * Add warning in the doc
-
Sylvain Gugger authored
-
Yohei Tamura authored
* add overload for type checker * black
-
Jay authored
-
Sylvain Gugger authored
-
- 24 Aug, 2020 8 commits
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Add optuna hyperparameter search to Trainer * @julien-c suggestions Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Make compute_objective an arg function * Formatting * Rework to make it easier to add ray * Formatting * Initial support for Ray * Formatting * Polish and finalize * Add trial id to checkpoint with Ray * Smaller default * Use GPU in ray if available * Formatting * Fix test * Update install instruction Co-authored-by: Richard Liaw <rliaw@berkeley.edu> * Address review comments * Formatting post-merge Co-authored-by: Julien Chaumond <chaumond@gmail.com> Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
-
Sylvain Gugger authored
* Run new isort * More changes * Update CI, CONTRIBUTING and benchmarks
-
Teven authored
* Fixed DataCollatorForLanguageModeling + PermutationLanguageModeling not accepting lists of lists * Update data_collator.py * black was grumpy
-
sgugger authored
-
Sylvain Gugger authored
* Don't reset the type of the dataset * Formatting * Update trainer.py Co-authored-by: Teven <teven.lescao@gmail.com>
-
Jared T Nielsen authored
-
- 21 Aug, 2020 2 commits
-
Suraj Patil authored
* added CamembertForCausalLM * add in __init__ and auto model * style * doc
-
josephrocca authored
-
- 20 Aug, 2020 8 commits
-
Sylvain Gugger authored
* Add a classmethod to easily build a Trainer from nlp dataset and metric * Fix docstrings * Split train/eval * Formatting * Log dropped columns + docs * Authorize callable activations * Poc for auto activation * Be framework-agnostic * Formatting * Remove class method * Remove unnecessary code
-
Sam Shleifer authored
-
Ivan Dolgov authored
* xlnet fp16 bug fix * comment cast added * Update modeling_xlnet.py Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
-
Joe Davison authored
* TFTrainer dataset doc & fix evaluation bug discussed in #6551 * add docstring to test/eval datasets
-
Sylvain Gugger authored
* Add tests to Trainer * Test if removing long breaks everything * Remove ugly hack * Fix distributed test * Use float for number of epochs
-
sgugger authored
-
Prajjwal Bhargava authored
* removed redundant arg in prepare_inputs * made same change in prediction_loop
-
Oren Amsalem authored
-
- 19 Aug, 2020 5 commits
-
Sylvain Gugger authored
-
Suraj Patil authored
-
Sam Shleifer authored
-
Pradhy729 authored
* Feed forward chunking for Distilbert & Albert * Added ff chunking for many other models * Change model signature * Added chunking for XLM * Cleaned up by removing some variables. * remove test_chunking flag Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
-
Patrick von Platen authored
* start adding tie encoder to decoder functionality * finish model tying * make style * Apply suggestions from code review * fix t5 list including cross attention * apply sams suggestions * Update src/transformers/modeling_encoder_decoder.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add max depth break point Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 18 Aug, 2020 3 commits
-
Sam Shleifer authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Ali Modarressi authored
* fixed label datatype for sts-b * naming update * make style * make style
-
Sam Shleifer authored
-