- 18 Jul, 2020 2 commits
  - Sam Shleifer authored
  - Nathan Raw authored
    Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
- 17 Jul, 2020 1 commit
  - Sam Shleifer authored
- 16 Jul, 2020 1 commit
  - Sam Shleifer authored
- 15 Jul, 2020 2 commits
  - Sam Shleifer authored
  - Sam Shleifer authored
- 14 Jul, 2020 1 commit
  - Boris Dayma authored
    * docs(wandb): explain how to use W&B integration (fix #5262)
    * Also mention TensorBoard
    Co-authored-by: Julien Chaumond <chaumond@gmail.com>
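The W&B integration documented in that commit is driven by environment variables rather than code changes. A minimal setup sketch, assuming the variable names described in the integration docs (the project name and values below are illustrative placeholders, not from the commit):

```shell
pip install wandb
wandb login

# Assumed configuration variables; check the docs added by this commit.
export WANDB_PROJECT=my-finetuning-runs   # hypothetical project name
export WANDB_WATCH=gradients              # log gradient histograms
# export WANDB_DISABLED=true              # opt out of logging entirely

# Any example script built on Trainer then reports to W&B automatically.
```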
- 10 Jul, 2020 1 commit
  - Julien Chaumond authored
    Co-authored-by: Suraj Patil <surajp815@gmail.com>
    Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- 09 Jul, 2020 1 commit
  - Lysandre Debut authored
    * Test XLA examples
    * Style
    * Using `require_torch_tpu`
    * Style
    * No need for pytest
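The `require_torch_tpu` decorator referenced above replaces a pytest marker with a plain `unittest` skip. A minimal sketch of how such a decorator can be built, assuming the TPU probe is "does `torch_xla` import" (the real helper lives in the library's testing utilities; names here are illustrative):

```python
import unittest


def is_torch_tpu_available() -> bool:
    # Hypothetical probe: treat a TPU as available when torch_xla imports.
    try:
        import torch_xla  # noqa: F401
        return True
    except ImportError:
        return False


def require_torch_tpu(test_case):
    """Skip the decorated test (or TestCase) unless a TPU backend is importable."""
    return unittest.skipUnless(is_torch_tpu_available(), "test requires a TPU")(test_case)


@require_torch_tpu
class XLAExampleTest(unittest.TestCase):
    def test_trainer_runs_on_tpu(self):
        # The real tests launch the example scripts; elided here.
        self.assertTrue(True)
```

On machines without `torch_xla` the whole class is reported as skipped rather than failing, which is what lets the suite run everywhere without pytest-specific machinery.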
- 08 Jul, 2020 1 commit
  - Ji Xin authored
    * Add deebert code
    * Add readme of deebert
    * Add test for deebert
    * Update test for Deebert
    * Update DeeBert (README, class names, function refactoring); remove requirements.txt
    * Format update
    * Update test
    * Update readme and model init methods
- 07 Jul, 2020 5 commits
  - Patrick von Platen authored
  - Sam Shleifer authored
    * improve unittests for finetuning, especially w.r.t. testing frozen parameters
    * fix freeze_embeds for T5
    * add streamlit setup.cfg
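Tests for frozen parameters of the kind this commit improves typically lean on small helpers such as `freeze_params` / `assert_all_frozen` (names modeled on the seq2seq example utilities). A sketch of the pattern, with hypothetical `Param`/`Model` stand-ins so it runs without torch:

```python
class Param:
    """Stand-in for a torch parameter: just carries requires_grad."""
    def __init__(self):
        self.requires_grad = True


class Model:
    """Stand-in for a torch module exposing parameters()."""
    def __init__(self, n_params: int):
        self._params = [Param() for _ in range(n_params)]

    def parameters(self):
        return iter(self._params)


def freeze_params(model):
    # Freezing means turning off gradient tracking for every parameter.
    for p in model.parameters():
        p.requires_grad = False


def assert_all_frozen(model):
    still_trainable = [p for p in model.parameters() if p.requires_grad]
    assert not still_trainable, f"{len(still_trainable)} parameters are still trainable"


model = Model(4)
freeze_params(model)
assert_all_frozen(model)  # raises if any parameter was missed
```

With real modules the same helpers catch bugs like `freeze_embeds` missing one of T5's embedding tables, which is exactly the failure mode such unittests are meant to surface.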
  - Patrick von Platen authored
    [Almost all TF models] TF clean up: add missing CLM / MLM loss; fix T5 naming and keras compile (#5395)
    * add first version of clm tf
    * make style
    * add more tests for bert
    * update tf clm loss
    * fix tests
    * correct tf ner script
    * add mlm loss
    * delete bogus file
    * clean tf auto model + add tests
    * finish adding clm loss everywhere
    * fix training in distilbert
    * fix flake8
    * save intermediate
    * fix tf t5 naming
    * remove prints
    * finish up
    * up
    * fix tf gpt2
    * fix new test utils import
    * fix flake8
    * keep backward compatibility
    * apply review suggestions to modeling_tf_albert.py, modeling_tf_auto.py, modeling_tf_electra.py, modeling_tf_roberta.py, modeling_tf_mobilebert.py, modeling_tf_bert.py, and modeling_tf_distilbert.py
    Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  - Suraj Patil authored
    * add SquadDataset
    * add DataCollatorForQuestionAnswering
    * update __init__
    * add run_squad with trainer
    * add DataCollatorForQuestionAnswering in __init__
    * pass data_collator to trainer
    * doc tweak
    * Update run_squad_trainer.py
    * Update __init__.py
    * Update __init__.py
    Co-authored-by: Julien Chaumond <chaumond@gmail.com>
    Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  - Shashank Gupta authored
    * Added data collator for XLNet language modeling and related calls: added DataCollatorForXLNetLanguageModeling in data/data_collator.py to generate the inputs needed for language-modeling training with XLNetLMHeadModel, plus the related arguments, logic, and calls in examples/language-modeling/run_language_modeling.py. Resolves: #4739, #2008 (partially)
    * Changed the name to the more general `DataCollatorForPermutationLanguageModeling`. Removed the `--mlm` flag requirement for the new collator and defined a separate `--plm_probability` flag for its use. CTRL uses a CLM loss just like GPT and GPT-2, so it should work out of the box with this script (provided `past` is taken care of similarly to `mems` for XLNet). Changed calls and imports appropriately.
    * Added detailed comments to `DataCollatorForPermutationLanguageModeling` in `data/data_collator.py` to explain how it works, and cleaned up variable names to make them more informative.
    * Added tests in `tests/test_trainer.py` for DataCollatorForPermutationLanguageModeling, based on those for DataCollatorForLanguageModeling, including a specific test for odd-length sequences.
    * Fixed styling issues
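The heart of such a collator is span-based masking: sample a span length, reserve a context window sized so that roughly `plm_probability` of tokens end up masked, and repeat across the sequence. A simplified, self-contained sketch of that selection step (an illustration of the idea only, not the library class, which additionally builds permutation orders, target mappings, and attention masks):

```python
import random


def mask_spans(seq_len, plm_probability=1 / 6, max_span_length=5, rng=None):
    """Return a boolean mask covering roughly plm_probability of seq_len tokens,
    chosen as contiguous spans of length 1..max_span_length."""
    rng = rng or random.Random()
    masked = [False] * seq_len
    cur = 0
    while cur < seq_len:
        span = rng.randint(1, max_span_length)
        # Reserve a context window of span / plm_probability tokens so the
        # masked fraction comes out near plm_probability on average.
        context = int(span / plm_probability)
        start = cur + rng.randint(0, max(context - span, 0))
        for i in range(start, min(start + span, seq_len)):
            masked[i] = True
        cur += context
    return masked


mask = mask_spans(64, rng=random.Random(0))
```

With the defaults (`plm_probability=1/6`, `max_span_length=5`) about one token in six is masked, in short contiguous runs rather than independently per position.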
- 06 Jul, 2020 1 commit
  - Lysandre Debut authored
- 01 Jul, 2020 3 commits
  - Sylvain Gugger authored
    * Cleanup and unify Trainer/TFTrainer
    * Forgot to adapt TFTrainingArgs
    * In tf scripts n_gpu -> n_replicas
    * Update src/transformers/training_args.py
    * Address review comments
    * Formatting
    * Fix typo
    Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
  - Sam Shleifer authored
  - Sylvain Gugger authored
- 30 Jun, 2020 4 commits
  - Hong Xu authored
    Otherwise, if label is not specified, the following error occurs:
        Traceback (most recent call last):
          File "run_ner.py", line 303, in <module>
            main()
          File "run_ner.py", line 101, in main
            model_args, data_args, training_args = parser.parse_json_file(json_file=os.path.abspath(sys.argv[1]))
          File "/home/user/anaconda3/envs/bert/lib/python3.7/site-packages/transformers/hf_argparser.py", line 159, in parse_json_file
            obj = dtype(**inputs)
        TypeError: __init__() missing 1 required positional argument: 'labels'
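The underlying fix is to give `labels` a default so the argument dataclass no longer requires it. A reduced, hypothetical version of the run_ner argument dataclass showing the pattern (field names besides `labels` are illustrative):

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class DataTrainingArguments:
    data_dir: str = field(metadata={"help": "Input data directory."})
    # Giving labels a default makes it optional, so constructing the
    # dataclass from a JSON/argv source that omits it no longer raises
    # "TypeError: __init__() missing 1 required positional argument: 'labels'".
    labels: Optional[str] = field(
        default=None, metadata={"help": "Path to a file containing all labels."}
    )


args = DataTrainingArguments(data_dir="./data")  # no labels supplied: fine
```

`HfArgumentParser` builds dataclass instances from parsed inputs with `dtype(**inputs)`, which is why a field without a default turns a missing JSON key into the TypeError above.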
  - Sam Shleifer authored
  - Kevin Canwen Xu authored
  - MichaelJanz authored
    * Fix the bug 'Attempted relative import with no known parent package' when using the bertabs example. Also change the model used from bertabs-finetuned-cnndm, since it no longer seems to be accessible
    * Update run_summarization.py
    Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
- 29 Jun, 2020 2 commits
  - Sam Shleifer authored
  - Patrick von Platen authored
    * first doc version
    * add benchmark docs
    * fix typos
    * improve README
    * Update docs/source/benchmarks.rst
    * fix naming and docs
    Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
- 28 Jun, 2020 2 commits
  - Sam Shleifer authored
    * all save_pretrained methods mkdir if not os.path.exists
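The one-line message above describes a simple pattern: create the target directory before writing instead of failing when it does not exist. A hedged sketch of what that looks like in a `save_pretrained`-style method (the function body and file name here are illustrative, not the library's code):

```python
import os
import tempfile


def save_pretrained(save_directory: str) -> str:
    # The modern idiom for "mkdir if not os.path.exists": makedirs with
    # exist_ok=True is a no-op when the directory is already there and
    # creates intermediate directories when it is not.
    os.makedirs(save_directory, exist_ok=True)
    path = os.path.join(save_directory, "config.json")
    with open(path, "w") as f:
        f.write("{}")  # placeholder payload
    return path


with tempfile.TemporaryDirectory() as tmp:
    target = os.path.join(tmp, "new", "nested", "dir")  # does not exist yet
    written = save_pretrained(target)
    assert os.path.isfile(written)
```

Using `exist_ok=True` also avoids the race between an `os.path.exists` check and the `mkdir` call when two processes save to the same directory.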
  - Suraj Patil authored
- 26 Jun, 2020 4 commits
  - Sam Shleifer authored
  - Sam Shleifer authored
  - Thomas Wolf authored
    * remove references to old API in docstring - update data processors
    * style
    * fix tests - better type checking error messages
    * better type checking
    * include awesome fix by @LysandreJik for #5310
    * updated doc and examples
  - Patrick von Platen authored
    * improve plotting
    * better labels
    * fix time plot
- 25 Jun, 2020 3 commits
  - Lysandre Debut authored
  - Sam Shleifer authored
  - Sam Shleifer authored
- 24 Jun, 2020 5 commits
  - Victor SANH authored
    * fix weirdness in roberta/bart for mnli trained checkpoints
    * black compliance
    * isort code check
  - Patrick von Platen authored
    * add benchmark for all kinds of models
    * improved import
    * delete bogus files
    * make style
  - Sylvain Gugger authored
  - Sylvain Gugger authored
  - Kevin Canwen Xu authored
    * Fix PABEE division by zero error
    * patience=0 by default
- 23 Jun, 2020 1 commit
  - Sam Shleifer authored