Commits · 77abd1e79fc37efb45bde0041f9dad0eabc55517 · chenpangpang / transformers

26 Aug, 2020 3 commits

Lysandre Debut authored Aug 26, 2020

* Logging

* Style

* hf_logging > utils.logging

* Address @thomwolf's comments

* Update test

* Update src/transformers/benchmark/benchmark_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Revert bad change
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

77abd1e7

Fix tf boolean mask in graph mode (#6741) · 461ae868
Jay Yip authored Aug 26, 2020

461ae868

Add "tie_word_embeddings" config param (#6692) · 925f34bb

Patrick von Platen authored Aug 26, 2020

* add tie_word_embeddings

* correct word embeddings in modeling utils

* make style

* make config param only relevant for torch

* make style

* correct typo

* delete deprecated arg in transo-xl

925f34bb

25 Aug, 2020 11 commits
- T5Tokenizer adds EOS token if not already added (#5866) · 62449570
  Sam Shleifer authored Aug 25, 2020
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  62449570
- [squad] make examples and dataset accessible from SquadDataset object (#6710) · 7e6397a7
  Tomo Lazovich authored Aug 25, 2020
```
* [squad] make examples and dataset accessible from SquadDataset object

* [squad] add support for legacy cache files
```
  7e6397a7
- Fix ONNX test_quantize unittest (#6716) · ac9702c2
  Funtowicz Morgan authored Aug 25, 2020
  
  ac9702c2
- add missing keys (#6719) · d17cce22
  Patrick von Platen authored Aug 25, 2020
  
  d17cce22
- tensor.nonzero() is deprecated in PyTorch 1.6 (#6715) · 625318f5
  Funtowicz Morgan authored Aug 25, 2020
```
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
```
  625318f5
- Add tokenizer to Trainer (#6689) · 124c3d6a
  Sylvain Gugger authored Aug 25, 2020
  
  124c3d6a
- More tests to Trainer (#6699) · abc02021
  Sylvain Gugger authored Aug 25, 2020
```
* More tests to Trainer

* Add warning in the doc
```
  abc02021
- Use generators tqdm progressbars (#6696) · f5bad031
  Sylvain Gugger authored Aug 25, 2020
  
  f5bad031
- Add typing.overload for convert_ids_tokens (#6637) · 841f0715
  Yohei Tamura authored Aug 25, 2020
```
* add overload for type checker

* black
```
  841f0715
- Remove hard-coded uses of float32 to fix mixed precision use (#6648) · 4fca874e
  Jay authored Aug 25, 2020
  
  4fca874e
- Fix hyperparameter_search doc (#6695) · d20cbb88
  Sylvain Gugger authored Aug 24, 2020
  
  d20cbb88

24 Aug, 2020 8 commits

Move unused args to kwargs (#6694) · 6b4c6176
Sylvain Gugger authored Aug 24, 2020

6b4c6176

Lat fix for Ray HP search (#6691) · 8f98faf9
Sylvain Gugger authored Aug 24, 2020

8f98faf9

Add hyperparameter search to Trainer (#6576) · 3a7fdd3f

Sylvain Gugger authored Aug 24, 2020

* Add optuna hyperparameter search to Trainer

* @julien-c suggestions
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Make compute_objective an arg function

* Formatting

* Rework to make it easier to add ray

* Formatting

* Initial support for Ray

* Formatting

* Polish and finalize

* Add trial id to checkpoint with Ray

* Smaller default

* Use GPU in ray if available

* Formatting

* Fix test

* Update install instruction
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Address review comments

* Formatting post-merge
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

3a7fdd3f

Update repo to isort v5 (#6686) · a5737779
Sylvain Gugger authored Aug 24, 2020
```
* Run new isort

* More changes

* Update CI, CONTRIBUTING and benchmarks
```
a5737779

Fixed DataCollatorForLanguageModeling not accepting lists of lists (#6685) · d329c9b0

Teven authored Aug 24, 2020

* Fixed DataCollatorForLanguageModeling + PermutationLanguageModeling not accepting lists of lists

* Update data_collator.py

* black was grumpy

d329c9b0

Missing commit · 0a850d21
sgugger authored Aug 24, 2020

0a850d21

Don't reset the dataset type + plug for rm unused columns (#6683) · b30879fe

Sylvain Gugger authored Aug 24, 2020

* Don't reset the type of the dataset

* Formatting

* Update trainer.py
Co-authored-by: Teven <teven.lescao@gmail.com>

b30879fe

Specify config filename (#6626) · 1a779ad7
Jared T Nielsen authored Aug 24, 2020

1a779ad7

21 Aug, 2020 2 commits
- CamembertForCausalLM (#6577) · d0e42a7b
  Suraj Patil authored Aug 21, 2020
```
* added CamembertForCausalLM

* add in __init__ and auto model

* style

* doc
```
  d0e42a7b
- Remove accidental comment (#6629) · bdf7e5de
  josephrocca authored Aug 21, 2020
  
  bdf7e5de

20 Aug, 2020 8 commits

Trainer automatically drops unused columns in nlp datasets (#6449) · e5f45227

Sylvain Gugger authored Aug 20, 2020

* Add a classmethod to easily build a Trainer from nlp dataset and metric

* Fix docstrings

* Split train/eval

* Formatting

* Log dropped columns + docs

* Authorize callable activations

* Poc for auto activation

* Be framework-agnostic

* Formatting

* Remove class method

* Remove unnecessary code

e5f45227

Regression test for pegasus bugfix (#6606) · 5bf4465e
Sam Shleifer authored Aug 20, 2020

5bf4465e

XLNet Bug when training with apex 16-bit precision (#6567) · 95395837

Ivan Dolgov authored Aug 20, 2020

* xlnet fp16 bug fix

* comment cast added

* Update modeling_xlnet.py
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>

95395837

TFTrainer dataset doc & fix evaluation bug (#6618) · f9d280a9

Joe Davison authored Aug 20, 2020

* TFTrainer dataset doc & fix evaluation bug

discussed in #6551

* add docstring to test/eval datasets

f9d280a9

Add tests to Trainer (#6605) · 573bdb0a

Sylvain Gugger authored Aug 20, 2020

* Add tests to Trainer

* Test if removing long breaks everything

* Remove ugly hack

* Fix distributed test

* Use float for number of epochs

573bdb0a

Fix CI · b3e54698
sgugger authored Aug 20, 2020

b3e54698

removed redundant arg in prepare_inputs (#6614) · 33bf4264
Prajjwal Bhargava authored Aug 20, 2020
```
* removed redundant arg in prepare_inputs

* made same change in prediction_loop
```
33bf4264

[cleanup] remove confusing newline (#6603) · 93c5c9a5
Oren Amsalem authored Aug 20, 2020

93c5c9a5

19 Aug, 2020 5 commits

Fix #6575 (#6596) · 18ca0e91
Sylvain Gugger authored Aug 19, 2020

18ca0e91

[BartTokenizerFast] add prepare_seq2seq_batch (#6543) · 7581884d
Suraj Patil authored Aug 19, 2020

7581884d

tf generation utils: remove unused kwargs (#6591) · 9a86321b
Sam Shleifer authored Aug 19, 2020

9a86321b

Feed forward chunking others (#6365) · 2a7402cb

Pradhy729 authored Aug 19, 2020

* Feed forward chunking for Distilbert & Albert

* Added ff chunking for many other models

* Change model signature

* Added chunking for XLM

* Cleaned up by removing some variables.

* remove test_chunking flag
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

2a7402cb

[EncoderDecoder] Add functionality to tie encoder decoder weights (#6538) · fe0b85e7

Patrick von Platen authored Aug 19, 2020

* start adding tie encoder to decoder functionality

* finish model tying

* make style

* Apply suggestions from code review

* fix t5 list including cross attention

* apply sams suggestions

* Update src/transformers/modeling_encoder_decoder.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add max depth break point
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

fe0b85e7

18 Aug, 2020 3 commits
- add BartConfig.force_bos_token_to_be_generated (#6526) · 1529bf96
  Sam Shleifer authored Aug 18, 2020
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  1529bf96
- Fixed label datatype for STS-B (#6492) · 5a81195e
  Ali Modarressi authored Aug 18, 2020
```
* fixed label datatype for sts-b

* naming update

* make style

* make style
```
  5a81195e
- [marian] converter supports models from new Tatoeba project (#6342) · 12d76241
  Sam Shleifer authored Aug 17, 2020
  
  12d76241