Commits · 930153e7d2d658267b7630a047a4bfc85b86042d · chenpangpang / transformers

28 Aug, 2020 1 commit
- [transformers-cli] fix logger getter (#6777) · 92ac2fa7
  Stas Bekman authored Aug 27, 2020
  
  92ac2fa7
27 Aug, 2020 4 commits

Format · 42fddacd
Lysandre authored Aug 27, 2020

42fddacd
[test schedulers] adjust to test the first step's reading (#6429) · dbfe34f2
Stas Bekman authored Aug 27, 2020
```
* [test schedulers] small improvement

* cleanup
```
dbfe34f2
[testing] replace hardcoded paths to allow running tests from anywhere (#6523) · e6b811f0
Stas Bekman authored Aug 27, 2020
```
* [testing] replace hardcoded paths to allow running tests from anywhere

* fix the merge conflict
```
e6b811f0

Add AdaFactor optimizer from fairseq (#6722) · 971d1802

Nikolai Yakovenko authored Aug 27, 2020



* AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to ADAM.

* update PR fixes, add basic test

* bug -- incorrect params in test

* bugfix -- import Adafactor into test

* bugfix -- removed accidental T5 include

* resetting T5 to master

* bugfix -- include Adafactor in __init__

* longer loop for adafactor test

* remove double error class declare

* lint

* black

* isort

* Update src/transformers/optimization.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* single docstring

* Cleanup docstring
Co-authored-by: Nikolai Y <nikolai.yakovenko@point72.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

971d1802

26 Aug, 2020 4 commits

[model_cards] Fix tiny typos · 3242e4d9
Julien Chaumond authored Aug 26, 2020

3242e4d9

[TF Longformer] Improve Speed for TF Longformer (#6447) · 858b7d58

Patrick von Platen authored Aug 26, 2020

* add tf graph compile tests

* fix conflict

* remove more tf transpose statements

* fix conflicts

* fix comment typos

* move function to class function

* fix black

* fix black

* make style

858b7d58

Black 20 release · a75c64d8
Lysandre authored Aug 26, 2020

a75c64d8

Centralize logging (#6434) · 77abd1e7

Lysandre Debut authored Aug 26, 2020



* Logging

* Style

* hf_logging > utils.logging

* Address @thomwolf's comments

* Update test

* Update src/transformers/benchmark/benchmark_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Revert bad change
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

77abd1e7

25 Aug, 2020 3 commits
- T5Tokenizer adds EOS token if not already added (#5866) · 62449570
  Sam Shleifer authored Aug 25, 2020
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  62449570
- Fix pegasus-xsum integration test (#6726) · e11d923b
  Sam Shleifer authored Aug 25, 2020
  
  e11d923b
- More tests to Trainer (#6699) · abc02021
  Sylvain Gugger authored Aug 25, 2020
```
* More tests to Trainer

* Add warning in the doc
```
  abc02021
24 Aug, 2020 1 commit
- Update repo to isort v5 (#6686) · a5737779
  Sylvain Gugger authored Aug 24, 2020
```
* Run new isort

* More changes

* Update CI, CONTRIBUTING and benchmarks
```
  a5737779
20 Aug, 2020 6 commits
- Regression test for pegasus bugfix (#6606) · 5bf4465e
  Sam Shleifer authored Aug 20, 2020
  
  5bf4465e
- One last threshold to raise · 86c07e63
  sgugger authored Aug 20, 2020
  
  86c07e63
- Move threshold up for flaky test with Electra (#6622) · e8af90c0
  Sylvain Gugger authored Aug 20, 2020
```
* Move threshold up for flaky test with Electra

* Update above as well
```
  e8af90c0
- [Tests] fix attention masks in Tests (#6621) · 505f2d74
  Patrick von Platen authored Aug 20, 2020
```
* fix distilbert

* fix typo
```
  505f2d74
- Add tests for Reformer tokenizer (#6485) · c9454507
  Denisa Roberts authored Aug 20, 2020
  
  c9454507
- Add tests to Trainer (#6605) · 573bdb0a
  Sylvain Gugger authored Aug 20, 2020
```
* Add tests to Trainer

* Test if removing long breaks everything

* Remove ugly hack

* Fix distributed test

* Use float for number of epochs
```
  573bdb0a
19 Aug, 2020 5 commits

[BartTokenizerFast] add prepare_seq2seq_batch (#6543) · 7581884d
Suraj Patil authored Aug 19, 2020

7581884d
fix model outputs test (#6593) · 8bcceace
Patrick von Platen authored Aug 19, 2020

8bcceace

Feed forward chunking others (#6365) · 2a7402cb

Pradhy729 authored Aug 19, 2020



* Feed forward chunking for Distilbert & Albert

* Added ff chunking for many other models

* Change model signature

* Added chunking for XLM

* Cleaned up by removing some variables.

* remove test_chunking flag
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

2a7402cb

[EncoderDecoder] Add functionality to tie encoder decoder weights (#6538) · fe0b85e7

Patrick von Platen authored Aug 19, 2020



* start adding tie encoder to decoder functionality

* finish model tying

* make style

* Apply suggestions from code review

* fix t5 list including cross attention

* apply sams suggestions

* Update src/transformers/modeling_encoder_decoder.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add max depth break point
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

fe0b85e7

Fix bart base test (#6587) · ab42d748
Sam Shleifer authored Aug 18, 2020

ab42d748

18 Aug, 2020 2 commits
- add BartConfig.force_bos_token_to_be_generated (#6526) · 1529bf96
  Sam Shleifer authored Aug 18, 2020
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  1529bf96
- [marian] converter supports models from new Tatoeba project (#6342) · 12d76241
  Sam Shleifer authored Aug 17, 2020
  
  12d76241
17 Aug, 2020 5 commits
- [T5Tokenizer] add prepare_seq2seq_batch method (#6122) · 407da12e
  Suraj Patil authored Aug 17, 2020
```
* tests
```
  407da12e
- [BartTokenizer] add prepare s2s batch (#6212) · 2a77813d
  Suraj Patil authored Aug 17, 2020
```
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
```
  2a77813d
- Fix flaky ONNX tests (#6531) · b41cc0b8
  Funtowicz Morgan authored Aug 17, 2020
  
  b41cc0b8
- Remove deprecated assertEquals (#6532) · 37709b59
  Kevin Canwen Xu authored Aug 17, 2020
```
`assertEquals` is deprecated: https://stackoverflow.com/questions/930995/assertequals-vs-assertequal-in-python/931011
This PR replaces these deprecated methods.
```
  37709b59
- Support additional dictionaries for BERT Japanese tokenizers (#6515) · 48c6c613
  Masatoshi Suzuki authored Aug 17, 2020
```
* Update BERT Japanese tokenizers

* Update CircleCI config to download unidic

* Specify to use the latest dictionary packages
```
  48c6c613
14 Aug, 2020 2 commits

[EncoderDecoder] Add Cross Attention for GPT2 (#6415) · 1d6e71e1

Patrick von Platen authored Aug 14, 2020



* add cross attention layers for gpt2

* make gpt2 cross attention work

* finish bert2gpt2

* add explicit comments

* remove attention mask since not yet supported

* revert attn mask in pipeline

* Update src/transformers/modeling_gpt2.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_encoder_decoder.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1d6e71e1

MBartForConditionalGeneration (#6441) · 680f1337

Suraj Patil authored Aug 14, 2020

* add MBartForConditionalGeneration

* style

* rebase and fixes

* add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS

* fix docs

* don't ignore mbart

* doc

* fix mbart fairseq link

* put mbart before bart

* apply doc suggestions

680f1337

13 Aug, 2020 2 commits

Test model outputs equivalence (#6445) · f7cbc13d

Lysandre Debut authored Aug 13, 2020

* Test model outputs equivalence

* Fix failing tests

* From dict to kwargs

* DistilBERT

* Addressing @sgugger and @patrickvonplaten's comments

f7cbc13d

cleanup tf unittests: part 2 (#6260) · e983da0e

Stas Bekman authored Aug 13, 2020

* cleanup torch unittests: part 2

* remove trailing comma added by isort, and which breaks flake

* one more comma

* revert odd balls

* part 3: odd cases

* more ["key"] -> .key refactoring

* .numpy() is not needed

* more unncessary .numpy() removed

* more simplification

e983da0e

12 Aug, 2020 3 commits

add targets arg to fill-mask pipeline (#6239) · bc820476

Joe Davison authored Aug 12, 2020

* add targets arg to fill-mask pipeline

* add tests and more error handling

* quality

* update docstring

bc820476

[EncoderDecoder] Add encoder-decoder for roberta/ vanilla longformer (#6411) · 0735def8
Patrick von Platen authored Aug 12, 2020
```
* add encoder-decoder for roberta

* fix headmask

* apply Sylvains suggestions

* fix typo

* Apply suggestions from code review
```
0735def8

Fixes to make life easier with the nlp library (#6423) · e9c30314

Sylvain Gugger authored Aug 12, 2020



* allow using tokenizer.pad as a collate_fn in pytorch

* allow using tokenizer.pad as a collate_fn in pytorch

* Add documentation and tests

* Make attention mask the right shape

* Better test
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

e9c30314

11 Aug, 2020 2 commits

lr_schedulers: add get_polynomial_decay_schedule_with_warmup (#6361) · ece0903e

Stas Bekman authored Aug 11, 2020



* [wip] add get_polynomial_decay_schedule_with_warmup

* style

* add assert

* change lr_end to a much smaller default number

* check for exact equality

* [model_cards] electra-base-turkish-cased-ner (#6350)

* for electra-base-turkish-cased-ner

* Add metadata
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Temporarily de-activate TPU CI

* Update modeling_tf_utils.py (#6372)

fix typo: ckeckpoint->checkpoint

* the test now works again (#6371)

* correct pl link in readme (#6364)

* refactor almost identical tests (#6339)

* refactor almost identical tests

* important to add a clear assert error message

* make the assert error even more descriptive than the original bt

* Small docfile fixes (#6328)

* Patch models (#6326)

* TFAlbertFor{TokenClassification, MultipleChoice}

* Patch models

* BERT and TF BERT info


s

* Update check_repo

* Ci GitHub caching (#6382)

* Cache Github Actions CI

* Remove useless file

* Colab button (#6389)

* Add colab button

* Add colab link for tutorials

* Fix links for open in colab (#6391)

* Update src/transformers/optimization.py

consistently use lr_end=1e-7 default
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* [wip] add get_polynomial_decay_schedule_with_warmup

* style

* add assert

* change lr_end to a much smaller default number

* check for exact equality

* Update src/transformers/optimization.py

consistently use lr_end=1e-7 default
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove dup (leftover from merge)

* convert the test into the new refactored format

* stick to using the current_step as is, without ++
Co-authored-by: M. Yusuf Sarıgöz <yusufsarigoz@gmail.com>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Alexander Measure <ameasure@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

ece0903e

rename prepare_translation_batch -> prepare_seq2seq_batch (#6103) · be1520d3
Sam Shleifer authored Aug 11, 2020

be1520d3