Commits · 563485bf95f5c9fd066f2874019ea1e08d3c9770 · chenpangpang / transformers

"vscode:/vscode.git/clone" did not exist on "43114b89ba75a844ae5a61291a8cf40626a47b6e"

30 Aug, 2020 1 commit
- [tests] fix typos in inputs (#6818) · 563485bf
  Stas Bekman authored Aug 30, 2020
  
  563485bf
29 Aug, 2020 1 commit
- Pegasus finetune script: add --adafactor (#6811) · 0f58903b
  Sam Shleifer authored Aug 29, 2020
  
  0f58903b
28 Aug, 2020 5 commits

t5 model should make decoder_attention_mask (#6800) · 3cac867f
Sam Shleifer authored Aug 28, 2020

3cac867f
Fix style (#6803) · 20f77864
Sam Shleifer authored Aug 28, 2020

20f77864

prepare_seq2seq_batch makes labels/ decoder_input_ids made later. (#6654) · 9336086a

Sam Shleifer authored Aug 28, 2020

* broken test

* batch parity

* tests pass

* boom boom

* boom boom

* split out bart tokenizer tests

* fix tests

* boom boom

* Fixed dataset bug

* Fix marian

* Undo extra

* Get marian working

* Fix t5 tok tests

* Test passing

* Cleanup

* better assert msg

* require torch

* Fix mbart tests

* undo extra decoder_attn_mask change

* Fix import

* pegasus tokenizer can ignore src_lang kwargs

* unused kwarg test cov

* boom boom

* add todo for pegasus issue

* cover one word translation edge case

* Cleanup

* doc

9336086a

Transformer-XL: Improved tokenization with sacremoses (#6322) · cb276b41

RafaelWO authored Aug 28, 2020



* Improved tokenization with sacremoses

 * The TransfoXLTokenizer is now using sacremoses for tokenization
 * Added tokenization of comma-separated and floating point numbers.
 * Removed prepare_for_tokenization() from tokenization_transfo_xl.py because punctuation is handled by sacremoses
 * Added corresponding tests
 * Removed test comapring TransfoXLTokenizer and TransfoXLTokenizerFast
 * Added deprecation warning to TransfoXLTokenizerFast

* isort change
Co-authored-by: Teven <teven.lescao@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

cb276b41

[transformers-cli] fix logger getter (#6777) · 92ac2fa7
Stas Bekman authored Aug 27, 2020

92ac2fa7

27 Aug, 2020 4 commits

Format · 42fddacd
Lysandre authored Aug 27, 2020

42fddacd
[test schedulers] adjust to test the first step's reading (#6429) · dbfe34f2
Stas Bekman authored Aug 27, 2020
```
* [test schedulers] small improvement

* cleanup
```
dbfe34f2
[testing] replace hardcoded paths to allow running tests from anywhere (#6523) · e6b811f0
Stas Bekman authored Aug 27, 2020
```
* [testing] replace hardcoded paths to allow running tests from anywhere

* fix the merge conflict
```
e6b811f0

Add AdaFactor optimizer from fairseq (#6722) · 971d1802

Nikolai Yakovenko authored Aug 27, 2020



* AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to ADAM.

* update PR fixes, add basic test

* bug -- incorrect params in test

* bugfix -- import Adafactor into test

* bugfix -- removed accidental T5 include

* resetting T5 to master

* bugfix -- include Adafactor in __init__

* longer loop for adafactor test

* remove double error class declare

* lint

* black

* isort

* Update src/transformers/optimization.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* single docstring

* Cleanup docstring
Co-authored-by: Nikolai Y <nikolai.yakovenko@point72.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

971d1802

26 Aug, 2020 4 commits

[model_cards] Fix tiny typos · 3242e4d9
Julien Chaumond authored Aug 26, 2020

3242e4d9

[TF Longformer] Improve Speed for TF Longformer (#6447) · 858b7d58

Patrick von Platen authored Aug 26, 2020

* add tf graph compile tests

* fix conflict

* remove more tf transpose statements

* fix conflicts

* fix comment typos

* move function to class function

* fix black

* fix black

* make style

858b7d58

Black 20 release · a75c64d8
Lysandre authored Aug 26, 2020

a75c64d8

Centralize logging (#6434) · 77abd1e7

Lysandre Debut authored Aug 26, 2020



* Logging

* Style

* hf_logging > utils.logging

* Address @thomwolf's comments

* Update test

* Update src/transformers/benchmark/benchmark_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Revert bad change
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

77abd1e7

25 Aug, 2020 3 commits
- T5Tokenizer adds EOS token if not already added (#5866) · 62449570
  Sam Shleifer authored Aug 25, 2020
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  62449570
- Fix pegasus-xsum integration test (#6726) · e11d923b
  Sam Shleifer authored Aug 25, 2020
  
  e11d923b
- More tests to Trainer (#6699) · abc02021
  Sylvain Gugger authored Aug 25, 2020
```
* More tests to Trainer

* Add warning in the doc
```
  abc02021
24 Aug, 2020 1 commit
- Update repo to isort v5 (#6686) · a5737779
  Sylvain Gugger authored Aug 24, 2020
```
* Run new isort

* More changes

* Update CI, CONTRIBUTING and benchmarks
```
  a5737779
20 Aug, 2020 6 commits
- Regression test for pegasus bugfix (#6606) · 5bf4465e
  Sam Shleifer authored Aug 20, 2020
  
  5bf4465e
- One last threshold to raise · 86c07e63
  sgugger authored Aug 20, 2020
  
  86c07e63
- Move threshold up for flaky test with Electra (#6622) · e8af90c0
  Sylvain Gugger authored Aug 20, 2020
```
* Move threshold up for flaky test with Electra

* Update above as well
```
  e8af90c0
- [Tests] fix attention masks in Tests (#6621) · 505f2d74
  Patrick von Platen authored Aug 20, 2020
```
* fix distilbert

* fix typo
```
  505f2d74
- Add tests for Reformer tokenizer (#6485) · c9454507
  Denisa Roberts authored Aug 20, 2020
  
  c9454507
- Add tests to Trainer (#6605) · 573bdb0a
  Sylvain Gugger authored Aug 20, 2020
```
* Add tests to Trainer

* Test if removing long breaks everything

* Remove ugly hack

* Fix distributed test

* Use float for number of epochs
```
  573bdb0a
19 Aug, 2020 5 commits

[BartTokenizerFast] add prepare_seq2seq_batch (#6543) · 7581884d
Suraj Patil authored Aug 19, 2020

7581884d
fix model outputs test (#6593) · 8bcceace
Patrick von Platen authored Aug 19, 2020

8bcceace

Feed forward chunking others (#6365) · 2a7402cb

Pradhy729 authored Aug 19, 2020



* Feed forward chunking for Distilbert & Albert

* Added ff chunking for many other models

* Change model signature

* Added chunking for XLM

* Cleaned up by removing some variables.

* remove test_chunking flag
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

2a7402cb

[EncoderDecoder] Add functionality to tie encoder decoder weights (#6538) · fe0b85e7

Patrick von Platen authored Aug 19, 2020



* start adding tie encoder to decoder functionality

* finish model tying

* make style

* Apply suggestions from code review

* fix t5 list including cross attention

* apply sams suggestions

* Update src/transformers/modeling_encoder_decoder.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add max depth break point
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

fe0b85e7

Fix bart base test (#6587) · ab42d748
Sam Shleifer authored Aug 18, 2020

ab42d748

18 Aug, 2020 2 commits
- add BartConfig.force_bos_token_to_be_generated (#6526) · 1529bf96
  Sam Shleifer authored Aug 18, 2020
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  1529bf96
- [marian] converter supports models from new Tatoeba project (#6342) · 12d76241
  Sam Shleifer authored Aug 17, 2020
  
  12d76241
17 Aug, 2020 5 commits
- [T5Tokenizer] add prepare_seq2seq_batch method (#6122) · 407da12e
  Suraj Patil authored Aug 17, 2020
```
* tests
```
  407da12e
- [BartTokenizer] add prepare s2s batch (#6212) · 2a77813d
  Suraj Patil authored Aug 17, 2020
```
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
```
  2a77813d
- Fix flaky ONNX tests (#6531) · b41cc0b8
  Funtowicz Morgan authored Aug 17, 2020
  
  b41cc0b8
- Remove deprecated assertEquals (#6532) · 37709b59
  Kevin Canwen Xu authored Aug 17, 2020
```
`assertEquals` is deprecated: https://stackoverflow.com/questions/930995/assertequals-vs-assertequal-in-python/931011
This PR replaces these deprecated methods.
```
  37709b59
- Support additional dictionaries for BERT Japanese tokenizers (#6515) · 48c6c613
  Masatoshi Suzuki authored Aug 17, 2020
```
* Update BERT Japanese tokenizers

* Update CircleCI config to download unidic

* Specify to use the latest dictionary packages
```
  48c6c613
14 Aug, 2020 2 commits

[EncoderDecoder] Add Cross Attention for GPT2 (#6415) · 1d6e71e1

Patrick von Platen authored Aug 14, 2020



* add cross attention layers for gpt2

* make gpt2 cross attention work

* finish bert2gpt2

* add explicit comments

* remove attention mask since not yet supported

* revert attn mask in pipeline

* Update src/transformers/modeling_gpt2.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_encoder_decoder.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1d6e71e1

MBartForConditionalGeneration (#6441) · 680f1337

Suraj Patil authored Aug 14, 2020

* add MBartForConditionalGeneration

* style

* rebase and fixes

* add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS

* fix docs

* don't ignore mbart

* doc

* fix mbart fairseq link

* put mbart before bart

* apply doc suggestions

680f1337

13 Aug, 2020 1 commit

Test model outputs equivalence (#6445) · f7cbc13d

Lysandre Debut authored Aug 13, 2020

* Test model outputs equivalence

* Fix failing tests

* From dict to kwargs

* DistilBERT

* Addressing @sgugger and @patrickvonplaten's comments

f7cbc13d