- 20 Aug, 2020 14 commits
-
-
sgugger authored
-
Sylvain Gugger authored
* Move threshold up for flaky test with Electra * Update above as well
-
Ivan Dolgov authored
* xlnet fp16 bug fix * comment cast added * Update modeling_xlnet.py Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
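The underlying issue is a masking constant that overflows in half precision. A minimal sketch of the usual shape of this kind of fix (illustrative only; the actual patch lives in modeling_xlnet.py and the helper name here is hypothetical):

    import torch

    def mask_attention_scores(attn_score: torch.Tensor, attn_mask: torch.Tensor) -> torch.Tensor:
        # -1e30 overflows in float16 (max finite value is ~65504), so pick the
        # masking constant from the score dtype and cast the mask to match.
        big_neg = 65500.0 if attn_score.dtype == torch.float16 else 1e30
        return attn_score - big_neg * attn_mask.to(attn_score.dtype)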
-
Patrick von Platen authored
* fix distilbert * fix typo
-
Denisa Roberts authored
-
Joe Davison authored
* TFTrainer dataset doc & fix evaluation bug discussed in #6551 * add docstring to test/eval datasets
-
Sylvain Gugger authored
* Add tests to Trainer * Test if removing long breaks everything * Remove ugly hack * Fix distributed test * Use float for number of epochs
-
Joe Davison authored
* add intro to nlp lib + links * unique links...
-
sgugger authored
-
Prajjwal Bhargava authored
* removed redundant arg in prepare_inputs * made same change in prediction_loop
-
Romain Rigaux authored
Tested in a local build of the docs, e.g. just above https://huggingface.co/transformers/task_summary.html#causal-language-modeling.

Copy will copy the full code, e.g.:

    for token in top_5_tokens:
        print(sequence.replace(tokenizer.mask_token, tokenizer.decode([token])))

Instead of currently only:

    for token in top_5_tokens:

for a docs snippet that reads:

    >>> for token in top_5_tokens:
    ...     print(sequence.replace(tokenizer.mask_token, tokenizer.decode([token])))
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help reduce our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help increase our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help decrease our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help offset our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help improve our carbon footprint.

Docs for the option fix: https://sphinx-copybutton.readthedocs.io/en/latest/
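sphinx-copybutton exposes documented options for stripping interpreter prompts before copying; the fix is presumably along these lines in the docs conf.py (a sketch; the exact regex used in the repo may differ):

    # docs/source/conf.py
    # strip ">>> " and "... " prompts so Copy yields runnable code
    copybutton_prompt_text = r">>> |\.\.\. "
    copybutton_prompt_is_regexp = True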
-
Stas Bekman authored
-
Siddharth Jain authored
-
Oren Amsalem authored
-
- 19 Aug, 2020 7 commits
-
-
Sylvain Gugger authored
-
Suraj Patil authored
-
Patrick von Platen authored
-
Sam Shleifer authored
-
Pradhy729 authored
* Feed forward chunking for Distilbert & Albert * Added ff chunking for many other models * Change model signature * Added chunking for XLM * Cleaned up by removing some variables. * remove test_chunking flag Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
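Feed-forward chunking trades a little speed for memory: instead of pushing the whole sequence through the position-wise feed-forward layer at once, it processes slices and concatenates the results. A minimal PyTorch sketch of the idea (illustrative; not the exact transformers implementation, and chunked_feed_forward is a hypothetical name):

    import torch
    import torch.nn as nn

    def chunked_feed_forward(ff: nn.Module, hidden_states: torch.Tensor, chunk_size: int) -> torch.Tensor:
        # hidden_states: (batch, seq_len, dim); ff is applied independently per
        # position, so splitting along the sequence dimension changes peak
        # memory usage but not the result.
        if chunk_size <= 0:  # chunking disabled
            return ff(hidden_states)
        chunks = hidden_states.split(chunk_size, dim=1)
        return torch.cat([ff(chunk) for chunk in chunks], dim=1)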
-
Patrick von Platen authored
* start adding tie encoder to decoder functionality * finish model tying * make style * Apply suggestions from code review * fix t5 list including cross attention * apply Sam's suggestions * Update src/transformers/modeling_encoder_decoder.py * add max depth break point Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
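A rough sketch of what tying encoder to decoder means (illustrative only; the real implementation in modeling_encoder_decoder.py recursively ties matching submodules, and the tying direction and depth guard here are simplified):

    import torch.nn as nn

    def tie_encoder_decoder_weights(encoder: nn.Module, decoder: nn.Module,
                                    depth: int = 0, max_depth: int = 500) -> None:
        # Recurse into structurally matching children and share their
        # parameters, so the tied layers train as a single set of weights.
        if depth > max_depth:  # the "max depth break point" mentioned above
            return
        encoder_children = dict(encoder.named_children())
        for name, decoder_child in decoder.named_children():
            if name in encoder_children:
                tie_encoder_decoder_weights(encoder_children[name], decoder_child,
                                            depth + 1, max_depth)
        if (getattr(encoder, "weight", None) is not None
                and getattr(decoder, "weight", None) is not None
                and encoder.weight.shape == decoder.weight.shape):
            decoder.weight = encoder.weight  # share the tensor itself, not a copy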
-
Sam Shleifer authored
-
- 18 Aug, 2020 13 commits
-
-
Sam Shleifer authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
* Bert2GPT2 EncoderDecoder model * Update README.md
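The EncoderDecoderModel API this builds on can warm-start a seq2seq model from any pretrained encoder/decoder pair; a usage sketch (checkpoint names are examples):

    from transformers import EncoderDecoderModel

    # warm-start a seq2seq model from a BERT encoder and a GPT2 decoder
    model = EncoderDecoderModel.from_encoder_decoder_pretrained("bert-base-uncased", "gpt2")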
-
Suraj Patil authored
-
Suraj Patil authored
Minor typo correction @sshleifer
-
Manuel Romero authored
-
Manuel Romero authored
-
Romain Rigaux authored
-
Romain Rigaux authored
-
Stas Bekman authored
As discussed at https://github.com/huggingface/transformers/issues/6317, codecov currently sends an invalid report when it fails to find a code coverage report for the base it checks against, so this gets fixed by:

- require_base: yes # don't report if there is no base coverage report

Let's add this one too for clarity, as it supposedly is already the default:

- require_head: yes # don't report if there is no head coverage report

And perhaps there is no point reporting on doc changes, as they don't make any difference and just generate noise:

- require_changes: true # only comment if there was a change in coverage
-
Stefan Schweter authored
-
Philip May authored
* Update README.md * Update model_cards/german-nlp-group/electra-base-german-uncased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Ali Modarressi authored
* fixed label datatype for sts-b * naming update * make style * make style
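STS-B is a regression task, so its labels are continuous similarity scores rather than integer class ids; mixing up the two dtypes is the kind of thing this fixes. A tiny illustration (not the actual patch):

    import torch

    # classification tasks use integer class ids
    cls_labels = torch.tensor([0, 1, 1], dtype=torch.long)
    # sts-b labels are continuous scores, so they must be floats
    stsb_labels = torch.tensor([3.8, 1.2, 4.6], dtype=torch.float)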
-
Sam Shleifer authored
-
- 17 Aug, 2020 6 commits
-
-
Jim Regan authored
-
onepointconsulting authored
* Added first model card * Add metadata Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Ikram Ali authored
* [model_cards] roberta-urdu-small added. * [model_cards] typo fixed. * Tweak license format (yaml expects a simple string) Co-authored-by: Ikram Ali <mrikram1989> Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Jim Regan authored
-
Julien Chaumond authored
-
Suraj Patil authored
* tests
-