- 24 Aug, 2020 10 commits
-
-
Stas Bekman authored
* Create PULL_REQUEST_TEMPLATE.md Proposing to copy this neat feature from pytorch. This is a small template that lets a PR submitter state which issue the PR closes. * Update .github/PULL_REQUEST_TEMPLATE.md Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Add optuna hyperparameter search to Trainer * @julien-c suggestions Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Make compute_objective an arg function * Formatting * Rework to make it easier to add ray * Formatting * Initial support for Ray * Formatting * Polish and finalize * Add trial id to checkpoint with Ray * Smaller default * Use GPU in ray if available * Formatting * Fix test * Update install instruction Co-authored-by: Richard Liaw <rliaw@berkeley.edu> * Address review comments * Formatting post-merge Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
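For readers of this log, a minimal sketch of how the new hyperparameter search API can be called with the Optuna backend (Ray is analogous). The checkpoint name, datasets, and trial count below are illustrative assumptions, not part of the commit:

    # Sketch of the Trainer.hyperparameter_search API added in this commit.
    # Checkpoint name, datasets and n_trials are assumptions for illustration.
    from transformers import (AutoModelForSequenceClassification, Trainer,
                              TrainingArguments)

    def model_init():
        # A fresh model is built for every trial instead of passing `model=`.
        return AutoModelForSequenceClassification.from_pretrained("bert-base-uncased")

    trainer = Trainer(
        args=TrainingArguments(output_dir="hp_search"),
        model_init=model_init,
        train_dataset=train_dataset,  # assumed prepared elsewhere
        eval_dataset=eval_dataset,    # assumed prepared elsewhere
    )

    # backend="optuna" or backend="ray"; by default the eval loss is minimized.
    best_run = trainer.hyperparameter_search(direction="minimize", n_trials=10)
    print(best_run.hyperparameters)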
-
vblagoje authored
-
Sylvain Gugger authored
* Run new isort * More changes * Update CI, CONTRIBUTING and benchmarks
-
Teven authored
* Fixed DataCollatorForLanguageModeling + PermutationLanguageModeling not accepting lists of lists * Update data_collator.py * black was grumpy
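A short sketch of the behaviour this fix enables: the language-modeling collators now also accept plain Python lists of token ids rather than only tensors. The checkpoint name and token ids below are illustrative assumptions:

    # Sketch: after this fix the collator accepts lists of lists of token ids,
    # not just lists of torch tensors. The ids here are only approximate.
    from transformers import AutoTokenizer, DataCollatorForLanguageModeling

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

    examples = [
        [101, 7592, 2088, 102],        # roughly "[CLS] hello world [SEP]"
        [101, 2129, 2024, 2017, 102],  # roughly "[CLS] how are you [SEP]"
    ]
    batch = collator(examples)  # previously this call required torch tensors
    print(batch["input_ids"].shape, batch["labels"].shape)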
-
sgugger authored
-
Sylvain Gugger authored
* Don't reset the type of the dataset * Formatting * Update trainer.py Co-authored-by: Teven <teven.lescao@gmail.com>
-
Jared T Nielsen authored
-
Sagor Sarker authored
* Create README.md * Update README.md * Create README.md * Update README.md * added multiple codeswitch model
-
- 23 Aug, 2020 3 commits
-
-
Patrick von Platen authored
-
Sam Shleifer authored
-
Patrick von Platen authored
-
- 22 Aug, 2020 2 commits
-
-
Sagor Sarker authored
* Create README.md * Update README.md * Create README.md * Update README.md
-
Patrick von Platen authored
Model was trained with the wrong tokenizer. Retrained with the correct tokenizer - thanks for spotting it, @lhoestq!
-
- 21 Aug, 2020 8 commits
-
-
Manuel Romero authored
It works like a charm! Look at the output of the example code!
-
Suraj Patil authored
-
Patrick von Platen authored
@julien-c
-
Patrick von Platen authored
* add pegasus to docs * Update docs/source/model_summary.rst
-
Suraj Patil authored
* Added CamembertForCausalLM * add in __init__ and auto model * style * doc
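A minimal usage sketch of the class added here; the checkpoint name and the is_decoder flag follow the usual pattern for causal-LM heads on encoder models and are assumptions, not taken from the commit:

    # Sketch of loading the newly added CamembertForCausalLM.
    # "camembert-base" and is_decoder=True are assumptions for illustration.
    from transformers import CamembertConfig, CamembertForCausalLM, CamembertTokenizer

    config = CamembertConfig.from_pretrained("camembert-base", is_decoder=True)
    tokenizer = CamembertTokenizer.from_pretrained("camembert-base")
    model = CamembertForCausalLM.from_pretrained("camembert-base", config=config)

    inputs = tokenizer("Le camembert est", return_tensors="pt")
    outputs = model(**inputs)
    print(outputs[0].shape)  # prediction scores over the vocabulary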
-
josephrocca authored
-
Manuel Romero authored
-
Morgan Funtowicz authored
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
-
- 20 Aug, 2020 16 commits
-
-
Sylvain Gugger authored
* Add a classmethod to easily build a Trainer from nlp dataset and metric * Fix docstrings * Split train/eval * Formatting * Log dropped columns + docs * Authorize callable activations * Poc for auto activation * Be framework-agnostic * Formatting * Remove class method * Remove unnecessary code
-
Sam Shleifer authored
-
sgugger authored
-
Sylvain Gugger authored
* Move threshold up for flaky test with Electra * Update above as well
-
Ivan Dolgov authored
* xlnet fp16 bug fix * comment cast added * Update modeling_xlnet.py Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
-
Patrick von Platen authored
* fix distilbert * fix typo
-
Denisa Roberts authored
-
Joe Davison authored
* TFTrainer dataset doc & fix evaluation bug discussed in #6551 * add docstring to test/eval datasets
-
Sylvain Gugger authored
* Add tests to Trainer * Test if removing long breaks everything * Remove ugly hack * Fix distributed test * Use float for number of epochs
-
Joe Davison authored
* add intro to nlp lib + links * unique links...
-
sgugger authored
-
Prajjwal Bhargava authored
* removed redundant arg in prepare_inputs * made same change in prediction_loop
-
Romain Rigaux authored
Tested in a local build of the docs, e.g. just above https://huggingface.co/transformers/task_summary.html#causal-language-modeling

Copy will copy the full code, e.g.:

    for token in top_5_tokens:
        print(sequence.replace(tokenizer.mask_token, tokenizer.decode([token])))

instead of currently only:

    for token in top_5_tokens:
    >>> for token in top_5_tokens:
    ...     print(sequence.replace(tokenizer.mask_token, tokenizer.decode([token])))

Output of the example:

    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help reduce our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help increase our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help decrease our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help offset our carbon footprint.
    Distilled models are smaller than the models they mimic. Using them instead of the large versions would help improve our carbon footprint.

Docs for the option fix: https://sphinx-copybutton.readthedocs.io/en/latest/
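Based on the linked sphinx-copybutton docs, the option fix presumably amounts to settings like the following in the Sphinx conf.py; the exact regexp used in the repository is an assumption:

    # Sketch of sphinx-copybutton options (docs/source/conf.py) that make the
    # copy button strip ">>> " and "... " prompts so only runnable code is
    # copied. The exact pattern in the repository may differ.
    copybutton_prompt_text = r">>> |\.\.\. "
    copybutton_prompt_is_regexp = True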
-
Stas Bekman authored
-
Siddharth Jain authored
-
Oren Amsalem authored
-
- 19 Aug, 2020 1 commit
-
-
Sylvain Gugger authored
-