- 01 Sep, 2020 6 commits
-
-
Lysandre authored
-
Patrick von Platen authored
* fix generate for GPT2 Double Head * fix gpt2 double head model * fix bart / t5 * also add for no beam search * fix no beam search * fix encoder decoder * simplify t5 * simplify t5 * fix t5 tests * fix BART * fix transfo-xl * fix conflict * integrating Sylvain's and Sam's comments * fix tf past_decoder_key_values * fix enc dec test
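A minimal sketch of the code path this fix concerns: calling `generate()` on `GPT2DoubleHeadsModel` with caching enabled. The checkpoint name, prompt, and generation length are illustrative and not taken from the commit.

```python
from transformers import GPT2DoubleHeadsModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2DoubleHeadsModel.from_pretrained("gpt2")

# generate() reuses the cached past key/values between decoding steps
input_ids = tokenizer("The weather today", return_tensors="pt").input_ids
output_ids = model.generate(input_ids, max_length=20, use_cache=True)
print(tokenizer.decode(output_ids[0]))
```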
-
Funtowicz Morgan authored
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
-
Sam Shleifer authored
-
Sylvain Gugger authored
* Add logging doc * Formatting * Update docs/source/main_classes/logging.rst * Update src/transformers/utils/logging.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
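A short sketch of the centralized logging utility this commit documents, based on `src/transformers/utils/logging.py`; the chosen verbosity level is only an example.

```python
from transformers.utils import logging

logging.set_verbosity_info()            # raise verbosity above the library default
logger = logging.get_logger(__name__)   # module-level logger that honors that setting
logger.info("This message is now visible at INFO level")
```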
-
Stas Bekman authored
We had it added for one job; please add it to all pytest jobs - we need the output of which tests were run to debug the codecov issue. Thank you!
-
- 31 Aug, 2020 15 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Funtowicz Morgan authored
* Update ONNX notebook to include section on quantization. Signed-off-by: Morgan Funtowicz <morgan@huggingface.co> * Addressing ONNX team comments
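A hedged sketch of the dynamic quantization step that the new notebook section covers; the file paths are placeholders, and the call follows the `onnxruntime.quantization` API.

```python
from onnxruntime.quantization import QuantType, quantize_dynamic

quantize_dynamic(
    "onnx/bert-base-cased.onnx",            # previously exported ONNX model (placeholder path)
    "onnx/bert-base-cased-quantized.onnx",  # int8-quantized output (placeholder path)
    weight_type=QuantType.QInt8,            # quantize weights to 8-bit integers
)
```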
-
Sylvain Gugger authored
* Split the run_hp_search by backend * Unused import
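A hedged sketch of the entry point whose backend-specific helpers are split here; `trainer` is assumed to be a `Trainer` configured with a `model_init`, and the trial count is illustrative.

```python
# Assumes `trainer` is an existing transformers.Trainer built with a model_init
best_run = trainer.hyperparameter_search(
    direction="minimize",
    backend="optuna",   # or "ray"; each backend now has its own run_hp_search_* helper
    n_trials=10,
)
print(best_run.hyperparameters)
```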
-
krfricke authored
* Introduce HPO checkpointing for PBT * Moved checkpoint saving * Fixed checkpoint subdir pass * Fixed style * Enable/disable checkpointing, check conditions for various tune schedulers incl. PBT * Adjust number of GPUs to number of jobs * Avoid mode pickling in ray * Move hp search to integrations
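A hedged sketch of what PBT checkpointing enables with the Ray backend; extra keyword arguments are forwarded to `ray.tune`, and the scheduler settings and the `trainer` object are assumptions, not taken from the commit.

```python
from ray.tune.schedulers import PopulationBasedTraining

# Assumes `trainer` is an existing transformers.Trainer built with a model_init
pbt = PopulationBasedTraining(
    time_attr="training_iteration",
    metric="objective",
    mode="min",
    hyperparam_mutations={"learning_rate": [1e-5, 3e-5, 5e-5]},
)

trainer.hyperparameter_search(
    backend="ray",
    n_trials=4,
    scheduler=pbt,           # PBT relies on checkpoints to exploit/explore trials
    keep_checkpoints_num=1,  # forwarded to ray.tune; limits checkpoints kept per trial
)
```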
-
Sam Shleifer authored
-
Jin Young (Daniel) Sohn authored
* Only access loss tensor every logging_steps
* tensor.item() was being called every step. This must not be done for XLA:TPU tensors as it's terrible for performance, causing TPU<>CPU communication at each step. On RoBERTa MLM, for example, it reduces step time by 30%; the gain should be larger for models/tasks with smaller step times (see the sketch after this entry).
* Train batch size was not correct in case a user uses the `per_gpu_train_batch_size` flag
* Avg reduce loss across eval shards
* Fix style (#6803)
* t5 model should make decoder_attention_mask (#6800)
* [s2s] Test hub configs in self-scheduled CI (#6809)
* [s2s] round runtime in run_eval (#6798)
* Pegasus finetune script: add --adafactor (#6811)
* [bart] rename self-attention -> attention (#6708)
* [tests] fix typos in inputs (#6818)
* Fixed open in colab link (#6825)
* Add model card for singbert lite. Update widget for singbert and singbert-large. (#6827)
* BR_BERTo model card (#6793)
* clearly indicate shuffle=False (#6312) * Clarify shuffle * clarify shuffle Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
* [s2s README] Add more dataset download instructions (#6737)
* Style
* Patch logging issue
* Set default logging level to `WARNING` instead of `INFO`
* TF Flaubert w/ pre-norm (#6841)
* Dataset and DataCollator for BERT Next Sentence Prediction (NSP) task (#6644) * add datacollator and dataset for next sentence prediction task * bug fix (numbers of special tokens & truncate sequences) * bug fix (+ dict inputs support for data collator) * add padding for nsp data collator; renamed cached files to avoid conflict. * add test for nsp data collator * Style Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
* Fix in Adafactor docstrings (#6845)
* Fix resuming training for Windows (#6847)
* Only access loss tensor every logging_steps
* tensor.item() was being called every step. This must not be done for XLA:TPU tensors as it's terrible for performance, causing TPU<>CPU communication at each step. On RoBERTa MLM, for example, it reduces step time by 30%; the gain should be larger for models/tasks with smaller step times.
* Train batch size was not correct in case a user uses the `per_gpu_train_batch_size` flag
* Avg reduce loss across eval shards
* comments
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Thomas Ashish Cherian <6967017+PandaWhoCodes@users.noreply.github.com>
Co-authored-by: Zane Lim <zyuanlim@gmail.com>
Co-authored-by: Rodolfo De Nadai <rdenadai@gmail.com>
Co-authored-by: xujiaze13 <37360975+xujiaze13@users.noreply.github.com>
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Huang Lianzhe <hlz@pku.edu.cn>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
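A self-contained sketch of the logging pattern described in the entry above: keep the running loss as a tensor and only call `.item()` every `logging_steps`, so XLA/TPU runs do not synchronize with the host at each step. The loop and loss values are stand-ins, not Trainer code.

```python
import torch

logging_steps = 50
tr_loss = torch.zeros(1)       # accumulated loss stays a tensor on the device
logging_loss_scalar = 0.0

for global_step in range(1, 201):
    step_loss = torch.rand(1)  # stand-in for the per-step training loss
    tr_loss += step_loss       # tensor accumulation: no device<->host sync here

    if global_step % logging_steps == 0:
        # .item() forces a TPU<>CPU transfer, so it only happens every logging_steps
        tr_loss_scalar = tr_loss.item()
        print({"step": global_step,
               "loss": (tr_loss_scalar - logging_loss_scalar) / logging_steps})
        logging_loss_scalar = tr_loss_scalar
```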
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Huang Lianzhe authored
* add datacollator and dataset for next sentence prediction task * bug fix (numbers of special tokens & truncate sequences) * bug fix (+ dict inputs support for data collator) * add padding for nsp data collator; renamed cached files to avoid conflict. * add test for nsp data collator * Style
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
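A rough sketch of the dataset/collator pair introduced in this commit; the class names come from the PR, while the tokenizer checkpoint, file path, and block size are placeholders and further constructor arguments are omitted.

```python
from transformers import (
    BertTokenizer,
    DataCollatorForNextSentencePrediction,
    TextDatasetForNextSentencePrediction,
)

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# Expects one sentence per line, with blank lines separating documents (placeholder path)
dataset = TextDatasetForNextSentencePrediction(
    tokenizer=tokenizer,
    file_path="corpus.txt",
    block_size=128,
)
data_collator = DataCollatorForNextSentencePrediction(tokenizer=tokenizer)
```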
-
Lysandre Debut authored
-
Lysandre authored
-
Lysandre authored
-
- 30 Aug, 2020 6 commits
-
-
Sam Shleifer authored
-
xujiaze13 authored
* Clarify shuffle * clarify shuffle Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
-
Rodolfo De Nadai authored
-
Zane Lim authored
-
Thomas Ashish Cherian authored
-
Stas Bekman authored
-
- 29 Aug, 2020 3 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Sam Shleifer authored
-
- 28 Aug, 2020 9 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Sam Shleifer authored
* broken test * batch parity * tests pass * boom boom * boom boom * split out bart tokenizer tests * fix tests * boom boom * Fixed dataset bug * Fix marian * Undo extra * Get marian working * Fix t5 tok tests * Test passing * Cleanup * better assert msg * require torch * Fix mbart tests * undo extra decoder_attn_mask change * Fix import * pegasus tokenizer can ignore src_lang kwargs * unused kwarg test cov * boom boom * add todo for pegasus issue * cover one word translation edge case * Cleanup * doc
-
RafaelWO authored
* Improved tokenization with sacremoses * The TransfoXLTokenizer is now using sacremoses for tokenization * Added tokenization of comma-separated and floating point numbers. * Removed prepare_for_tokenization() from tokenization_transfo_xl.py because punctuation is handled by sacremoses * Added corresponding tests * Removed test comparing TransfoXLTokenizer and TransfoXLTokenizerFast * Added deprecation warning to TransfoXLTokenizerFast * isort change
Co-authored-by: Teven <teven.lescao@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
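A hedged illustration of the sacremoses Moses tokenizer that `TransfoXLTokenizer` now builds on; the extra handling of comma-separated and floating point numbers lives in the transformers tokenizer itself, on top of this step. The input sentence is arbitrary.

```python
from sacremoses import MosesTokenizer

moses = MosesTokenizer(lang="en")
print(moses.tokenize("Hello World! How are you?"))
# expected: ['Hello', 'World', '!', 'How', 'are', 'you', '?']
```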
-
Ahmed Elnaggar authored
-
Stas Bekman authored
`make style` with `black` < 20.8b1 is a no-go (in case some other package forced a lower version), so make the required version explicit to avoid confusion
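A hedged sketch of making the requirement explicit, e.g. in the `setup.py` extras; the extras key and companion packages are assumptions, only the `black >= 20.8b1` pin comes from this commit.

```python
# In setup.py (sketch): pin the formatter so `make style` cannot silently run an older black
extras = {}
extras["quality"] = ["black >= 20.8b1", "isort", "flake8"]
```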
-
Sam Shleifer authored
-
Stas Bekman authored
-
- 27 Aug, 2020 1 commit
-
-
Lysandre authored
-