Commits · 5c00918681d6b4027701eb46cea8f795da0d4064 · chenpangpang / transformers

23 Apr, 2021 10 commits

added support for exporting of t5 to onnx with past_key_values (#10651) · 5c009186
Kiran R authored Apr 23, 2021

5c009186
push (#11400) · 50f4539b
Patrick von Platen authored Apr 23, 2021

50f4539b

Sylvain Gugger authored Apr 23, 2021



* Initial support for upload to hub

* push -> upload

* Fixes + examples

* Fix torchhub test

* Torchhub test I hate you

* push_model_to_hub -> push_to_hub

* Apply mixin to other pretrained models

* Remove ABC inheritance

* Add tests

* Typo

* Run tests

* Install git-lfs

* Change approach

* Add push_to_hub to all

* Staging test suite

* Typo

* Maybe like this?

* More deps

* Cache

* Adapt name

* Quality

* MOAR tests

* Put it in testing_utils

* Docs + torchhub last hope

* Styling

* Wrong method

* Typos

* Update src/transformers/file_utils.py
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Address review comments

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

bf2e0cf7

Fixed trainer total_flos relaoding in distributed mode (#11383) · 7bc86bea
Teven authored Apr 23, 2021
```
* Fixed trainer total_flos relaoding in distributed mode

* logging flos at the end of training
```
7bc86bea
make blenderbot test slow (#11395) · 74e84f1f
Patrick von Platen authored Apr 23, 2021

74e84f1f
fixed typos (#11391) · c3d6f339
Yoshitomo Matsubara authored Apr 23, 2021

c3d6f339
Fix typo in text (#11396) · a90d3f18
Max Del authored Apr 23, 2021

a90d3f18
correct conversion (#11394) · 2dc2d79a
Patrick von Platen authored Apr 23, 2021

2dc2d79a
correct typo (#11393) · b48cf712
Patrick von Platen authored Apr 23, 2021

b48cf712

[Flax] Big FlaxBert Refactor (#11364) · 8c9b5fcb

Patrick von Platen authored Apr 23, 2021

* improve flax

* refactor

* typos

* Update src/transformers/modeling_flax_utils.py

* Apply suggestions from code review

* Update src/transformers/modeling_flax_utils.py

* fix typo

* improve error tolerance

* typo

* correct nasty saving bug

* fix from pretrained

* correct tree map

* add note

* correct weight tying

8c9b5fcb

22 Apr, 2021 7 commits
- Fix Trainer with remove_unused_columns=False (#11382) · 3ed5e97b
  Sylvain Gugger authored Apr 22, 2021
```
* Fix Trainer with remove_unused_columns=False

* Typo
```
  3ed5e97b
- Fix typo (#11369) · 0f3ad150
  PenutChen authored Apr 22, 2021
  
  0f3ad150
- Correctly cast num_train_epochs to int (#11379) · 26173960
  Matt authored Apr 22, 2021
  
  26173960
- Add space (#11373) · 881945c0
  Takuya Makino authored Apr 22, 2021
  
  881945c0
- [run_translation.py] fix typo (#11372) · 5b5e4ca3
  johnson7788 authored Apr 22, 2021
```
fix typo
Co-authored-by: johnson <johnson@github.com>
```
  5b5e4ca3
- [Flax] Correct typo (#11374) · 58d8795d
  Patrick von Platen authored Apr 22, 2021
```
* finish

* fix copy
```
  58d8795d
- [Wav2Vec2] Fix special tokens for Wav2Vec2 tokenizer (#11349) · 880154d2
  Patrick von Platen authored Apr 22, 2021
```
* fix wav2vec2 tok

* up
```
  880154d2
21 Apr, 2021 13 commits

Add in torchhub · 6f14eab5
Sylvain Gugger authored Apr 21, 2021

6f14eab5
Add huggingface_hub dep for #11328 · ff26f8ee
Sylvain Gugger authored Apr 21, 2021

ff26f8ee

Fix token_type_ids error for big_bird model. (#11355) · 5e04d708

wlhgtc authored Apr 22, 2021



* MOD: fit chinese wwm to new datasets

* MOD: move wwm to new folder

* MOD: formate code

* Styling

* MOD add param and recover trainer

* MOD: add token_type_ids method for big bird

* MOD: format code

* MOD: format code
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

5e04d708

[contributing doc] explain/link to good first issue (#11346) · 5aaf5aac

Stas Bekman authored Apr 21, 2021



* explain/link to good first issue

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

5aaf5aac

Move old TF text classification script to legacy (#11361) · 6fe79e57
Matt authored Apr 21, 2021
```
And update README to explain the work-in-progress!
```
6fe79e57
Remove boiler plate code (#11340) · 50595a33
Patrick von Platen authored Apr 21, 2021
```
* remove boiler plate code

* adapt roberta

* correct docs

* finish refactor
```
50595a33
Merge new TF example script (#11360) · ac588594
Matt authored Apr 21, 2021
```
First of the new and more idiomatic TF examples!
```
ac588594
[testing doc] bring doc up to date (#11359) · 9f72e8f4
Stas Bekman authored Apr 21, 2021
```
* bring doc up to date

* fix
```
9f72e8f4

Extract metric_key_prefix during NotebookProgressCallback.on_evaluate (#11347) · 41f3133a

lewtun authored Apr 21, 2021

* Pass metric_key_prefix as kwarg to on_evaluate

* Replace eval_loss with metric_key_prefix_loss

* Default to "eval" if metric_key_prefix not in kwargs

* Add kwargs to CallbackHandler.on_evaluate signature

* Revert "Add kwargs to CallbackHandler.on_evaluate signature"

This reverts commit 8d4c85ed512f558f7579d36771e907b3379947b7.

* Revert "Pass metric_key_prefix as kwarg to on_evaluate"

This reverts commit 7766bfe2718601230ae593d37b1317bd53cfc075.

* Extract metric_key_prefix from metrics

41f3133a

Examples reorg (#11350) · dabeb152

Sylvain Gugger authored Apr 21, 2021



* Base move

* Examples reorganization

* Update references

* Put back test data

* Move conftest

* More fixes

* Move test data to test fixtures

* Update path

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

dabeb152

[deepspeed] fix resume from checkpoint (#11352) · ca7ff64f

Stas Bekman authored Apr 21, 2021

This PR fixes a bug that most likely somehow got exposed (not caused) by https://github.com/huggingface/transformers/pull/11318 - surprisingly the same test worked just fine before that other PR.

ca7ff64f

Honor contributors to models (#11329) · 74712e22
Sylvain Gugger authored Apr 21, 2021
```
* Honor contributors to models

* Fix typo

* Address review comments

* Add more authors
```
74712e22

Removed `max_length` from being mandatory within `generate`. (#11314) · aad95c7c

Nicolas Patry authored Apr 21, 2021

* Removed `max_length` from being mandatory within `generate`.

- Moving on to fully using `StoppingCriteria` for `greedy` and `sample`
modes.
- `max_length` still used for `beam_search` and `group_beam_search`
(Follow up PR)
- Fixes a bug with MaxLengthStoppingCriteria (we should stop as soon a
we hit the max_length, the comparison needs to be or equal, that affects
the tests).
- Added options to use `logits_processor` and `stopping_criteria`
directly within `generate` function (so some users can define their own
`logits_processor` and `stopping_criteria`).
- Modified the backward compat tests to make sure we issue a warning.

* Fix `max_length` argument in `generate`.

* Moving validate to being functional.

- Renamed `smax_length` to `stoppping_max_length`.

* Removing `logits_processor` and `stopping_criteria` from `generate`
arguments.

* Deepcopy.

* Fix global variable name.

aad95c7c

20 Apr, 2021 6 commits

Add an error message that fires when Reformer is not in training mode, but one... · 95dab34d
Yusuke Mori authored Apr 21, 2021
```
Add an error message that fires when Reformer is not in training mode, but one runs .backward() (#11117)
```
95dab34d
Update to use datasets remove_cloumns method (#11343) · f1b938fd
Sylvain Gugger authored Apr 20, 2021
```
* Update to use datasets remove_cloumns method

* Quality
```
f1b938fd
[GPTNeo] create local attention mask ones (#11335) · cfd2eaa8
Suraj Patil authored Apr 20, 2021
```
* create local attention mask ones

* remove old method, address patricks comment
```
cfd2eaa8
[Generate] Remove outdated code (#11331) · f464f10a
Patrick von Platen authored Apr 20, 2021
```
* remove update function

* update

* refactor more

* refactor
```
f464f10a

Added translation example script (#11196) · bfd83c17

rajvi-k authored Apr 20, 2021

* initial changes

* modified evaluation

* updated evaluation

* updated evaluation on text translation example script

* added translation example script

* Formatted translation example script

* Reformatted translation example

* Fixed evaluation bug and added support for other tokenisers

* Fixed evaluation bug and added support for other tokenisers

* Added translation example script

* Formatted summarization example script

* Removed typos from summarization example script

bfd83c17

Load checkpoint without re-creating the model (#11318) · c0328a6c
Sylvain Gugger authored Apr 19, 2021

c0328a6c

19 Apr, 2021 4 commits
- [Trainer] Add a progress bar for batches skipped (#11324) · 95037a16
  Sylvain Gugger authored Apr 19, 2021
  
  95037a16
- [Trainer] fix the placement on device with fp16_full_eval (#11322) · 95ffbe16
  Stas Bekman authored Apr 19, 2021
```
* fix the placement on device with fp16_full_eval

* deepspeed never goes on device
```
  95ffbe16
- modify double considering special tokens in `language_modeling.py` (#11275) · 3981ce3d
  TAE YOUNGDON authored Apr 20, 2021
```
* Update language_modeling.py

in "class TextDatasetForNextSentencePrediction(Dataset)", double considering "self.tokenizer.num_special_tokens_to_add(pair=True)" 

so, i remove self.block_size, and add parameter for "def create_examples_from_document". like "class LineByLineWithSOPTextDataset" do

* Update language_modeling.py
```
  3981ce3d
- move device statements outside if statements (#11292) · 5a34d8d9
  e authored Apr 19, 2021
  
  5a34d8d9