Commits · 8c9b5fcbaf27cbf1aa781670d598cf74c07b7e88 · chenpangpang / transformers

23 Apr, 2021 1 commit

[Flax] Big FlaxBert Refactor (#11364) · 8c9b5fcb

Patrick von Platen authored Apr 23, 2021

* improve flax

* refactor

* typos

* Update src/transformers/modeling_flax_utils.py

* Apply suggestions from code review

* Update src/transformers/modeling_flax_utils.py

* fix typo

* improve error tolerance

* typo

* correct nasty saving bug

* fix from pretrained

* correct tree map

* add note

* correct weight tying


22 Apr, 2021 7 commits
- Fix Trainer with remove_unused_columns=False (#11382) · 3ed5e97b
  Sylvain Gugger authored Apr 22, 2021
```
* Fix Trainer with remove_unused_columns=False

* Typo
```
- Fix typo (#11369) · 0f3ad150
  PenutChen authored Apr 22, 2021
  
- Correctly cast num_train_epochs to int (#11379) · 26173960
  Matt authored Apr 22, 2021
  
- Add space (#11373) · 881945c0
  Takuya Makino authored Apr 22, 2021
  
- [run_translation.py] fix typo (#11372) · 5b5e4ca3
  johnson7788 authored Apr 22, 2021
```
fix typo
Co-authored-by: johnson <johnson@github.com>
```
- [Flax] Correct typo (#11374) · 58d8795d
  Patrick von Platen authored Apr 22, 2021
```
* finish

* fix copy
```
- [Wav2Vec2] Fix special tokens for Wav2Vec2 tokenizer (#11349) · 880154d2
  Patrick von Platen authored Apr 22, 2021
```
* fix wav2vec2 tok

* up
```
21 Apr, 2021 13 commits

Add in torchhub · 6f14eab5
Sylvain Gugger authored Apr 21, 2021

Add huggingface_hub dep for #11328 · ff26f8ee
Sylvain Gugger authored Apr 21, 2021


Fix token_type_ids error for big_bird model. (#11355) · 5e04d708

wlhgtc authored Apr 22, 2021



* MOD: fit chinese wwm to new datasets

* MOD: move wwm to new folder

* MOD: format code

* Styling

* MOD add param and recover trainer

* MOD: add token_type_ids method for big bird

* MOD: format code

* MOD: format code
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>


[contributing doc] explain/link to good first issue (#11346) · 5aaf5aac

Stas Bekman authored Apr 21, 2021



* explain/link to good first issue

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


Move old TF text classification script to legacy (#11361) · 6fe79e57
Matt authored Apr 21, 2021
```
And update README to explain the work-in-progress!
```
Remove boiler plate code (#11340) · 50595a33
Patrick von Platen authored Apr 21, 2021
```
* remove boiler plate code

* adapt roberta

* correct docs

* finish refactor
```
Merge new TF example script (#11360) · ac588594
Matt authored Apr 21, 2021
```
First of the new and more idiomatic TF examples!
```
[testing doc] bring doc up to date (#11359) · 9f72e8f4
Stas Bekman authored Apr 21, 2021
```
* bring doc up to date

* fix
```

Extract metric_key_prefix during NotebookProgressCallback.on_evaluate (#11347) · 41f3133a

lewtun authored Apr 21, 2021

* Pass metric_key_prefix as kwarg to on_evaluate

* Replace eval_loss with metric_key_prefix_loss

* Default to "eval" if metric_key_prefix not in kwargs

* Add kwargs to CallbackHandler.on_evaluate signature

* Revert "Add kwargs to CallbackHandler.on_evaluate signature"

This reverts commit 8d4c85ed512f558f7579d36771e907b3379947b7.

* Revert "Pass metric_key_prefix as kwarg to on_evaluate"

This reverts commit 7766bfe2718601230ae593d37b1317bd53cfc075.

* Extract metric_key_prefix from metrics
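
As a rough illustration of the final bullet, the prefix can be recovered from the metric names themselves instead of being passed through the callback signature. This is a minimal sketch, not the code from the PR: `extract_metric_key_prefix` is a hypothetical helper, and it assumes the metrics dict contains a key of the form `<prefix>_loss`, falling back to `"eval"` otherwise.

```python
import re
from typing import Dict


def extract_metric_key_prefix(metrics: Dict[str, float], default: str = "eval") -> str:
    """Infer the metric prefix from a key ending in `_loss` (hypothetical helper)."""
    for key in metrics:
        if key.endswith("_loss"):
            # Strip the trailing `_loss` to leave the prefix, e.g. "test_loss" -> "test".
            return re.sub(r"_loss$", "", key)
    # Assumption: fall back to "eval" when no loss key is present.
    return default


prefix_from_loss = extract_metric_key_prefix({"test_loss": 0.42, "test_accuracy": 0.9})
prefix_fallback = extract_metric_key_prefix({"accuracy": 0.9})
print(prefix_from_loss, prefix_fallback)
```

Deriving the prefix from the metrics keeps the `on_evaluate` signature unchanged, which matches the direction the reverts above suggest.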


Examples reorg (#11350) · dabeb152

Sylvain Gugger authored Apr 21, 2021



* Base move

* Examples reorganization

* Update references

* Put back test data

* Move conftest

* More fixes

* Move test data to test fixtures

* Update path

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


[deepspeed] fix resume from checkpoint (#11352) · ca7ff64f

Stas Bekman authored Apr 21, 2021

This PR fixes a bug that most likely somehow got exposed (not caused) by https://github.com/huggingface/transformers/pull/11318 - surprisingly the same test worked just fine before that other PR.


Honor contributors to models (#11329) · 74712e22
Sylvain Gugger authored Apr 21, 2021
```
* Honor contributors to models

* Fix typo

* Address review comments

* Add more authors
```

Removed `max_length` from being mandatory within `generate`. (#11314) · aad95c7c

Nicolas Patry authored Apr 21, 2021

* Removed `max_length` from being mandatory within `generate`.

- Moving on to fully using `StoppingCriteria` for `greedy` and `sample`
modes.
- `max_length` still used for `beam_search` and `group_beam_search`
(Follow up PR)
- Fixes a bug with MaxLengthStoppingCriteria (we should stop as soon as
we hit the max_length, so the comparison needs to be greater-than-or-equal;
this affects the tests).
- Added options to use `logits_processor` and `stopping_criteria`
directly within `generate` function (so some users can define their own
`logits_processor` and `stopping_criteria`).
- Modified the backward compat tests to make sure we issue a warning.

* Fix `max_length` argument in `generate`.

* Moving validate to being functional.

- Renamed `smax_length` to `stopping_max_length`.

* Removing `logits_processor` and `stopping_criteria` from `generate`
arguments.

* Deepcopy.

* Fix global variable name.
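
The off-by-one fix to `MaxLengthStoppingCriteria` described in this commit can be illustrated with a small standalone sketch. It does not use the library's classes: `MaxLengthCriterion` and `greedy_loop` are hypothetical names, and token ids are plain Python lists rather than tensors. The only point is that the check has to be `>=`, so generation stops as soon as the sequence reaches `max_length`.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class MaxLengthCriterion:
    """Hypothetical stand-in for a max-length stopping criterion."""

    max_length: int

    def __call__(self, input_ids: List[int]) -> bool:
        # Stop as soon as the sequence has reached max_length.
        # A strict `>` here would let one extra token through.
        return len(input_ids) >= self.max_length


def greedy_loop(
    prompt: List[int],
    next_token: Callable[[List[int]], int],
    criteria: List[Callable[[List[int]], bool]],
) -> List[int]:
    """Append tokens until any stopping criterion fires (illustrative only)."""
    ids = list(prompt)
    while not any(criterion(ids) for criterion in criteria):
        ids.append(next_token(ids))
    return ids


# Toy "model": always emits the previous token plus one.
output = greedy_loop([0, 1], lambda ids: ids[-1] + 1, [MaxLengthCriterion(max_length=5)])
print(output)
```

Passing the criteria in as a list mirrors the idea of user-defined stopping criteria mentioned above, though the real interface may differ.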


20 Apr, 2021 6 commits

Add an error message that fires when Reformer is not in training mode, but one... · 95dab34d
Yusuke Mori authored Apr 21, 2021
```
Add an error message that fires when Reformer is not in training mode, but one runs .backward() (#11117)
```
Update to use datasets remove_columns method (#11343) · f1b938fd
Sylvain Gugger authored Apr 20, 2021
```
* Update to use datasets remove_columns method

* Quality
```
[GPTNeo] create local attention mask ones (#11335) · cfd2eaa8
Suraj Patil authored Apr 20, 2021
```
* create local attention mask ones

* remove old method, address Patrick's comment
```
[Generate] Remove outdated code (#11331) · f464f10a
Patrick von Platen authored Apr 20, 2021
```
* remove update function

* update

* refactor more

* refactor
```

Added translation example script (#11196) · bfd83c17

rajvi-k authored Apr 20, 2021

* initial changes

* modified evaluation

* updated evaluation

* updated evaluation on text translation example script

* added translation example script

* Formatted translation example script

* Reformatted translation example

* Fixed evaluation bug and added support for other tokenisers

* Fixed evaluation bug and added support for other tokenisers

* Added translation example script

* Formatted summarization example script

* Removed typos from summarization example script


Load checkpoint without re-creating the model (#11318) · c0328a6c
Sylvain Gugger authored Apr 19, 2021


19 Apr, 2021 4 commits
- [Trainer] Add a progress bar for batches skipped (#11324) · 95037a16
  Sylvain Gugger authored Apr 19, 2021
  
- [Trainer] fix the placement on device with fp16_full_eval (#11322) · 95ffbe16
  Stas Bekman authored Apr 19, 2021
```
* fix the placement on device with fp16_full_eval

* deepspeed never goes on device
```
- modify double considering special tokens in `language_modeling.py` (#11275) · 3981ce3d
  TAE YOUNGDON authored Apr 20, 2021
```
* Update language_modeling.py

In "class TextDatasetForNextSentencePrediction(Dataset)", "self.tokenizer.num_special_tokens_to_add(pair=True)" was counted twice.

So I removed self.block_size and added a parameter to "def create_examples_from_document", as "class LineByLineWithSOPTextDataset" does.

* Update language_modeling.py
```
- move device statements outside if statements (#11292) · 5a34d8d9
  e authored Apr 19, 2021
  
16 Apr, 2021 5 commits

Trainer support for IterableDataset for evaluation and predict (#11286) · d9c62047

Sylvain Gugger authored Apr 16, 2021

* Bulk of the work

* Polish and tests

* Update QA Trainer

* Avoid breaking the predict method

* Deprecation warnings

* Store real eval dataloader

* Get eval dataset reference before wrap


Fix failing workflows · e783ea73
Lysandre authored Apr 16, 2021


Enabling multilingual models for translation pipelines. (#10536) · 92970c0c

Nicolas Patry authored Apr 16, 2021



* [WIP] Enabling multilingual models for translation pipelines.

* decoder_input_ids -> forced_bos_token_id

* Improve docstring.

* Rebase

* Fixing 2 bugs

- Type token_ids coming from `_parse_and_tokenize`
- Wrong index from tgt_lang.

* Fixing black version.

* Adding tests for _build_translation_inputs and add them for all
tokenizers.

* Mbart actually puts the lang code at the end.

* Fixing m2m100.

* Adding TF support to `deep_round`.

* Update src/transformers/pipelines/text2text_generation.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Adding one line comment.

* Fixing M2M100 `_build_translation_input_ids`, and fix the call site.

* Fixing tests + deep_round -> nested_simplify
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


Workflow fixes (#11270) · 5254220e
Lysandre Debut authored Apr 15, 2021

update dependency_versions_table (#11273) · dfc6dd85
Stas Bekman authored Apr 15, 2021
```
Missed updating this when the version was bumped.
```

15 Apr, 2021 3 commits
- Tokenizer fast save (#11234) · 2550b41a
  Sylvain Gugger authored Apr 15, 2021
```
* Save fast tokenizers in both formats

* Fix for HerBERT

* Proper fix

* Properly test new behavior
```
- Support for set_epoch (#11258) · 6e1ee47b
  Sylvain Gugger authored Apr 15, 2021
  
- Adding pipeline task aliases. (#11247) · c3fcba32
  Nicolas Patry authored Apr 15, 2021
```
* Adding task aliases and adding `token-classification` and
`text-classification` tasks.

* Cleaning docstring.
```
14 Apr, 2021 1 commit

Trainer iterable dataset (#11254) · aaaed56f

Sylvain Gugger authored Apr 14, 2021



* IterableDatasetShard

* Test and integration in Trainer

* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Style
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
