Commits · f1b938fda81d4b9e8ab435cb7f37f71c9b7cbb1e · chenpangpang / transformers

20 Apr, 2021 5 commits

Update to use datasets remove_columns method (#11343) · f1b938fd
Sylvain Gugger authored Apr 20, 2021
```
* Update to use datasets remove_columns method

* Quality
```
[GPTNeo] create local attention mask ones (#11335) · cfd2eaa8
Suraj Patil authored Apr 20, 2021
```
* create local attention mask ones

* remove old method, address Patrick's comment
```
[Generate] Remove outdated code (#11331) · f464f10a
Patrick von Platen authored Apr 20, 2021
```
* remove update function

* update

* refactor more

* refactor
```

Added translation example script (#11196) · bfd83c17

rajvi-k authored Apr 20, 2021

* initial changes

* modified evaluation

* updated evaluation

* updated evaluation on text translation example script

* added translation example script

* Formatted translation example script

* Reformatted translation example

* Fixed evaluation bug and added support for other tokenisers

* Fixed evaluation bug and added support for other tokenisers

* Added translation example script

* Formatted summarization example script

* Removed typos from summarization example script


Load checkpoint without re-creating the model (#11318) · c0328a6c
Sylvain Gugger authored Apr 19, 2021


19 Apr, 2021 4 commits
- [Trainer] Add a progress bar for batches skipped (#11324) · 95037a16
  Sylvain Gugger authored Apr 19, 2021
  
- [Trainer] fix the placement on device with fp16_full_eval (#11322) · 95ffbe16
  Stas Bekman authored Apr 19, 2021
```
* fix the placement on device with fp16_full_eval

* deepspeed never goes on device
```
- modify double considering special tokens in `language_modeling.py` (#11275) · 3981ce3d
  TAE YOUNGDON authored Apr 20, 2021
```
* Update language_modeling.py

In "class TextDatasetForNextSentencePrediction(Dataset)", "self.tokenizer.num_special_tokens_to_add(pair=True)" was counted twice.

So I removed self.block_size and added a parameter to "def create_examples_from_document", as "class LineByLineWithSOPTextDataset" does.

* Update language_modeling.py
```
- move device statements outside if statements (#11292) · 5a34d8d9
  e authored Apr 19, 2021
  
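
The double counting described in commit 3981ce3d (#11275) can be illustrated with a minimal sketch. The function names and the numbers below are hypothetical and are not taken from `language_modeling.py`; the sketch only assumes that the special-token count was subtracted from the block size in two separate places.

```python
# Hypothetical illustration; not the actual code from language_modeling.py.

def max_tokens_double_counted(block_size: int, num_special: int) -> int:
    # Buggy pattern: the constructor already reserves room for special tokens...
    stored_block_size = block_size - num_special
    # ...and the example builder then subtracts them a second time.
    return stored_block_size - num_special


def max_tokens_fixed(block_size: int, num_special: int) -> int:
    # Fixed pattern: the block size is passed in as a parameter and the
    # special tokens are subtracted exactly once.
    return block_size - num_special


if __name__ == "__main__":
    block_size, num_special = 128, 3  # assumed values, for illustration only
    print(max_tokens_double_counted(block_size, num_special))  # 122
    print(max_tokens_fixed(block_size, num_special))  # 125
```

With these assumed values, the double-counted variant leaves room for fewer content tokens per example than the block size actually allows.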
16 Apr, 2021 5 commits

Trainer support for IterableDataset for evaluation and predict (#11286) · d9c62047

Sylvain Gugger authored Apr 16, 2021

* Bulk of the work

* Polish and tests

* Update QA Trainer

* Avoid breaking the predict method

* Deprecation warnings

* Store real eval dataloader

* Get eval dataset reference before wrap


Fix failing workflows · e783ea73
Lysandre authored Apr 16, 2021


Enabling multilingual models for translation pipelines. (#10536) · 92970c0c

Nicolas Patry authored Apr 16, 2021



* [WIP] Enabling multilingual models for translation pipelines.

* decoder_input_ids -> forced_bos_token_id

* Improve docstring.

* Rebase

* Fixing 2 bugs

- Type token_ids coming from `_parse_and_tokenize`
- Wrong index from tgt_lang.

* Fixing black version.

* Adding tests for _build_translation_inputs and add them for all
tokenizers.

* Mbart actually puts the lang code at the end.

* Fixing m2m100.

* Adding TF support to `deep_round`.

* Update src/transformers/pipelines/text2text_generation.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Adding one line comment.

* Fixing M2M100 `_build_translation_input_ids`, and fix the call site.

* Fixing tests + deep_round -> nested_simplify
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


Workflow fixes (#11270) · 5254220e
Lysandre Debut authored Apr 15, 2021

update dependency_versions_table (#11273) · dfc6dd85
Stas Bekman authored Apr 15, 2021
```
Missed updating this when bumping the version.
```

15 Apr, 2021 3 commits
- Tokenizer fast save (#11234) · 2550b41a
  Sylvain Gugger authored Apr 15, 2021
```
* Save fast tokenizers in both formats

* Fix for HerBERT

* Proper fix

* Properly test new behavior
```
- Support for set_epoch (#11258) · 6e1ee47b
  Sylvain Gugger authored Apr 15, 2021
  
- Adding pipeline task aliases. (#11247) · c3fcba32
  Nicolas Patry authored Apr 15, 2021
```
* Adding task aliases and adding `token-classification` and
`text-classification` tasks.

* Cleaning docstring.
```
14 Apr, 2021 10 commits

Trainer iterable dataset (#11254) · aaaed56f

Sylvain Gugger authored Apr 14, 2021



* IterableDatasetShard

* Test and integration in Trainer

* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Style
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


[deepspeed] test on one node 2 gpus max (#11237) · 83206ca6

Stas Bekman authored Apr 14, 2021

* test on one node 2 gpus max

* fix the other place

* refactor

* fix

* cleanup

* more exact version


Fix #10128 (#11248) · 25e1af36
Sylvain Gugger authored Apr 14, 2021


[troubleshooting] add 2 points of reference to the offline mode (#11236) · 63ca4023

Stas Bekman authored Apr 14, 2021



* add 2 points of reference to the offline mode

* link the new doc

* add error message

* Update src/transformers/modeling_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style

* rename

* Trigger CI
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


Add prefix to examples in model_doc rst (#11226) · 075e821d

Yusuke Mori authored Apr 14, 2021



* Add prefix to examples in model_doc rst

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


Fix dimention misspellings. (#11238) · 4670b57c

Thomas Wood authored Apr 14, 2021

* Update modeling_gpt_neo.py

dimention -> dimension

* Update configuration_speech_to_text.py

dimention -> dimension


Close open files to suppress ResourceWarning (#11240) · f25444cb
Sudharsan S T authored Apr 14, 2021
```
Co-authored-by: Sudharsan Thirumalai <sudharsan.t@sprinklr.com>
```

Stale bot updated (#10562) · 7fe5aaa8

Lysandre Debut authored Apr 14, 2021

* Updated stale bot

* Specify issue number

* Remove particular handling of assignees

* Unleash the stalebot

* Remove debug branch


make embeddings plural in warning message (#11228) · 9337c6c6
Joel Stremmel authored Apr 14, 2021

Save the Wav2Vec2 processor before training starts (#10910) · 653076ca
Nithin Holla authored Apr 14, 2021
```
Co-authored-by: nithin19 <nithin@amberscript.com>
```

13 Apr, 2021 13 commits

[Deepspeed] zero3 tests band aid (#11235) · 3d339ee6
Stas Bekman authored Apr 13, 2021
```
* temp band-aid

* style
```

Run CI on deepspeed and fairscale (#11172) · 1ad7b039

Lysandre Debut authored Apr 13, 2021

* Run CI on deepspeed and fairscale

* Test it on this branch :)

* Rename

* Update the CI image


Indent code block in the documentation (#11233) · f38cd437
Sylvain Gugger authored Apr 13, 2021
```
* Indent code block

* Indent code blocks version 2

* Quality
```
Avoid using no_sync on SageMaker DP (#11229) · 9d8e8a87
Sylvain Gugger authored Apr 13, 2021

added cache_dir=model_args.cache_dir to all example with cache_dir arg (#11220) · 9fa29959
Philipp Schmid authored Apr 13, 2021

Doc check: a bit of clean up (#11224) · 3312e96b
Sylvain Gugger authored Apr 13, 2021


Refactor GPT2 (#11225) · edca520d

Suraj Patil authored Apr 13, 2021



* refactor GPT2

* fix mlp and head pruning

* address Sylvain's comments

* apply suggestion from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


Document v4.5.1 · 893e51a5
Sylvain Gugger authored Apr 13, 2021

Replace error by warning when loading an architecture in another (#11207) · 81009b7a
Sylvain Gugger authored Apr 13, 2021
```
* Replace error by warning when loading an architecture in another

* Style

* Style again

* Add a test

* Adapt old test
```

Add documentation for BertJapanese (#11219) · 22fa0a60

Yusuke Mori authored Apr 13, 2021



* Start writing BERT-Japanese doc

* Fix typo, Update toctree

* Modify model file to use comment for document, Add examples

* Clean bert_japanese by make style

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Split a big code block into two

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add prefix >>> to all lines in code blocks

* Clean bert_japanese by make fixup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


fix docstrings (#11221) · 896d7be9
Suraj Patil authored Apr 13, 2021


Fix GPT-2 warnings (#11213) · 823df939

Lysandre Debut authored Apr 13, 2021



* Fix GPT-2 warnings

* Update src/transformers/models/gpt2/modeling_gpt2.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>


Add Matt as the TensorFlow reference (#11212) · 0cd89d8c
Lysandre Debut authored Apr 13, 2021
