Commits · edca520d0fd8f23e6a5dbf98c209f8da0e3e293c · chenpangpang / transformers

13 Apr, 2021 9 commits

Suraj Patil authored Apr 13, 2021



* refactor GPT2

* fix mlp and head pruning

* address Sylvain's comments

* apply suggestion from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

edca520d

Document v4.5.1 · 893e51a5
Sylvain Gugger authored Apr 13, 2021

893e51a5
Replace error by warning when loading an architecture in another (#11207) · 81009b7a
Sylvain Gugger authored Apr 13, 2021
```
* Replace error by warning when loading an architecture in another

* Style

* Style again

* Add a test

* Adapt old test
```
81009b7a

Add documentation for BertJapanese (#11219) · 22fa0a60

Yusuke Mori authored Apr 13, 2021



* Start writing BERT-Japanese doc

* Fix typo, Update toctree

* Modify model file to use comment for document, Add examples

* Clean bert_japanese by make style

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Split a big code block into two

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add prefix >>> to all lines in code blocks

* Clean bert_japanese by make fixup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

22fa0a60

fix docstrings (#11221) · 896d7be9
Suraj Patil authored Apr 13, 2021

896d7be9

Fix GPT-2 warnings (#11213) · 823df939

Lysandre Debut authored Apr 13, 2021



* Fix GPT-2 warnings

* Update src/transformers/models/gpt2/modeling_gpt2.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

823df939

Add Matt as the TensorFlow reference (#11212) · 0cd89d8c
Lysandre Debut authored Apr 13, 2021

0cd89d8c

wav2vec2 converter: create the proper vocab.json while converting fairseq wav2vec2 finetuned model (#11041) · 7c205bf4

Ceyda Cinarel authored Apr 13, 2021

* add vocab while converting wav2vec2 original finetuned model

* check save directory exists

* return_attention_mask fix

* quality

7c205bf4

Use MSELoss in (M)BartForSequenceClassification (#11178) · d49d3cf6
calpt authored Apr 13, 2021

d49d3cf6

12 Apr, 2021 9 commits

Sagemaker test docs update for framework upgrade (#11206) · f243a5ec
Philipp Schmid authored Apr 13, 2021
```
* increased train_runtime for model parallelism

* added documentation for framework upgrade
```
f243a5ec
Import torch.utils.checkpoint in ProphetNet (#11214) · 74d7c24d
Lysandre Debut authored Apr 12, 2021

74d7c24d
Replaced `which` with `who` (#11183) · 38a10c6b
cronoik authored Apr 13, 2021

38a10c6b

Add DeiT (PyTorch) (#11056) · 9f126097

NielsRogge authored Apr 13, 2021

* First draft of deit

* More improvements

* Remove DeiTTokenizerFast from init

* Conversion script works

* Add DeiT to ViT conversion script

* Add tests, add head model, add support for deit in vit conversion script

* Update model checkpoint names

* Update image_mean and image_std, set resample to bicubic

* Improve docs

* Docs improvements

* Add DeiTForImageClassificationWithTeacher to init

* Address comments by @sgugger

* Improve feature extractors

* Make fix-copies

* Minor fixes

* Address comments by @patil-suraj

* All models uploaded

* Fix tests

* Remove labels argument from DeiTForImageClassificationWithTeacher

* Fix-copies, style and quality

* Fix tests

* Fix typo

* Multiple docs improvements

* More docs fixes

9f126097

Fix typo (#11188) · cb251ba6
Takuya Makino authored Apr 13, 2021

cb251ba6

Added documentation for data collator. (#10941) · 0c6fcd30

fghuman authored Apr 12, 2021



* Added documentation for data collator.

* Update docs/source/data_collator.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Added documentation for data collator.

* Added documentation for the data collator.

* Merge branch 'doc_DataCollator' of C:\Users\mahii\PycharmProjects\transformers with conflicts.

* Update documentation for the data collator.

* Update documentation for the data collator.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Amna <A.A.Ahmad@student.tudelft.nl>

0c6fcd30

model_path should be ignored as the checkpoint path (#11157) · ef102c48

Masatoshi TSUCHIYA authored Apr 12, 2021

* model_path is referred to as the path of the trainer, and should be ignored as the checkpoint path.

* Improved according to Sgugger's comment.

ef102c48

Fix style · 623cd6ae
Sylvain Gugger authored Apr 12, 2021

623cd6ae
Minor typos fixed (#11182) · a99f7f5c
cronoik authored Apr 12, 2021

a99f7f5c

09 Apr, 2021 13 commits
- Reactivate Megatron tests and use fewer workers · 26212c14
  Sylvain Gugger authored Apr 09, 2021
  
  26212c14
- Fix Typo · 716120cb
  Lysandre authored Apr 09, 2021
  
  716120cb
- added json dump and extraction of train run time (#11167) · 6f90c29e
  Philipp Schmid authored Apr 09, 2021
```
* added json dump and extraction of train run time

* make style happy
```
  6f90c29e
- [examples run_clm] fix _LazyModule hasher error (#11168) · 07f0bb69
  Stas Bekman authored Apr 09, 2021
```
* fix _LazyModule hasher error

* reword
```
  07f0bb69
- [examples/translation] support mBART-50 and M2M100 fine-tuning (#11170) · c161dd56
  Suraj Patil authored Apr 09, 2021
```
* keep a list of multilingual tokenizers

* add forced_bos_token argument
```
  c161dd56
- Add a special tokenizer for CPM model (#11068) · fb41f9f5
  Kevin Canwen Xu authored Apr 10, 2021
```
* Add a special tokenizer for CPM model

* make style

* fix

* Add docs

* styles

* cpm doc

* fix ci

* fix the overview

* add test

* make style

* typo

* Custom tokenizer flag

* Add README.md
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
```
  fb41f9f5
- Make `get_special_tokens_mask` consider all tokens (#11163) · 45fc8c79
  Sylvain Gugger authored Apr 09, 2021
  
  45fc8c79
- Update README.md (#11161) · 60607465
  Saviour Owolabi authored Apr 09, 2021
```
Corrected a typo ('Downlowd' to 'Download')
```
  60607465
- Fix LogitsProcessor documentation (#11130) · b9b60c16
  Keisuke Hirota authored Apr 09, 2021
```
* Change duplicated LogitsProcessor to LogitsWarper in LogitsProcessorList document

* Write more detailed information about LogitsProcessor's scores argument

* apply suggestion from review

* style
Co-authored-by: Suraj Patil <surajp815@gmail.com>
```
  b9b60c16
- [Community notebooks] Add Wav2Vec notebook for creating captions for YT Clips (#11142) · 8b78a32b
  Niklas Muennighoff authored Apr 09, 2021
```
* Add Wav2Vec Inference notebook

* Update docs/source/community.md
Co-authored-by: Suraj Patil <surajp815@gmail.com>
```
  8b78a32b
- typo (#11152) · 0311ba21
  Stas Bekman authored Apr 08, 2021
```
* typo

* style
```
  0311ba21
- Merge branch 'master' of github.com:huggingface/transformers · 269c9638
  Sylvain Gugger authored Apr 08, 2021
  
  269c9638
- Skip Megatron tests for now · d31c7b10
  Sylvain Gugger authored Apr 08, 2021
  
  d31c7b10
08 Apr, 2021 9 commits
- [setup] make fairscale and deepspeed setup extras (#11151) · c2e0fd52
  Stas Bekman authored Apr 08, 2021
```
* make fairscale and deepspeed setup extras

* fix default

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* no reason not to ask for the good version

* update the CIs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  c2e0fd52
- Add support for multiple models for one config in auto classes (#11150) · ba8b1f47
  Sylvain Gugger authored Apr 08, 2021
```
* Add support for multiple models for one config in auto classes

* Use get_values everywhere

* Prettier doc
```
  ba8b1f47
- [setup] extras[docs] must include 'all' (#11148) · 97ccf67b
  Stas Bekman authored Apr 08, 2021
```
* extras[doc] must include 'all'

* fix

* better

* regroup
```
  97ccf67b
- [tests] relocate core integration tests (#11146) · 66446909
  Stas Bekman authored Apr 08, 2021
```
* relocate core integration tests

* add sys.path context manager

* cleanup

* try

* try2

* fix path

* doc

* style

* add dep

* add 2 more deps
```
  66446909
- Run mlm pad to multiple for fp16 (#11128) · 6c40e497
  Andrea Cappelli authored Apr 08, 2021
```
* Add mlm collator pad to multiple option (#10627)

* Use padding to 8x in run mlm (#10627)
```
  6c40e497
- Don't duplicate logs in TensorBoard and handle --use_env (#11141) · dfed4ec2
  Sylvain Gugger authored Apr 08, 2021
  
  dfed4ec2
- Updates SageMaker docs for updating DLCs (#11140) · 9c9b8e70
  Philipp Schmid authored Apr 08, 2021
  
  9c9b8e70
- Add fairscale and deepspeed back to the CI (#11147) · ba2cf5f9
  Lysandre Debut authored Apr 08, 2021
```
* Add fairscale and deepspeed back to the CI

* Add deepspeed to single GPU tests
```
  ba2cf5f9
- [trainer] solve "scheduler before optimizer step" warning (#11144) · 1ed24afe
  Stas Bekman authored Apr 08, 2021
```
* solve "scheduler before optimizer step" warning

* style

* correct the state evaluation test
```
  1ed24afe