Commits · 917f1045025001bc7dfefc992bd101e02fe4bada · chenpangpang / transformers

08 Mar, 2021 12 commits
- [examples tests] various fixes (#10584) · 917f1045
  Stas Bekman authored Mar 08, 2021
```
* fix sharded ddp enum

* test fixes

* stronger validation + apex breaks other tests
```
  917f1045
- offline mode for firewalled envs (part 2) (#10569) · 6f84531e
  Stas Bekman authored Mar 08, 2021
```
* more readable test

* add all the missing places

* one more nltk

* better exception check

* revert
```
  6f84531e
- Fix version control with anchors (#10595) · 54693694
  Sylvain Gugger authored Mar 08, 2021
```
* Fix version control with anchors

* Simplify
```
  54693694
- fix double wrapping + test (#10583) · f8829660
  Stas Bekman authored Mar 08, 2021
  
  f8829660
- tokenization_marian.py: use current_spm for decoding (#10357) · b8805084
  Mehrad Moradshahi authored Mar 08, 2021
```
* Fix Marian decoding

Tokenizer's decode and batch_decode now accepts a new argument (use_source_tokenizer) which indicates whether the source spm should be used to decode ids. This is useful for Marian models specificallly when decoding source input ids.

* Adapt docstrings
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
```
  b8805084
- Correct YAML · 8fd7eb34
  Lysandre authored Mar 08, 2021
  
  8fd7eb34
- Enable torch 1.8.0 on GPU CI (#10593) · 89b8d4f5
  Lysandre Debut authored Mar 08, 2021
```
* Enable torch 1.8.0 in GPU CI

* Disable torch-scatter
```
  89b8d4f5
- [M2M100] fix positional embeddings (#10590) · 2a737bff
  Suraj Patil authored Mar 08, 2021
```
* fix tests

* emb should be a parameter

* fix positional embeddings

* fix make_weights

* don't save pos embeds

* add comment to describe the clamping
```
  2a737bff
- fix BART Summarization example in doc (#10582) · d59464db
  Oren Amsalem authored Mar 08, 2021
  
  d59464db
- Fix typo in docstring for pipeline (#10591) · 3b583d02
  Eunhyuk Shin authored Mar 08, 2021
  
  3b583d02
- fix nltk lookup (#10585) · e6ce636e
  Stas Bekman authored Mar 07, 2021
  
  e6ce636e
- fix tf doc bug (#10570) · 9dd054fb
  Yu authored Mar 08, 2021
  
  9dd054fb
06 Mar, 2021 3 commits

Suraj Patil authored Mar 06, 2021

* m2m_100

* no layernorm_embedding

* sinusoidal positional embeddings

* update pos embeddings

* add default config values

* tokenizer

* add conversion script

* fix config

* fix pos embed

* remove _float_tensor

* update tokenizer

* update lang codes

* handle lang codes

* fix pos embeds

* fix spm key

* put embedding weights on device

* remove qa and seq classification heads

* fix convert script

* lang codes pn one line

* fix embeds

* fix tokenizer

* fix tokenizer

* add fast tokenizer

* style

* M2M100MT => M2M100

* fix copyright, style

* tokenizer converter

* vocab file

* remove fast tokenizer

* fix embeds

* fix tokenizer

* fix tests

* add tokenizer tests

* add integration test

* quality

* fix model name

* fix test

* doc

* doc

* fix doc

* add copied from statements

* fix tokenizer tests

* apply review suggestions

* fix urls

* fix shift_tokens_right

* apply review suggestions

* fix

* fix doc

* add lang code to id

* remove unused function

* update checkpoint names

* fix copy

* fix tokenizer

* fix checkpoint names

* fix merge issue

* style

f6e74a63

Temporarily disable stale bot · fd011044
Lysandre authored Mar 06, 2021

fd011044

offline mode for firewalled envs (#10407) · 88a951e3

Stas Bekman authored Mar 05, 2021



* offline mode start

* add specific values

* fix fallback

* add test

* better values check and range

* test that actually works

* document the offline mode

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* more strict check

* cleaner test

* pt-only test

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

88a951e3

05 Mar, 2021 11 commits

Refactoring checkpoint names for multiple models (#10527) · 90ecc296

Daniel Hug authored Mar 05, 2021

* Refactor checkpoint name in ALBERT and ALBERT_tf

* Refactor checkpoint name in BART and BART_tf

* Refactor checkpoint name in BERT generation

* Refactor checkpoint name in Blenderbot_tf

* Refactor checkpoint name in Blenderbot_small_tf

* Refactor checkpoint name in ConvBERT AND CONVBERT_TF

* Refactor checkpoint name in CTRL AND CTRL_TF

* Refactor checkpoint name in DistilBERT AND DistilBERT_TF

* Refactor checkpoint name in DistilBERT redo

* Refactor checkpoint name in Electra and Electra_tf

* Refactor checkpoint name in FlauBERT and FlauBERT_tf

* Refactor checkpoint name in FSMT

* Refactor checkpoint name in GPT2 and GPT2_tf

* Refactor checkpoint name in IBERT

* Refactor checkpoint name in LED and LED_tf

* Refactor checkpoint name in Longformer and Longformer_tf

* Refactor checkpoint name in Lxmert and Lxmert_tf

* Refactor checkpoint name in Marian_tf

* Refactor checkpoint name in MBART and MBART_tf

* Refactor checkpoint name in MobileBERT and MobileBERT_tf

* Refactor checkpoint name in mpnet and mpnet_tf

* Refactor checkpoint name in openai and openai_tf

* Refactor checkpoint name in pegasus_tf

* Refactor checkpoint name in reformer

* Refactor checkpoint name in Roberta and Roberta_tf

* Refactor checkpoint name in SqueezeBert

* Refactor checkpoint name in Transformer_xl and Transformer_xl_tf

* Refactor checkpoint name in XLM and XLM_tf

* Refactor checkpoint name in XLNET and XLNET_tf

* Refactor checkpoint name in BERT_tf

* run make tests, style, quality, fixup

90ecc296

Stale Bot (#10509) · defe9e20

Lysandre Debut authored Mar 05, 2021

* Add stale bot to Github Actions

* Update message

* Message for assignee

* Update scripts/stale.py

* Uncomment & stop testing

defe9e20

Fix embeddings for PyTorch 1.8 (#10549) · 7da995c0

Sylvain Gugger authored Mar 05, 2021

* Fix embeddings for PyTorch 1.8

* Try with PyTorch 1.8.0

* Fix embeddings init

* Fix copies

* Typo

* More typos

7da995c0

Typo correction. (#10531) · 3e056c10

Chen Liang authored Mar 05, 2021

DEBERTA_PRETRAINED_MODEL_ARCHIVE_LIST => DEBERTA_V2_PRETRAINED_MODEL_ARCHIVE_LIST in line 31.

3e056c10

fixed dead link in trainer doc (#10554) · 9f8bc87c
Joakim Warholm authored Mar 05, 2021

9f8bc87c
Fix torch 1.8.0 segmentation fault (#10546) · 6b58e155
Lysandre Debut authored Mar 05, 2021
```
* Only run one test

* Patch segfault

* Fix summarization pipeline

* Ready for merge
```
6b58e155
fix run seq2seq (#10547) · 395ffcd7
Patrick von Platen authored Mar 05, 2021

395ffcd7
Fixing conversation test for torch 1.8 (#10545) · 54e55b52
Nicolas Patry authored Mar 05, 2021

54e55b52
Pin torch to 1.7.1 in tests while we resolve issues · dc9aaa38
Lysandre authored Mar 05, 2021

dc9aaa38
Fix example of custom Trainer to reflect signature of compute_loss (#10537) · 12b66215
lewtun authored Mar 05, 2021

12b66215
Update scatter to use torch 1.8.0 · 093b88f4
Lysandre authored Mar 05, 2021

093b88f4

04 Mar, 2021 6 commits
- [ProphetNet] Bart-like Refactor (#10501) · c503a1c1
  Patrick von Platen authored Mar 04, 2021
```
* first step to refactor

* make all fast tests pass

* make all slow tests pass

* save intermediate

* correct cache

* finish PR

* make fp16 work
```
  c503a1c1
- Rework TPU checkpointing in Trainer (#10504) · 6290169e
  Sylvain Gugger authored Mar 04, 2021
```
* Rework TPU checkpointing in Trainer

* Wraps the barrier in a dist test

* Address review comments

* Remove line
```
  6290169e
- Removes overwrites for output_dir (#10521) · 805c5200
  Philipp Schmid authored Mar 04, 2021
```
* removed overwrites

* remove default value for output_dir

* adjusted typing
```
  805c5200
- Not always consider a local model a checkpoint in run_glue (#10517) · a5bd40b7
  Sylvain Gugger authored Mar 04, 2021
  
  a5bd40b7
- Revert "Not always consider a local model a checkpoint in run_glue" · 745ea78d
  Sylvain Gugger authored Mar 04, 2021
```
This reverts commit f3660613.
```
  745ea78d
- Not always consider a local model a checkpoint in run_glue · f3660613
  Sylvain Gugger authored Mar 04, 2021
  
  f3660613
03 Mar, 2021 8 commits
- Remove unsupported methods from ModelOutput doc (#10505) · 948b730f
  Sylvain Gugger authored Mar 03, 2021
  
  948b730f
- Smp grad accum (#10488) · b70f441b
  Sylvain Gugger authored Mar 03, 2021
```
* Fix gradient accumulation for SM Model Parallelism

* Style and divide loss by grad accum steps
```
  b70f441b
- Fix the bug in constructing the all_hidden_states of DeBERTa v2 (#10466) · d064fb56
  felixgwu authored Mar 03, 2021
```
* fix all_hidden_states

* use output_states instead of next_kv
```
  d064fb56
- remap MODEL_FOR_QUESTION_ANSWERING_MAPPING classes to names auto-generated file (#10487) · 188574ac
  Stas Bekman authored Mar 03, 2021
```
* remap classes to strings

* missing new util

* style

* doc

* move the autogenerated file

* Trigger CI
```
  188574ac
- Refactor checkpoint name in BERT and MobileBERT (#10424) · 801ff969
  Sylvain Gugger authored Mar 03, 2021
```
* Refactor checkpoint name in BERT and MobileBERT

* Add option to check copies

* Add QuestionAnswering

* Add last models

* Make black happy
```
  801ff969
- feat(docs): navigate with left/right arrow keys (#10481) · 39f70a40
  Jeff Yang authored Mar 03, 2021
```
* feat(docs): navigate with left/right arrow keys

* fix: add missing comma
```
  39f70a40
- [T5] Fix speed degradation bug t5 (#10496) · 2d2ed2cc
  Patrick von Platen authored Mar 03, 2021
```
* fix speed degradation bug t5

* fix for all models

* fix code quality
```
  2d2ed2cc
- Fixed minor spelling mistakes (#10489) · 5dc303e2
  WybeKoper authored Mar 03, 2021
```
Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>
```
  5dc303e2