Commits · c1f209dadd3ec595de10f8a3560b29e0225d21ab · chenpangpang / transformers

11 Mar, 2022 18 commits

[ZeRO] Fixes issue with embedding resize (#16093) · c1f209da

Jeff Rasley authored Mar 11, 2022



* gather z3 params for new_lm_head

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

c1f209da

Audio/vision task guides (#15808) · ae2dd42b

Steven Liu authored Mar 11, 2022

* 📝 first draft of audio/vision guides

* ✨ make fixup

* 🖍 fix typo

* 🖍 close parentheses

* 🖍 apply feedback

* 🖍 apply feedback, make fixup

* 🖍 more fixup for perceiver

* 🖍 apply feedback

* ✨ make fixup

* 🖍 fix data collator

ae2dd42b

[Fix doc example] FSMT (#16085) · cb5e50c8
Yih-Dar authored Mar 11, 2022
```
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
cb5e50c8
Add missing type hints for all flavors of RoBERTa PyTorch models. (#16086) · eaed6897
Thomas Chaigneau authored Mar 11, 2022
```
* Add missing type hints for all flavors of RoBERTa PyTorch models.

* Fixed type hints for all classes and fixed return types.
```
eaed6897

Rebuild deepspeed (#16081) · a01fe4cd

Lysandre Debut authored Mar 11, 2022



* Rebuild deepspeed

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

a01fe4cd

add type annotations for ImageGPT (#16088) · 7f3d4440
João Gustavo A. Amorim authored Mar 11, 2022

7f3d4440
Update troubleshoot guide (#16001) · 5b4c97d0
Steven Liu authored Mar 11, 2022
```
* 📝 first draft

* 🖍 apply feedback

* 🖍 apply feedback
```
5b4c97d0

Add soft length regulation for sequence generation (#15245) · 9442b3ce

Kevin Bondzio authored Mar 11, 2022



* add possibility to softly regulate length when using sampling method in model.generate() function

* fix test config, fix formatting

* fix rag integration, fix docstyling

* fix wrong docstring

* change param to tuple, add test

* fix old param in rag_model, remove unused import

* change test according to new param

* fix formatting

* fix test case

* fix doc style

* move start_length calculation to Logitprocessor

* add possibility to softly regulate length when using sampling method in model.generate() function

* fix rag integration, fix docstyling

* fix test config, fix formatting

* change param to tuple, add test

* fix old param in rag_model, remove unused import

* add possibility to softly regulate length when using sampling method in model.generate() function

* change param to tuple, add test

* fix old param in rag_model, remove unused import

* remove unused import

* fix small errors

* fix test

* add possibility to softly regulate length when using sampling method in model.generate() function

* fix test config, fix formatting

* fix rag integration, fix docstyling

* change param to tuple, add test

* fix old param in rag_model, remove unused import

* change test according to new param

* fix test case

* move start_length calculation to Logitprocessor

* add possibility to softly regulate length when using sampling method in model.generate() function

* fix rag integration, fix docstyling

* fix test config, fix formatting

* change param to tuple, add test

* fix old param in rag_model, remove unused import

* add possibility to softly regulate length when using sampling method in model.generate() function

* fix test config, fix formatting

* fix rag integration, fix docstyling

* add possibility to softly regulate length when using sampling method in model.generate() function

* fix rag integration, fix docstyling

* change param to tuple, add test

* fix old param in rag_model, remove unused import

* fix small errors

* Update src/transformers/generation_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/generation_utils.py

* Update src/transformers/generation_utils.py

* fix docstring, add type ind model rag

* fix docstrings

* introduce seq_length variable for cleaner code

* fix black formatting

* add input_ids_seq_length to modeling_rag

* add input_ids_seq_length to test

* retrigger checks

* retrigger checks
Co-authored-by: Kevin Bondzio <kev@AIM-LAP-02.local>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Kevin Bondzio <kev@AIM-LAP-02.fritz.box>

9442b3ce

Run daily test without time-out at least once (#16077) · 322c8533
Patrick von Platen authored Mar 11, 2022

322c8533
check for key 'torch.dtype' in nested dicts in config (#16065) · 7e00247f
feifang24 authored Mar 11, 2022

7e00247f
Adding type hints for TFRoBERTa (#16057) · 5d2fed2e
Matt authored Mar 11, 2022
```
* Adding type annotations for TFRoBERTa

* Add type hints to TFRobertaModel too
```
5d2fed2e
Add type annotations for BERT and copies (#16074) · bb69d154
Matt authored Mar 11, 2022
```
* Add type annotations for BERT and copies

* make fixup
```
bb69d154
Force default brnahc name via the config · f7708e1b
Sylvain Gugger authored Mar 11, 2022

f7708e1b
Trigger doc build · ecf989ca
Sylvain Gugger authored Mar 11, 2022

ecf989ca
Fix torch-scatter version (#16072) · 0868fdef
Lysandre Debut authored Mar 11, 2022

0868fdef
Remove assertion over possible activation functions in DistilBERT (#16066) · 5b369dc5
Funtowicz Morgan authored Mar 11, 2022
```
* Remove assertion over possible activation functions

* Same for TF and Flax
```
5b369dc5
Move QDQBert in just PyTorch block (#16062) · f5741bcd
Sylvain Gugger authored Mar 11, 2022

f5741bcd
Fix a TF test name (LayoutLMModelTest) (#16061) · b6bdb943
Yih-Dar authored Mar 11, 2022
```
* fix name
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
b6bdb943

10 Mar, 2022 17 commits

updating fine-tune classifier documentation (#16063) · 96ac7549
David S. Batista authored Mar 10, 2022

96ac7549

Fix duplicate arguments passed to dummy inputs in ONNX export (#16045) · 6b093283

lewtun authored Mar 10, 2022

* Fix duplicate arguments passed to dummy inputs in ONNX export

* Fix M2M100 ONNX config

* Ensure we check PreTrained model only if torch is available

* Remove TensorFlow tests for models without PyTorch parity

6b093283

support new marian models (#15831) · ba21001f

Suraj Patil authored Mar 10, 2022

* support not sharing embeddings

* update modeling

* update tokenizer

* fix conversion script

* always use self.shared

* boom boom

* begin tests

* update tests

* fix resize_decoder_token_embeddings

* address Patrick's comments

* style

* update conversion script

* fix conversion script

* fix tokenizer

* better name target vocab

* add integration test for tokenizer with two vocabs

* style

* address Patrick's comments

* add integration test for model

ba21001f

DeBERTa/DeBERTa-v2/SEW Support for torch 1.11 (#16043) · e66743e6
Lysandre Debut authored Mar 10, 2022
```
* Support for torch 1.11

* Address Sylvain's comment
```
e66743e6
Fix Bug in Flax Seq2Seq Models (#16021) · 741e4930
Sanchit Gandhi authored Mar 10, 2022
```
* Fix Bug in Flax Seq2Seq Models

* incorporate suggested changes
```
741e4930

TF: Unpack model inputs through a decorator (#15907) · b7018abf

Joao Gante authored Mar 10, 2022

* MVP

* apply decorator to TFBertModel

* finish updating bert

* update rembert (copy-linked to bert)

* update roberta (copy-linked to bert); Fix args

* Now working for non-text modalities

b7018abf

Don't compute metrics in LM examples on TPU (#16029) · 19597998
Sylvain Gugger authored Mar 10, 2022

19597998

Build the doc in a seperate folder then move it (#16020) · 10591399

Sylvain Gugger authored Mar 10, 2022

* Build the doc in a seperate folder then move it

* Allow job

* Is this it?

* Dislike comments?

* Copy instead of move

* Removing version built

* Typos

* No variable

* Take _versions.yml into account

* Finish main job and add dev job

* Forgot the run

* Fix syntax error

* Execute builder from the repo

* Typo

10591399

Fix TFDebertaV2ConvLayer in TFDebertaV2Model (#16031) · 2f463eff
Yih-Dar authored Mar 10, 2022
```
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
2f463eff
Fix Bug in Flax-Speech-Encoder-Decoder Test (#16041) · 1da84ae0
Sanchit Gandhi authored Mar 10, 2022
```
* Fix Bug in Flax-Speech-Encoder-Decoder Test

* change thresholds for CPU precision
```
1da84ae0
[README] fix url for Preprocessing tutorial (#16042) · b2a1c994
Suraj Patil authored Mar 10, 2022

b2a1c994

[Tests] Add attentions_option to ModelTesterMixin (#15909) · 8d83ebdf

NielsRogge authored Mar 10, 2022



* Add attentions_option to common tester

* Fix tests, apply suggestion

* Apply suggestion from code review
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

8d83ebdf

[Docs] Improve PyTorch, Flax generate API (#15988) · 6ce11c2c

Patrick von Platen authored Mar 10, 2022

* Move generate docs

* up

* Update docs/source/_toctree.yml

* correct

* correct some stuff

* correct tests

* more fixes

* finish generate

* add to doc stest

* finish

* finalize

* add warning to generate method

6ce11c2c

Fix dependency error message in ServeCommand (#16033) · 0951d317
André Storhaug authored Mar 10, 2022
```
"uvicorn" is misspelled as "unicorn".
```
0951d317

Add Document Image Transformer (DiT) (#15984) · 0835119b

NielsRogge authored Mar 10, 2022



* Add conversion script

* Improve script

* Fix bug

* Add option to push to hub

* Add support for classification models

* Update model name

* Upload feature extractor files first

* Remove hash checking

* Fix config

* Add id2label

* Add import

* Fix id2label file name

* Fix expected shape

* Add model to README

* Improve docs

* Add integration test and fix CI

* Fix code style

* Add missing init

* Add model to SPECIAL_MODULE_TO_TEST_MAP
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

0835119b

Update README.md · 6c9010ef
Sanchit Gandhi authored Mar 10, 2022

6c9010ef
Freeze Feature Encoder in FlaxSpeechEncoderDecoder (#15997) · fde90187
Sanchit Gandhi authored Mar 10, 2022
```
* Freeze Feature Encoder in FlaxSpeechEncoderDecoder

* add backprop test
```
fde90187

09 Mar, 2022 5 commits

Fix warning message in ElectraForCausalLM (#16023) · 65f9653e
Pavel Belevich authored Mar 09, 2022

65f9653e

add doctests for bart like seq2seq models (#15987) · a69e1850

Suraj Patil authored Mar 09, 2022



* boom boom

* enable doctest for few seq2seq models

* add seq2seq models in documentation_tests.txt

* fix docstring blenderbot

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix seq classif doc sample

* don't check loss for seq classif examples

* +IGNORE_OUTPUT => +IGNORE_RESULT

* fix _SEQ_CLASS_EXPECTED_OUTPUT_SHAPE

* fix some docs

* more fixes

* last fix (hopefully)

* fix big bird gen example

* fix mbart gen example
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

a69e1850

Add FlaxBartForCausalLM (#15995) · b256f351

Sanchit Gandhi authored Mar 09, 2022

* add causal lm

* add CausalLM tests

* Add FlaxBartForCausalLM

* Add EncoderDecoder model tests

* change docstring

* make repo-consistency

* suggested changes

* remove jax ops

* correction

* rename pre-trained decoder model

b256f351

Add ONNX export for ViT (#15658) · 50dd314d

lewtun authored Mar 09, 2022



* Add ONNX support for ViT

* Refactor to use generic preprocessor

* Add vision dep to tests

* Extend ONNX slow tests to ViT

* Add dummy image generator

* Use model_type to determine modality

* Add deprecation warnings for tokenizer argument

* Add warning when overwriting the preprocessor

* Add optional args to docstrings

* Add minimum PyTorch version to OnnxConfig

* Refactor OnnxConfig class variables from CONSTANT_NAME to snake_case

* Add reasonable value for default atol
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

50dd314d

Use tiny models for get_pretrained_model in TFEncoderDecoderModelTest (#15989) · b7fa1e3d

Yih-Dar authored Mar 09, 2022



* Use tiny model for TFRembertEncoderDecoderModelTest.get_pretrained_model()
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b7fa1e3d