- 14 Mar, 2022 1 commit
-
lewtun authored
* Make Camembert great again!
* Add Camembert to TensorFlow ONNX tests
-
- 13 Mar, 2022 1 commit
-
Thomas Chaigneau authored
* Add missing type hints for all flavors of LayoutLMv2 PyTorch models.
* Fixed return types and added type hints for LayoutLM.
* Fix removed arguments which break tests.
-
- 12 Mar, 2022 6 commits
-
James Barry authored
-
João Gustavo A. Amorim authored
-
p-mishra1 authored
-
Abdelrhman-Hosny authored
-
Omar Sanseviero authored
-
Stas Bekman authored
* [WIP] add support for bf16 mode
* prep for bf16
* prep for bf16
* fix; zero2/bf16 is ok
* check bf16 is available
* test fixes
* enable zero3_bf16
* config files
* docs
* split stage_dtype; merge back to non-dtype-specific config file
* fix doc
* cleanup
* cleanup
* bfloat16 => bf16 to match the PR changes
* s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/
* test fixes/skipping
* move
* fix
* Update docs/source/main_classes/deepspeed.mdx
* backticks
* cleanup
* cleanup
* cleanup
* new version
* add note about grad accum in bf16
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
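For context, bf16 mode is switched on through the DeepSpeed config that the Trainer consumes. A minimal sketch, assuming the usual Trainer/DeepSpeed wiring (the output path, batch-size placeholders, and ZeRO stage are illustrative, and running it requires the deepspeed package):

```python
# Sketch: enabling bf16 in a DeepSpeed config passed to the HF Trainer.
# "auto" values are filled in from TrainingArguments by the integration.
from transformers import TrainingArguments

ds_config = {
    "bf16": {"enabled": True},  # train in bfloat16 instead of fp16
    "zero_optimization": {
        "stage": 3,
        # config key renamed from the fp16-specific name, as tracked by
        # the s/..._fp16_.../..._16bit_.../ bullet above
        "stage3_gather_16bit_weights_on_model_save": True,
    },
    "train_batch_size": "auto",
    "train_micro_batch_size_per_gpu": "auto",
}

args = TrainingArguments(
    output_dir="out",       # illustrative path
    bf16=True,              # keep Trainer and DeepSpeed in agreement
    deepspeed=ds_config,    # accepts a dict or a path to a JSON file
)
```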
-
- 11 Mar, 2022 18 commits
-
Jeff Rasley authored
* gather z3 params for new_lm_head
* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
-
Steven Liu authored
* 📝 first draft of audio/vision guides
* ✨ make fixup
* 🖍 fix typo
* 🖍 close parentheses
* 🖍 apply feedback
* 🖍 apply feedback, make fixup
* 🖍 more fixup for perceiver
* 🖍 apply feedback
* ✨ make fixup
* 🖍 fix data collator
-
Yih-Dar authored
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Thomas Chaigneau authored
* Add missing type hints for all flavors of RoBERTa PyTorch models.
* Fixed type hints for all classes and fixed return types.
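For a sense of what these annotations look like in practice, here is a generic sketch (ExampleClassifier and the trimmed argument list are illustrative, not the actual RoBERTa signature):

```python
# Sketch of the type-hint style added to PyTorch model forward methods:
# Optional tensor arguments and a Union return type covering both the
# plain-tuple and ModelOutput return paths.
from typing import Optional, Tuple, Union

import torch
from transformers.modeling_outputs import SequenceClassifierOutput

class ExampleClassifier(torch.nn.Module):
    def forward(
        self,
        input_ids: Optional[torch.LongTensor] = None,
        attention_mask: Optional[torch.FloatTensor] = None,
        labels: Optional[torch.LongTensor] = None,
        return_dict: Optional[bool] = None,
    ) -> Union[Tuple[torch.Tensor, ...], SequenceClassifierOutput]:
        ...  # real signatures carry many more arguments than shown here
```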
-
Lysandre Debut authored
* Rebuild deepspeed
* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
-
João Gustavo A. Amorim authored
-
Steven Liu authored
* 📝 first draft
* 🖍 apply feedback
* 🖍 apply feedback
-
Kevin Bondzio authored
* add possibility to softly regulate length when using sampling method in model.generate() function
* fix test config, fix formatting
* fix rag integration, fix docstyling
* fix wrong docstring
* change param to tuple, add test
* fix old param in rag_model, remove unused import
* change test according to new param
* fix formatting
* fix test case
* fix doc style
* move start_length calculation to LogitsProcessor
* remove unused import
* fix small errors
* fix test
* Update src/transformers/generation_utils.py
* fix docstring, add type in model rag
* fix docstrings
* introduce seq_length variable for cleaner code
* fix black formatting
* add input_ids_seq_length to modeling_rag
* add input_ids_seq_length to test
* retrigger checks
Co-authored-by: Kevin Bondzio <kev@AIM-LAP-02.local>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Kevin Bondzio <kev@AIM-LAP-02.fritz.box>
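The mechanism behind this change is a custom logits processor. The sketch below shows the general technique; the class name, additive boost, and constructor arguments are illustrative assumptions, not the exact processor this PR added (which is exposed through the tuple-valued exponential_decay_length_penalty parameter of generate()):

```python
# Minimal sketch of soft length regulation via a LogitsProcessor: past a
# start length, the EOS logit gets an exponentially growing boost, nudging
# sampling toward ending the sequence without hard-truncating it.
# SoftLengthLogitsProcessor is a hypothetical name for illustration.
import torch
from transformers import LogitsProcessor

class SoftLengthLogitsProcessor(LogitsProcessor):
    def __init__(self, start_index: int, decay_factor: float, eos_token_id: int):
        self.start_index = start_index    # length where the nudge begins
        self.decay_factor = decay_factor  # > 1.0; larger means a harder push
        self.eos_token_id = eos_token_id

    def __call__(
        self, input_ids: torch.LongTensor, scores: torch.FloatTensor
    ) -> torch.FloatTensor:
        cur_len = input_ids.shape[-1]
        if cur_len > self.start_index:
            # add a boost that grows with the overshoot past start_index
            scores[:, self.eos_token_id] += self.decay_factor ** (
                cur_len - self.start_index
            )
        return scores
```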
-
Patrick von Platen authored
-
feifang24 authored
-
Matt authored
* Adding type annotations for TFRoBERTa
* Add type hints to TFRobertaModel too
-
Matt authored
* Add type annotations for BERT and copies
* make fixup
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Lysandre Debut authored
-
Funtowicz Morgan authored
* Remove assertion over possible activation functions
* Same for TF and Flax
-
Sylvain Gugger authored
-
Yih-Dar authored
* fix name
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 10 Mar, 2022 14 commits
-
David S. Batista authored
-
lewtun authored
* Fix duplicate arguments passed to dummy inputs in ONNX export
* Fix M2M100 ONNX config
* Ensure we check PreTrained model only if torch is available
* Remove TensorFlow tests for models without PyTorch parity
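As background, these exports run through the transformers.onnx package. A minimal sketch of the export path under the API of this era (the checkpoint, output path, and "default" feature are illustrative placeholders):

```python
# Sketch: exporting a supported checkpoint to ONNX with transformers.onnx.
from pathlib import Path

from transformers import AutoModel, AutoTokenizer
from transformers.onnx import export
from transformers.onnx.features import FeaturesManager

checkpoint = "camembert-base"  # any ONNX-supported checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModel.from_pretrained(checkpoint)

# Look up the ONNX config class registered for this architecture/feature pair
model_kind, onnx_config_factory = FeaturesManager.check_supported_model_or_raise(
    model, feature="default"
)
onnx_config = onnx_config_factory(model.config)

# Write the graph; returns the matched input and output names
onnx_inputs, onnx_outputs = export(
    tokenizer, model, onnx_config, onnx_config.default_onnx_opset, Path("model.onnx")
)
```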
-
Suraj Patil authored
* support not sharing embeddings
* update modeling
* update tokenizer
* fix conversion script
* always use self.shared
* boom boom
* begin tests
* update tests
* fix resize_decoder_token_embeddings
* address Patrick's comments
* style
* update conversion script
* fix conversion script
* fix tokenizer
* better name target vocab
* add integration test for tokenizer with two vocabs
* style
* address Patrick's comments
* add integration test for model
-
Lysandre Debut authored
* Support for torch 1.11
* Address Sylvain's comment
-
Sanchit Gandhi authored
* Fix Bug in Flax Seq2Seq Models
* incorporate suggested changes
-
Joao Gante authored
* MVP
* apply decorator to TFBertModel
* finish updating bert
* update rembert (copy-linked to bert)
* update roberta (copy-linked to bert); Fix args
* Now working for non-text modalities
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Build the doc in a separate folder then move it
* Allow job
* Is this it?
* Dislike comments?
* Copy instead of move
* Removing version built
* Typos
* No variable
* Take _versions.yml into account
* Finish main job and add dev job
* Forgot the run
* Fix syntax error
* Execute builder from the repo
* Typo
-
Yih-Dar authored
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sanchit Gandhi authored
* Fix Bug in Flax-Speech-Encoder-Decoder Test
* change thresholds for CPU precision
-
Suraj Patil authored
-
NielsRogge authored
* Add attentions_option to common tester
* Fix tests, apply suggestion
* Apply suggestion from code review
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Patrick von Platen authored
* Move generate docs
* up
* Update docs/source/_toctree.yml
* correct
* correct some stuff
* correct tests
* more fixes
* finish generate
* add to doc tests
* finish
* finalize
* add warning to generate method
-
André Storhaug authored
"uvicorn" is misspelled as "unicorn".
-