Commits · e1da89ccb836014ccb5fd01b140ee2bab2d0b083 · chenpangpang / transformers

17 Mar, 2022 6 commits

Fix reproducibility in Training for PyTorch 1.11 (#16209) · e1da89cc
Sylvain Gugger authored Mar 17, 2022

e1da89cc
Fix typo (#16208) · e5101c2e
Dayyan Smith authored Mar 17, 2022

e5101c2e
Fix FlaxRoFormerClassificationHead activation (#16168) · 25b8f9a8
Yih-Dar authored Mar 17, 2022
```
* fix activation
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
25b8f9a8

[Tests] Fix DiT test (#16218) · 03c14a51

NielsRogge authored Mar 17, 2022



* Fix device

* Clean up
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

03c14a51

Fixes Loss for TransfoXL when using Trainer API v2 (#16140) · 73f0a5d1

Lysandre Debut authored Mar 17, 2022



* fix(transfo_xl): Fixes TransfoXL support when using Trainer.

* fix(tests): Uses losses_1 and losses_2 pattern with TransfoXL test.

* fix(transfo_xl): Adds requested changes to allow for backward compatibility.

fix(transfo_xl): Adds requested changes to allow for backward compatibility.

fix(transfo_xl): Fixes code styling.

* Backward compatibility

* Update src/transformers/models/transfo_xl/modeling_transfo_xl.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Gustavo de Rosa <gth.rosa@uol.com.br>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

73f0a5d1

VAN: update modules names (#16201) · 76c74b37
Francesco Saverio Zuppichini authored Mar 17, 2022
```
* done

* done
```
76c74b37

16 Mar, 2022 15 commits

Add/type annotations/model vision (#16151) · 99e2982f

João Gustavo A. Amorim authored Mar 16, 2022

* add types annotations for Beit (PyTorch)

* add types annotations for ViT (PyTorch)

* add types annotations for Deit (PyTorch)

* change Optional[bool] to bool into some places at Beit

* change Optional[bool] to bool into some places at ViT

99e2982f

Fix generation min length (#16206) · 2410d0f8
Patrick von Platen authored Mar 16, 2022
```
* up

* fix min lengths
```
2410d0f8

Swin support for any input size (#15986) · 667b823b

Francesco Saverio Zuppichini authored Mar 16, 2022



* padding done

* correctly return one attention per layer

* almost correct, attentions are not flatten one tuple per stage

* tests green

* doc

* conversations

* reshaping hidden_states

* view in the test

* reshape_hidden_states in Encoder and Model

* new outputs with reshaped_hidden_states

* conversations

* doc

* Update docs/source/model_doc/swin.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* conversations

* fix tests

* minor changes

* resolved conversations

* attentions one per stage

* typo

* typos

* typos

* function signature

* CI

* clean up tests
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

667b823b

TF: add beam search tests (#16202) · 204c54d4
Joao Gante authored Mar 16, 2022

204c54d4
Fix loading CLIPVisionConfig and CLIPTextConfig (#16198) · 19099457
Suraj Patil authored Mar 16, 2022
```
* override from_pretrained

* add tests

* remove docstrings

* fix typo

* Trigger CI
```
19099457
Update step name (#16189) · 09013efd
Yih-Dar authored Mar 16, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
09013efd
ResNet: update modules names (#16196) · 36f8c425
Francesco Saverio Zuppichini authored Mar 16, 2022
```
* updated names

* fit in one line

* typo
```
36f8c425

Adding type hints for Distilbert (#16090) · 5bdf3313

John Ryan authored Mar 16, 2022



* Distillbert type - squash

* Update src/transformers/models/distilbert/modeling_distilbert.py

Undo cleanup
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Update src/transformers/models/distilbert/modeling_distilbert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Update src/transformers/models/distilbert/modeling_distilbert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Update src/transformers/models/distilbert/modeling_distilbert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Remove type
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

5bdf3313

clearer model variable naming: blenderbot_small (#16194) · 0b8b0618
Utku Saglam authored Mar 16, 2022
```
Co-authored-by: utku saglam <utkusaglam@utku-MacBook-Pro.local>
```
0b8b0618

TF unpack_input decorator for convnext (#16181) · f06c2c2b

Johannes Kolbe authored Mar 16, 2022



* unpack_input decorator for tf_convnext

* set unpack_input as top decorator
Co-authored-by: Johannes Kolbe <johannes.kolbe@tech.better.team>

f06c2c2b

Minor fixes to XTREME-S (#16193) · d35e0c62

Anton Lozhkov authored Mar 16, 2022



* Minor fixes

* Fix vocab union

* Update examples/research_projects/xtreme-s/README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update README

* unused import
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d35e0c62

TF clearer model variable naming: blenderbot (#16192) · 8cc925a2
Utku Saglam authored Mar 16, 2022
```
Co-authored-by: utku saglam <utkusaglam@utku-MacBook-Pro.local>
```
8cc925a2
TF clearer model variable naming: funnel (#16178) · 0f35cda4
Utku Saglam authored Mar 16, 2022
```
Co-authored-by: utku saglam <utkusaglam@utku-MacBook-Pro.local>
```
0f35cda4
Replace all deprecated `jax.ops` operations with jnp's `at` (#16078) · ee27b3d7
Sanchit Gandhi authored Mar 16, 2022
```
* Replace all deprecated `jax.ops` operations with jnp's `at`

* np to jnp scores

* suggested changes
```
ee27b3d7
[Xtreme-S] fix some namings (#16183) · c2dc89be
Patrick von Platen authored Mar 16, 2022

c2dc89be

15 Mar, 2022 19 commits

Add the XTREME-S fine-tuning example (#15985) · 99fd3eb4

Anton Lozhkov authored Mar 16, 2022

* CTC+classification draft

* CTC+classification draft

* style

* multilingual runs

* Fix race condition during processor.from_reatrained

* Merge covost experiments

* Add README

* Quality

* Switch to .all configs

* Fix typos

99fd3eb4

Trigger doc build · db4dd44a
Sylvain Gugger authored Mar 15, 2022

db4dd44a

Fix some Flax models' `hidden_states` (#16167) · ea05d671

Yih-Dar authored Mar 15, 2022



* fix the last element in `hidden_states`

* fix missing elements in outputs for FlaxWav2Vec2EncoderLayerStableLayerNormCollection
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ea05d671

Added type hints for Reformer (#16175) · 88f7c564
Dan Tegzes authored Mar 15, 2022

88f7c564
Add type annotations for Perceiver (#16174) · 16399d61
Jack McDonald authored Mar 16, 2022

16399d61
TF clearer model variable naming: xlnet (#16150) · 015de6f0
Kamal Raj authored Mar 15, 2022

015de6f0

Add flaubert types (#16118) · a23a7c0c

Thomas Chaigneau authored Mar 15, 2022

* Add type hints for FlauBERT PyTorch Base model. Others downstream tasks are inherited from XLM RoBERTa.

* Add type hints for FlaubERT Tensorflow models.

* fix output for TFFlaubertWithLMHeadModel

a23a7c0c

TF clearer model variable naming: Deberta (#16146) · 366c18f4
Kamal Raj authored Mar 15, 2022

366c18f4
TF clearer model variable naming: Tapas (#16145) · 79465ac5
Kamal Raj authored Mar 15, 2022

79465ac5
[MT5Config] add relative_attention_max_distance in config (#16170) · a78565b7
Suraj Patil authored Mar 15, 2022

a78565b7
Framework split (#16030) · 4f4e5ddb
Sylvain Gugger authored Mar 15, 2022
```
* First files

* More files

* Last files

* Style
```
4f4e5ddb
added type hints to yoso (#16163) · 4a353cac
mowafess authored Mar 15, 2022

4a353cac
update transformer XL with tf decorator (#16166) · c1c17bd0
Joydeep Bhattacharjee authored Mar 15, 2022
```
* update transformer XL with tf decorator

* code fixup

* remove unused variables
```
c1c17bd0
Change unpacking of TF inputs: layoutlm, mpnet, rag, and roformer (#16112) · 611d3a09
Minh Chien Vu authored Mar 15, 2022
```
Co-authored-by: ChienVM <chien_vm@detomo.co.jp>
```
611d3a09
TF clearer model variable naming: pegasus (#16152) · 0d7322c1
Kamal Raj authored Mar 15, 2022

0d7322c1

TF XLA greedy generation (#15786) · cd4c5c90

Matt authored Mar 15, 2022



* First attempt at TF XLA generation

* Fix comments

* Update XLA greedy generate with direct XLA calls

* Support attention mask, prepare_inputs_for_generation no longer hardcoded for greedy

* Handle position_ids correctly

* make xla generate work for non xla case

* force using xla generate

* refactor

* more fixes

* finish cleaning

* finish

* finish

* clean gpt2 tests

* add gpt2 tests

* correct more cases

* up

* finish

* finish

* more fixes

* flake 8 stuff

* final rag fix

* Update src/transformers/models/rag/modeling_tf_rag.py

* finish t5 as well

* finish

* Update src/transformers/generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

cd4c5c90

[Fix doc example] Fix 2 PyTorch Vilt docstring examples (#16076) · e5bc438c

Yih-Dar authored Mar 15, 2022



* fix 2 pytorch vilt docstring examples

* add vilt to doctest list file

* remove device
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e5bc438c

[Fix doc example] Fix first example for the custom_datasets tutorial (#16087) · bcaf5660

Markus Sagen authored Mar 15, 2022

* Fix inconsistent example variable naming

- Example code for a sequence classification in Tensorflow had spelling mistakes and incorrect and inconsistent naming
- Changed variable naming to be consistent with the two other TF examples

* Fix incorrect incorrect training examples

bcaf5660

Use templates (#16142) · 8bfd2fb8

Sylvain Gugger authored Mar 15, 2022

* Use tempaltes for all doc building jobs

* Add this branch to the doc build

* Switch to main branch

8bfd2fb8