Commits · 5fdb54ece78b5d277fe26a3865beca8da0430495 · chenpangpang / transformers

18 May, 2022 7 commits

Add Information Gain Filtration algorithm (#16953) · 5fdb54ec

mraunak authored May 18, 2022



* Add information gain filtration algorithm

* Complying with black requirements

* Added author

* Fixed import order

* flake8 corrections
Co-authored-by: Javier Turek <javier.turek@intel.com>

5fdb54ec

Fix typo (#17328) · 91ede485
Kamal Raj authored May 18, 2022

91ede485

remove (#17325) · fe28eb94

Yih-Dar authored May 18, 2022


Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

fe28eb94

Accepting real pytorch device as arguments. (#17318) · 2cb2ea3f
Nicolas Patry authored May 18, 2022
```
* Accepting real pytorch device as arguments.

* is_torch_available.
```
2cb2ea3f
Updating the docs for `max_seq_len` in QA pipeline (#17316) · 1c9d1f4c
Nicolas Patry authored May 18, 2022

1c9d1f4c

[T5] Fix init in TF and Flax for pretraining (#17294) · 60ad7344

Patrick von Platen authored May 18, 2022



* fix init

* Apply suggestions from code review

* fix

* finish

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

60ad7344

Add type hints for ProphetNet (Pytorch) (#17223) · 7ba1d4e5

Joaq authored May 18, 2022



* added type hints to prophetnet

* reformatted with black

* fix bc black misformatted some parts

* fix imports

* fix imports

* Update src/transformers/models/prophetnet/configuration_prophetnet.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* update OPTIONAL type hint and docstring
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

7ba1d4e5

17 May, 2022 17 commits

Add trajectory transformer (#17141) · d6b8e9ce

Carl authored May 18, 2022



* Add trajectory transformer


Fix model init


Fix end of lines for .mdx files

Add trajectory transformer model to toctree

Add forward input docs

Fix docs, remove prints, simplify prediction test

Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Update docs, more descriptive comments

Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Update readme

Small comment update and add conversion script

Rebase and reformat

Fix copies

Fix rebase, remove duplicates

Fix rebase, remove duplicates

* Remove tapex

* Remove tapex

* Remove tapex

d6b8e9ce

fix (#17310) · c3526400
Patrick von Platen authored May 18, 2022

c3526400

[LED] fix global_attention_mask not being passed for generation and docs... · d9050dc7

Cesare Campagnano authored May 17, 2022


[LED] fix global_attention_mask not being passed for generation and docs clarification about grad checkpointing (#17112)

* [LED] fixed global_attention_mask not passed for generation + docs clarification for gradient checkpointing

* LED docs clarification
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* [LED] gradient_checkpointing=True should be passed to TrainingArguments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* [LED] docs: remove wrong word
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* [LED] docs fix typo
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d9050dc7

Add support for pretraining recurring span selection to Splinter (#17247) · bad35839

Jean Vancoppenolle authored May 17, 2022



* Add SplinterForSpanSelection for pre-training recurring span selection.

* Formatting.

* Rename SplinterForSpanSelection to SplinterForPreTraining.

* Ensure repo consistency

* Fixup changes

* Address SplinterForPreTraining PR comments

* Incorporate feedback and derive multiple question tokens per example.

* Update src/transformers/models/splinter/modeling_splinter.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/models/splinter/modeling_splinter.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Jean Vancoppenole <jean.vancoppenolle@retresco.de>
Co-authored-by: Tobias Günther <tobias.guenther@retresco.de>
Co-authored-by: Tobias Günther <github@tobigue.de>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

bad35839

Add PR author in CI report + merged by info (#17298) · 05113055

Yih-Dar authored May 17, 2022



* Add author info to CI report

* Add merged by info

* update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

05113055

Fix dummy creation script (#17304) · 032d63b9
Sylvain Gugger authored May 17, 2022

032d63b9
Fix style · 986dd5c5
Sylvain Gugger authored May 17, 2022

986dd5c5

Doctest longformer (#16441) · 38ddab10

Karim Foda authored May 17, 2022



* Add initial doctring changes

* make fixup

* Add TF doc changes

* fix seq classifier output

* fix quality errors

* t

* swithc head to random init

* Fix expected outputs

* Update src/transformers/models/longformer/modeling_longformer.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

38ddab10

[Test] Fix W2V-Conformer integration test (#17303) · 10704e12
Patrick von Platen authored May 17, 2022
```
* [Test] Fix W2V-Conformer integration test

* correct w2v2

* up
```
10704e12

Improve mismatched sizes management when loading a pretrained model (#17257) · 28a08116

regisss authored May 17, 2022

- Add --ignore_mismatched_sizes argument to classification examples

- Expand the error message when loading a model whose head dimensions are different from expected dimensions

28a08116

correct opt (#17301) · 1f13ba81
Patrick von Platen authored May 17, 2022

1f13ba81

Rewrite TensorFlow train_step and test_step (#17057) · 349f1c85

Matt authored May 17, 2022

* Initial commit

* Better label renaming

* Remove breakpoint before pushing (this is your job)

* Test a lot more in the Keras fit() test

* make fixup

* Clarify the case where we flatten y dicts into tensors

* Clarify the case where we flatten y dicts into tensors

* Extract label name remapping to a method

349f1c85

Fix tests of mixed precision now that experimental is deprecated (#17300) · 651e48e1

Matt authored May 17, 2022

* Fix tests of mixed precision now that experimental is deprecated

* Fix mixed precision in training_args_tf.py too

651e48e1

fix retribert's `test_torch_encode_plus_sent_to_model` (#17231) · 6d211429
SaulLu authored May 17, 2022

6d211429
[ConvNeXT] Fix drop_path_rate (#17280) · ec7f8af1
NielsRogge authored May 17, 2022
```
* Fix drop_path_rate

* Fix TF's drop path rate
```
ec7f8af1
Fix wrong PT/TF categories in CI report (#17272) · a26ab95e
Yih-Dar authored May 17, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a26ab95e

Fix missing job action button in CI report (#17270) · 1ac2b8fa

Yih-Dar authored May 17, 2022



* use matrix.machine_type

* fix job names used in job_link
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

1ac2b8fa

16 May, 2022 16 commits

Add Wav2Vec2Conformer (#16812) · 5a995735

Patrick von Platen authored May 17, 2022



* save intermediate

* add wav2vec2 conformer

* add more code

* more

* first test passes

* make all checkpoints work

* update

* up

* more clean ups

* save clean-up

* save clean-up

* save more

* remove bogus

* finalize design conformer

* remove vision

* finish all tests

* more changes

* finish code

* add doc tests

* add slow tests

* fix autoconfig test

* up

* correct docstring

* up

* update

* fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update docs/source/en/model_doc/wav2vec2-conformer.mdx

* upload

* save copied from

* correct configs

* fix model outputs

* add to docs

* fix imports

* finish

* finish code

* correct copied from

* correct again

* correct make fix

* improve make fix copies

* save

* correct fix copy from

* correct init structure

* correct

* fix import

* apply suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

5a995735

Fix test_model_parallelization (#17249) · f0395cf5
Kyungmin Lee authored May 17, 2022
```
* Fix test_model_parallelization

* Modify
```
f0395cf5

[Tests] Fix slow opt tests (#17282) · e705e126

Patrick von Platen authored May 16, 2022

* fix opt tests

* remove unused tok

* make style

* make flake8 happy

* Update tests/models/opt/test_modeling_opt.py

e705e126

Add Tensorflow Swin model (#16988) · f6a63889

amyeroberts authored May 16, 2022


Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

f6a63889

docs(transformers): fix typo (#17263) · 6cb71873
Kevin Zehnder authored May 16, 2022

6cb71873

logging documentation update (#17174) · 053a80c6

Sander Land authored May 16, 2022



* logging documentation

* style
Co-authored-by: Sander Land <sander@chatdesk.com>

053a80c6

Use the PR URL in CI report (#17269) · 8600d770
Yih-Dar authored May 16, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8600d770
Fix FlavaForPreTrainingIntegrationTest CI test (#17232) · 3fb82f74
Yih-Dar authored May 16, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
3fb82f74
Better error in the Auto API when a dep is missing (#17289) · 9b0d2860
Sylvain Gugger authored May 16, 2022

9b0d2860
Make TrainerHyperParameterSigOptIntegrationTest slow test (#17288) · 66b3e106
Yih-Dar authored May 16, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
66b3e106

Automatically sort auto mappings (#17250) · ddb1a47e

Sylvain Gugger authored May 16, 2022

* Automatically sort auto mappings

* Better class extraction

* Some auto class magic

* Adapt test and underlying behavior

* Remove re-used config

* Quality

ddb1a47e

Mlflowcallback fix nonetype error (#17171) · 2f611f85
Nicolas Brousse authored May 16, 2022
```
* Fix edge cases TypeError: 'NoneType' object is not callable

* fix style
```
2f611f85
Align logits and labels in OPT (#17237) · 95b6bef6
MichelBartels authored May 16, 2022

95b6bef6
Remove next sentence prediction from supported ONNX tasks (#17276) · a5d18396
lewtun authored May 16, 2022

a5d18396

CodeParrot data pretokenization (#16932) · 05a90579

Loubna Ben Allal authored May 16, 2022



* add pretokenization arguments

* add pretokenization script

* add support for pretokenized data

* reformat code

* fix run command for training

* fix model call from config

* remove a package

* add comments on pretokenization in the readme

* remove explicit parallelization
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* update readme
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* update readme -remove username
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* update readme -remove username
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* keep data parallelization

* reformat code

* reformat code

* update readme

* reformat code

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Loubna ben allal <loubnabenallal@gmail.com>

05a90579

Update codeparrot data preprocessing (#16944) · e730e125

Loubna Ben Allal authored May 16, 2022



* add new preprocessing arguments

* add new filters

* add new filters to readme

* fix config and test count, update function names and docstrings

* reformat code

* update readme

* Update readme

* rename config_test filter
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* rename few_assignments filter
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* rename tokenizer in arguments
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* rename functions and add limit_line argument for config_test filter

* update threshold for config_test filter
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Loubna ben allal <loubnabenallal@gmail.com>

e730e125