Commits · 10704e12094b09a069bb4375a422c83a3c4f44b1 · chenpangpang / transformers

17 May, 2022 9 commits

[Test] Fix W2V-Conformer integration test (#17303) · 10704e12
Patrick von Platen authored May 17, 2022
```
* [Test] Fix W2V-Conformer integration test

* correct w2v2

* up
```

Improve mismatched sizes management when loading a pretrained model (#17257) · 28a08116

regisss authored May 17, 2022

- Add --ignore_mismatched_sizes argument to classification examples

- Expand the error message when loading a model whose head dimensions are different from expected dimensions


correct opt (#17301) · 1f13ba81
Patrick von Platen authored May 17, 2022


Rewrite TensorFlow train_step and test_step (#17057) · 349f1c85

Matt authored May 17, 2022

* Initial commit

* Better label renaming

* Remove breakpoint before pushing (this is your job)

* Test a lot more in the Keras fit() test

* make fixup

* Clarify the case where we flatten y dicts into tensors

* Clarify the case where we flatten y dicts into tensors

* Extract label name remapping to a method


Fix tests of mixed precision now that experimental is deprecated (#17300) · 651e48e1

Matt authored May 17, 2022

* Fix tests of mixed precision now that experimental is deprecated

* Fix mixed precision in training_args_tf.py too


fix retribert's `test_torch_encode_plus_sent_to_model` (#17231) · 6d211429
SaulLu authored May 17, 2022

[ConvNeXT] Fix drop_path_rate (#17280) · ec7f8af1
NielsRogge authored May 17, 2022
```
* Fix drop_path_rate

* Fix TF's drop path rate
```
Fix wrong PT/TF categories in CI report (#17272) · a26ab95e
Yih-Dar authored May 17, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```

Fix missing job action button in CI report (#17270) · 1ac2b8fa

Yih-Dar authored May 17, 2022



* use matrix.machine_type

* fix job names used in job_link
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


16 May, 2022 21 commits

Add Wav2Vec2Conformer (#16812) · 5a995735

Patrick von Platen authored May 17, 2022



* save intermediate

* add wav2vec2 conformer

* add more code

* more

* first test passes

* make all checkpoints work

* update

* up

* more clean ups

* save clean-up

* save clean-up

* save more

* remove bogus

* finalize design conformer

* remove vision

* finish all tests

* more changes

* finish code

* add doc tests

* add slow tests

* fix autoconfig test

* up

* correct docstring

* up

* update

* fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update docs/source/en/model_doc/wav2vec2-conformer.mdx

* upload

* save copied from

* correct configs

* fix model outputs

* add to docs

* fix imports

* finish

* finish code

* correct copied from

* correct again

* correct make fix

* improve make fix copies

* save

* correct fix copy from

* correct init structure

* correct

* fix import

* apply suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>


Fix test_model_parallelization (#17249) · f0395cf5
Kyungmin Lee authored May 17, 2022
```
* Fix test_model_parallelization

* Modify
```

[Tests] Fix slow opt tests (#17282) · e705e126

Patrick von Platen authored May 16, 2022

* fix opt tests

* remove unused tok

* make style

* make flake8 happy

* Update tests/models/opt/test_modeling_opt.py


Add Tensorflow Swin model (#16988) · f6a63889

amyeroberts authored May 16, 2022


Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


docs(transformers): fix typo (#17263) · 6cb71873
Kevin Zehnder authored May 16, 2022


logging documentation update (#17174) · 053a80c6

Sander Land authored May 16, 2022



* logging documentation

* style
Co-authored-by: Sander Land <sander@chatdesk.com>


Use the PR URL in CI report (#17269) · 8600d770
Yih-Dar authored May 16, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
Fix FlavaForPreTrainingIntegrationTest CI test (#17232) · 3fb82f74
Yih-Dar authored May 16, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
Better error in the Auto API when a dep is missing (#17289) · 9b0d2860
Sylvain Gugger authored May 16, 2022

Make TrainerHyperParameterSigOptIntegrationTest slow test (#17288) · 66b3e106
Yih-Dar authored May 16, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```

Automatically sort auto mappings (#17250) · ddb1a47e

Sylvain Gugger authored May 16, 2022

* Automatically sort auto mappings

* Better class extraction

* Some auto class magic

* Adapt test and underlying behavior

* Remove re-used config

* Quality


Mlflowcallback fix nonetype error (#17171) · 2f611f85
Nicolas Brousse authored May 16, 2022
```
* Fix edge cases TypeError: 'NoneType' object is not callable

* fix style
```
Align logits and labels in OPT (#17237) · 95b6bef6
MichelBartels authored May 16, 2022

Remove next sentence prediction from supported ONNX tasks (#17276) · a5d18396
lewtun authored May 16, 2022


CodeParrot data pretokenization (#16932) · 05a90579

Loubna Ben Allal authored May 16, 2022



* add pretokenization arguments

* add pretokenization script

* add support for pretokenized data

* reformat code

* fix run command for training

* fix model call from config

* remove a package

* add comments on pretokenization in the readme

* remove explicit parallelization
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* update readme
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* update readme -remove username
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* update readme -remove username
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* keep data parallelization

* reformat code

* reformat code

* update readme

* reformat code

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Loubna ben allal <loubnabenallal@gmail.com>


Update codeparrot data preprocessing (#16944) · e730e125

Loubna Ben Allal authored May 16, 2022



* add new preprocessing arguments

* add new filters

* add new filters to readme

* fix config and test count, update function names and docstrings

* reformat code

* update readme

* Update readme

* rename config_test filter
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* rename few_assignments filter
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* rename tokenizer in arguments
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* rename functions and add limit_line argument for config_test filter

* update threshold for config_test filter
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Loubna ben allal <loubnabenallal@gmail.com>


Updated checkpoint support for Sagemaker Model Parallel (#17219) · 518dd127

cavdard authored May 16, 2022



* adding partial checkpoint support for optimizer state

* formatted trainer.py

* Refactoring based on comments

* reformatting

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Cavdar <dcavdar@a07817b12d7e.ant.amazon.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


fixed bug in run_mlm_flax_stream.py (#17203) · 71d18d08

Kenneth Enevoldsen authored May 16, 2022



* fixed bug run_mlm_flax_stream.py

Fixed bug caused by an update to tokenizer keys introduced in recent transformers versions (between `4.6.2` and `4.18.0`) where additional keys were introduced to the tokenizer output.

* Update run_mlm_flax_stream.py

* adding missing parenthesis

* formatted to black

* remove cols from dataset instead

* reformat to black

* moved rem. columns to map

* formatted to black
Co-authored-by: KennethEnevoldsen <kennethcenevolsen@gmail.com>


[WIP] [doc] performance/scalability revamp (#15723) · 71abd3ad

Stas Bekman authored May 16, 2022



* [doc] performance/scalability revamp

* link the new docs

* no :

* mixed precision

* work on the first doc

* expand the main doc

* Trigger CI

* style

* revamp single GPU training section

* work on training performance

* remove files not used anymore or will be added later

* final touches

* fix rebase

* Add hardware section to toctree

* fix toctree again

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove `fast_tokenizers` entry that was copied in rebase

* add warning about DP vs DDP

* remove todo

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix missing closure of codeblock

* Update docs/source/en/perf_train_gpu_many.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* sync with #16860

* update toc
Co-authored-by: leandro <leandro.vonwerra@spoud.io>
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


TF - Fix convnext classification example (#17261) · d3d87b45
Joao Gante authored May 16, 2022

Fix obvious typos in flax decoder impl (#17279) · e86faecf
cloudhan authored May 16, 2022
```
Change config.encoder_ffn_dim -> config.decoder_ffn_dim for decoder.
```

13 May, 2022 10 commits

Guide to create custom models in Spanish (#17158) · ee393c00

Ignacio Talavera authored May 13, 2022



* file copied and toctree updated

* Intro and configuration translated

* model section translated

* enter hotfix

* Translation over, correction pending

* Typos and corrections

* Update docs/source/es/create_a_model.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source/es/create_a_model.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source/es/create_a_model.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Update docs/source/es/create_a_model.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>


Translated version of model_sharing.mdx doc to spanish (#16184) · 16be4229

Gerardo Huerta Robles authored May 13, 2022



* Translated version of model_sharing to spanish

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Update docs/source_es/model_sharing.mdx

* Adding model sharing to _toctree.yml
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>


[ fast_tokenizers.mdx ] - Added translation to portuguese to tutorial (#17076) · f9024814

Fellip Silva Alves authored May 13, 2022



* [ fast_tokenizers.mdx ] - Added translation to portuguese to tutorial

* Delete docs/source/pt-br directory

* [ fast_tokenizers.mdx ] - Continuing work on file

* [ fast_tokenizers.mdx ] - Continuing work on file

* Add fast tokenizers to _toctree.yml

* Eliminated config and toctree.yml

* Nits in fast_tokenizers.mdx
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>


Add PR title to push CI report (#17246) · 50d1867c

Yih-Dar authored May 13, 2022



* add PR title to push CI report

* add link
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


Fix push CI channel (#17242) · 506899d1
Yih-Dar authored May 13, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
install dev. version of accelerate (#17243) · 7198b633
Yih-Dar authored May 13, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
Fix Trainer for Datasets that don't have dict items (#17239) · b96cb169
Sylvain Gugger authored May 13, 2022

Handle copyright in add-new-model-like (#17218) · 9c8fde8e
Sylvain Gugger authored May 13, 2022

fix --gpus option for docker (#17235) · 993553b2
Yih-Dar authored May 13, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```

Update self-push workflow (#17177) · 38043d84

Yih-Dar authored May 13, 2022



* update push ci

* install git-python

* update comment

* update deepspeed jobs

* fix report

* skip 2 more tests that require fairscale

* Fix changes in test_fetcher.py (to handle the case where `setup.py` is changed)

* set RUN_PT_TF_CROSS_TESTS=1 and final clean-up

* remove SIGOPT_API_TOKEN

* remove echo "$matrix_folders"
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
