Commits · 1b0ca7d270d9af0e997bb857c67b41b879c60596 · chenpangpang / transformers

20 Dec, 2021 6 commits
- Update CONTRIBUTING.md (#14835) · 1b0ca7d2
  Kamal Raj authored Dec 20, 2021
```
fix cmd typo
```
  1b0ca7d2
- Add an argument to set bucket_cap_mb for PyTorch DDP (#14756) · 1531b319
  Chang Lan authored Dec 20, 2021
```
* [trainer] Set bucket_cap_mb for DDP from arguments

* Put find_unused_parameters into kwargs
```
  1531b319
- Add SD and SV heads for WavLM (#14847) · 3883e3a7
  Anton Lozhkov authored Dec 20, 2021
```
* Add converted heads

* Add dummies
```
  3883e3a7
- [WavLM] Fix slow tests (#14845) · cd583bda
  Patrick von Platen authored Dec 20, 2021
  
  cd583bda
- up (#14829) · 281e1fba
  Patrick von Platen authored Dec 20, 2021
  
  281e1fba
- [Seq2SeqTrainer] Remove model input name hack (#14802) · 091693b4
  Patrick von Platen authored Dec 20, 2021
```
* [Seq2SeqTrainer] Remove model input name hack

* Update src/transformers/trainer_seq2seq.py

* make style

* finish
```
  091693b4
17 Dec, 2021 8 commits

[ImageGPT] Deprecate pixel_values input name to input_ids (#14801) · 84ea427f

Patrick von Platen authored Dec 17, 2021



* [ImageGPT] Deprecate pixel_values input name to input_ids

* up

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* correct

* finish
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

84ea427f

Wav2Vec2 meets phonemes (#14353) · c4a96cec

Patrick von Platen authored Dec 17, 2021



* up

* add tokenizer

* improve more

* finish tokenizer

* finish

* adapt speech recognition script

* adapt convert

* more fixes

* more fixes

* update phonemizer wav2vec2

* better naming

* fix more tests

* more fixes swedish

* correct tests

* finish

* improve script

* remove file

* up

* lets get those 100 model architectures until the end of the month

* make fix-copies

* correct more

* correct script

* more fixes

* more fixes

* add to docs

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* replace assert

* fix copies

* fix docs

* new try docs

* boom boom

* update

* add phonemizer to audio tests

* make fix-copies

* up

* upload models

* some changes

* Update tests/test_tokenization_wav2vec2_phoneme.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* more fixes

* remove @
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

c4a96cec

Convert rst to mdx bert (#14806) · 77d6c826

Lysandre Debut authored Dec 17, 2021



* BERT to mdx
mdx :)
c

* Update docs/source/model_doc/bert.mdx
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Remove all
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>

77d6c826

Trigger doc building · 0b4ea79a
Sylvain Gugger authored Dec 17, 2021

0b4ea79a

Implement head_mask for Flax BERT and other models copied from BERT (#14620) · ff066119

Daniel Stancl authored Dec 17, 2021

* Implement head_mask for Flax BERT and other models copied from BERT

* Remove `from jax._src.nn.functions import sigmoid`

Remove `from jax._src.nn.functions import sigmoid` unintentionally added by IDE

* Remove no more valid copy statement

* Apply patil-suraj's suggestions from code review

* Apply suggestions from the code review

* Update Flax template

* Fix a typo

* Also update template for CausalLM modules

ff066119

[Generate] Correct input_ids detection (#14815) · 95119ad7
Patrick von Platen authored Dec 17, 2021
```
* [Generate] Correct input_ids detection

* correct
```
95119ad7
[WavLM] Layerdrop is not allowed for first layer (#14811) · bdbe3df8
Patrick von Platen authored Dec 17, 2021
```
* [WavLM] Layerdrop is not allowed for first layer

* Apply suggestions from code review
```
bdbe3df8
Add test (#14810) · cbf036f7
NielsRogge authored Dec 17, 2021

cbf036f7

16 Dec, 2021 11 commits

[WavLM] Correct position bias computation (#14805) · c4a0fb51
Patrick von Platen authored Dec 16, 2021

c4a0fb51
Remove datasets requirement (#14795) · d194d639
Lysandre Debut authored Dec 16, 2021

d194d639

Add WavLM (#14354) · bef1e3e4

Patrick von Platen authored Dec 16, 2021



* first commit

* fix some stuff

* fix more readme

* Apply suggestions from code review

* update

* correct

* up

* attn layer works

* push code

* make modedls work

* Small change

* more refactor

* finish

* up

* fix convertsion

* fix position bias

* Fix style

* fix conversion

* make fix-copies

* add

* clean

* fix docs

* fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply final changes

* make fix-copies
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

bef1e3e4

[Generate] Make generate multi-modal (#14784) · b18d8534

Patrick von Platen authored Dec 16, 2021



* finish refactor

* refactor

* add tests

* add more tests

* up

* finish tests

* finish

* up

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* improve docstring

* fix docs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b18d8534

Add Speaker Diarization and Verification heads (#14723) · 48463ebb

Anton Lozhkov authored Dec 16, 2021

* Models

* Squashed commit of the following:

commit 72278e1e931a16d0879acc77f65762f3364833d0
Author: anton-l <aglozhkov@gmail.com>
Date:   Fri Dec 10 21:45:08 2021 +0300

* Add unispeech heads

* Add sd/sv automodels

* Docs cleanup

* Fix docstrings

* rename xvector classes

* examples

* Tests cleanup

* Style

* Better checkpoints for tests

* leftover docs

* apply review suggestions

* Style + init tests

* Update unispeech-sat tdnn downsampling

48463ebb

Train step fix (#14796) · 2e07180c

Matt authored Dec 16, 2021

* Fix for TF train step when no "labels" key in input

* make style

2e07180c

Update CONTRIBUTING.md (#14800) · 465a8b8d
Kamal Raj authored Dec 16, 2021
```
fix pip installation cmd
```
465a8b8d
Update CONTRIBUTING.md (#14799) · 8ae24e19
Kamal Raj authored Dec 16, 2021
```
typo
```
8ae24e19
Fix the build documentation job (#14788) · 12e1b4c6
Sylvain Gugger authored Dec 16, 2021
```
* Fix the build documentation job

* Fix install

* Address review comment
```
12e1b4c6

Post sphinx-clean up and contributing guide updates (#14790) · 5061a9fd

Sylvain Gugger authored Dec 16, 2021



* Clean up sphinx

* Update contributing guide

* Update docs README

* No example title

* Fix copies

* Update CONTRIBUTING.md
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

5061a9fd

Removes images to put them in a dataset (#14781) · 8010fda9
Lysandre Debut authored Dec 16, 2021
```
* First try

* Update instructions
```
8010fda9

15 Dec, 2021 12 commits

PoC for conserving old links (#14754) · 459677ae

Sylvain Gugger authored Dec 15, 2021



* PoC for conserving old links

* Do the same for other links

* remap the redirects section

* add instructions on how to move sections

* improve
Co-authored-by: Stas Bekman <stas@stason.org>

459677ae

Move import (#14787) · c40ecfd7
Sylvain Gugger authored Dec 15, 2021

c40ecfd7
Docs for v4.14.0 · 7c9c41f4
Lysandre authored Dec 15, 2021

7c9c41f4
Release: v4.14.0 · 960d8cb4
Lysandre authored Dec 15, 2021

960d8cb4

Improve Perceiver docs (#14786) · aece7bad

NielsRogge authored Dec 15, 2021



* Fix docs

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Code quality
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

aece7bad

Update Perceiver code examples (#14783) · 50bc57ce
NielsRogge authored Dec 15, 2021
```
* Fix code examples

* Fix code example
```
50bc57ce

TF model cards (#14720) · 48d48276

Matt authored Dec 15, 2021

* Initial commit for Keras model cards

* Revert accidental change

* make style

* make style

* make style

* Fix PR comments

* Move repo creation to __init__

* Fixes to README.md creation

* Partial progress for proper card creation on `push_to_hub`

* Proper card creation from `push_to_hub` plus fixes for malformed model cards

* Fixes for model card creation outside the callback

* Adding a model card creation test

* Putting the model card creation test in the right file.
Good job, Matt.

* make style

* Fix model card test temp dir usage

* Fix model card creation when no optimizer present

* Fixes for when training history not present

* Fix accidental edit to test_modeling_common

48d48276

Update t5.rst (#14776) · 72c6e8b8
Xing Han Lu authored Dec 15, 2021

72c6e8b8
Fix preprocess_function in run_summarization_flax.py (#14769) · a94105f9
Yih-Dar authored Dec 15, 2021
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a94105f9

Fix the doc_build_test job (#14774) · 7e61d56a

Sylvain Gugger authored Dec 15, 2021

* Fake new model

* Fix doc-building test job

* Is this the problem?

* Another try

* Typo

* Clean up

* Can we do without -e ?

* Clean setup

7e61d56a

[doc] performance: groups of operations by compute-intensity (#14757) · fdf3ce28
Stas Bekman authored Dec 14, 2021

fdf3ce28

Fix broken links to distillation on index page of documentation (#14722) · 851a7897

Amit Chaudhary authored Dec 15, 2021

* Fix broken links to distillation on index page of documentation

* Fix broken link for distillation in main README

* Run make fixup

851a7897

14 Dec, 2021 3 commits

Adding support for multiple mask tokens. (#14716) · e7ed7ffd

Nicolas Patry authored Dec 14, 2021

* Adding support for multiple mask tokens.

- Original implem: https://github.com/huggingface/transformers/pull/10222

Co-authored-by: njafer <naveen.jafer@oracle.com>

* In order to accomodate optionally multimodal models like Perceiver

we add information to the tasks to specify tasks where we know for sure
if we need the tokenizer/feature_extractor or not.

* Adding info in the documentation about multi masks.

+ marked as experimental.

* Add a copy() to prevent overriding the same tensor over and over.

* Fixup.

* Adding small test for multi mask with real values..
Co-authored-by: njafer <naveen.jafer@oracle.com>

e7ed7ffd

Make data shuffling in `run_clm_flax.py` respect global seed (#13410) · 2a606f99
Benjamin Minixhofer authored Dec 14, 2021
```
* use jax and jnp instead of numpy in data_loader

* return batches as np.ndarray
```
2a606f99

Fixing tests for Perceiver (#14739) · 546a91ab

Nicolas Patry authored Dec 14, 2021

* Adding some slow test to check for perceiver at least from a high level.

* Re-enabling fast tests for Perceiver ImageClassification.

* Perceiver might try to run without Tokenizer (Fast doesn't exist) and
with FeatureExtractor some text only pipelines.

* Oops.

* Adding a comment for `update_config_with_model_class`.

* Remove `model_architecture` to get `tiny_config`.

* Finalize rebase.

* Smarter way to handle undefined FastTokenizer.

* Remove old code.

* Addressing some nits.

* Don't instantiate `None`.

546a91ab