Commits · 8f2cc1c3ab9fac0b49c6c9372e6557532a607024 · chenpangpang / transformers

23 Dec, 2021 2 commits

Yih-Dar authored Dec 23, 2021



* Start the work for TFCLIPModel

* Convert to TF code (TODO: loss + doc)

* Clean up

* Fix pooled_output for TFCLIPTextTransformer - using tf.gather_nd

* assert -> raise error

* Expose TFCLIPModel

* Deal with dummy_inputs

* Add tests

* Fix all tests. TODO: manual check weight loading + add more comments

* Fix pt tf equivalence test

* fixes

* update TFCLIPVisionEmbeddings's Conv2D

* Fix loss + overwrite test_pt_tf_model_equivalence from common

* Add a comment about the change about MainLayer in test_keras_save_load

* Set return_loss=True in TFCLIPModelTester + make tests pass

* overwrite test_pt_tf_model_equivalence from tf common

* fix base_model_prefix

* Fix examples

* remove unused

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply review suggestions

* change self.pre_layrnorm to self.pre_layernorm

* apply more review suggestions

* return attention probs before dropout (to align with PT)

* fix weight init

* fix

* build doc

* fix missing doc

* fix for test
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8f2cc1c3

Add ONNX support for MarianMT models (#14586) · 6b655cc6

lewtun authored Dec 23, 2021

* First commit to add MarianMT to ONNX

* Now MarianModel.forward() automatically generates decoder_input_ids, like BartModel.forward()

* Adjusted MarianOnnxConfig.inputs and outputs to work with seq2seq-lm feature

* Style fix

* Added support for other features for already supported models

* Partial support for causal and seq2seq models

* Partial support for causal and seq2seq models

* Add default task for MarianMT ONNX

* Remove automatic creation of decoder_input_ids

* Extend inputs and outputs for MarianMT ONNX config

* Add MarianMT to ONNX unit tests

* Refactor

* OnnxSeq2SeqConfigWithPast to support seq2seq models

* Parameterized the onnx tests

* Restored run_mlm.py

* Restored run_mlm.py

* [WIP] BART update

* BART and MBART

* Add past_key_values and fix dummy decoder inputs

Using a sequence length of 1 in generate_dummy_outputs() produces large discrepancies, presumably due to some hidden optimisations.

* Refactor MarianOnnxConfig to remove custom past_key_values logic

* Fix quality

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Refactor Marian export to account for base changes

* Fix copies

* Implemented suggestions

* Extend support for causal LM

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Implemented suggestions

* Fixed __init__ to resolve conflict with master

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5

.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Implemented suggestions

* Fixed __init__ to resolve conflict with master

* Remove commented import

* Remove ONNX model

* Remove redundant class method

* Tidy up imports

* Fix quality

* Refactor dummy input function

* Add copied from statements to Marian config functions

* Remove false copied from comments

* Fix copy from comment
Co-authored-by: Massimiliano Bruni <massimiliano.bruni@hcl.com>
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

6b655cc6

22 Dec, 2021 4 commits

Convert rst files (#14888) · 207594be

Sylvain Gugger authored Dec 22, 2021

* Convert all tutorials and guides

* Convert all remaining rst to mdx

* Track and fix bad links

207594be

Fix Perceiver docs (#14879) · 7df4b90c
NielsRogge authored Dec 22, 2021

7df4b90c

Feature/fix slow test in mluke (#14749) · 824fd44f

Ryokan RI authored Dec 22, 2021

* make MLukeTokenizerTest fast

* make LukeTokenizerTest fast

* add entry to _toctree.yaml

824fd44f

Convert model files from rst to mdx (#14865) · ec3567fe

Lysandre Debut authored Dec 22, 2021



* First pass

* Apply suggestions from code review

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

ec3567fe

21 Dec, 2021 2 commits

[doc porting] several docs (#14858) · 18587639

Stas Bekman authored Dec 21, 2021



* [doc porting] 2 docs

* [doc porting] 2 docs

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/main_classes/deepspeed.mdx

* cleanup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

18587639

[logging] implement warning_advice / TRANSFORMERS_NO_ADVISORY_WARNINGS (#14669) · b6ec9569
Stas Bekman authored Dec 20, 2021
```
* [logging] implement warning_advice / TRANSFORMERS_NO_ADVISORY_WARNINGS

* reword
```
b6ec9569

20 Dec, 2021 4 commits
- [doc] typo (#14849) · c1125dc2
  Stas Bekman authored Dec 20, 2021
```
fix small typo
```
  c1125dc2
- [Perceiver] Skip multi-gpu tests for now (#14813) · 952a77b0
  Patrick von Platen authored Dec 20, 2021
```
* [Perceiver] Skip multi-gpu tests for now

* Update tests/test_modeling_perceiver.py

* up

* up
```
  952a77b0
- Fix dead link to benchmarks.ipynb (#14842) · 8a818c26
  Derek Chia authored Dec 20, 2021
```
Notebook has been updated here https://github.com/huggingface/notebooks/tree/master/examples/benchmark.ipynb
```
  8a818c26
- Add SD and SV heads for WavLM (#14847) · 3883e3a7
  Anton Lozhkov authored Dec 20, 2021
```
* Add converted heads

* Add dummies
```
  3883e3a7
17 Dec, 2021 2 commits

Wav2Vec2 meets phonemes (#14353) · c4a96cec

Patrick von Platen authored Dec 17, 2021



* up

* add tokenizer

* improve more

* finish tokenizer

* finish

* adapt speech recognition script

* adapt convert

* more fixes

* more fixes

* update phonemizer wav2vec2

* better naming

* fix more tests

* more fixes swedish

* correct tests

* finish

* improve script

* remove file

* up

* lets get those 100 model architectures until the end of the month

* make fix-copies

* correct more

* correct script

* more fixes

* more fixes

* add to docs

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* replace assert

* fix copies

* fix docs

* new try docs

* boom boom

* update

* add phonemizer to audio tests

* make fix-copies

* up

* upload models

* some changes

* Update tests/test_tokenization_wav2vec2_phoneme.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* more fixes

* remove @
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

c4a96cec

Convert rst to mdx bert (#14806) · 77d6c826

Lysandre Debut authored Dec 17, 2021



* BERT to mdx
mdx :)
c

* Update docs/source/model_doc/bert.mdx
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Remove all
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>

77d6c826

16 Dec, 2021 3 commits

Add WavLM (#14354) · bef1e3e4

Patrick von Platen authored Dec 16, 2021



* first commit

* fix some stuff

* fix more readme

* Apply suggestions from code review

* update

* correct

* up

* attn layer works

* push code

* make modedls work

* Small change

* more refactor

* finish

* up

* fix convertsion

* fix position bias

* Fix style

* fix conversion

* make fix-copies

* add

* clean

* fix docs

* fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply final changes

* make fix-copies
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

bef1e3e4

Add Speaker Diarization and Verification heads (#14723) · 48463ebb

Anton Lozhkov authored Dec 16, 2021

* Models

* Squashed commit of the following:

commit 72278e1e931a16d0879acc77f65762f3364833d0
Author: anton-l <aglozhkov@gmail.com>
Date:   Fri Dec 10 21:45:08 2021 +0300

* Add unispeech heads

* Add sd/sv automodels

* Docs cleanup

* Fix docstrings

* rename xvector classes

* examples

* Tests cleanup

* Style

* Better checkpoints for tests

* leftover docs

* apply review suggestions

* Style + init tests

* Update unispeech-sat tdnn downsampling

48463ebb

Removes images to put them in a dataset (#14781) · 8010fda9
Lysandre Debut authored Dec 16, 2021
```
* First try

* Update instructions
```
8010fda9

15 Dec, 2021 5 commits
- PoC for conserving old links (#14754) · 459677ae
  Sylvain Gugger authored Dec 15, 2021
```
* PoC for conserving old links

* Do the same for other links

* remap the redirects section

* add instructions on how to move sections

* improve
Co-authored-by: Stas Bekman <stas@stason.org>
```
  459677ae
- Update Perceiver code examples (#14783) · 50bc57ce
  NielsRogge authored Dec 15, 2021
```
* Fix code examples

* Fix code example
```
  50bc57ce
- Update t5.rst (#14776) · 72c6e8b8
  Xing Han Lu authored Dec 15, 2021
  
  72c6e8b8
- [doc] performance: groups of operations by compute-intensity (#14757) · fdf3ce28
  Stas Bekman authored Dec 14, 2021
  
  fdf3ce28
- Fix broken links to distillation on index page of documentation (#14722) · 851a7897
  Amit Chaudhary authored Dec 15, 2021
```
* Fix broken links to distillation on index page of documentation

* Fix broken link for distillation in main README

* Run make fixup
```
  851a7897
13 Dec, 2021 6 commits

Update Table of Contents (#14755) · 322d4169
Sylvain Gugger authored Dec 13, 2021

322d4169
Convert Trainer doc page to MarkDown (#14753) · 7533d30a
Sylvain Gugger authored Dec 13, 2021
```
* Convert Trainer doc page to MarkDown

* Fix repo consistency

* Fix the doc build test job
```
7533d30a
Small fixes for the doc (#14751) · c3cd88a9
Sylvain Gugger authored Dec 13, 2021

c3cd88a9
Swap TF and PT code inside two blocks (#14742) · fc74c845
Lucien authored Dec 13, 2021

fc74c845
Fix the perceiver docs (#14748) · 6e05bb1c
Lysandre Debut authored Dec 13, 2021

6e05bb1c

Improve documentation of some models (#14695) · 4c99e553

NielsRogge authored Dec 13, 2021



* Migrate docs to mdx

* Update TAPAS docs

* Remove lines

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply some more suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add pt/tf switch to code examples

* More improvements

* Improve docstrings

* More improvements
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

4c99e553

11 Dec, 2021 1 commit
- [doc] document MoE model approach and current solutions (#14725) · 027074f4
  Stas Bekman authored Dec 10, 2021
```
* document MoE model approach

* additional info from Samyam

* fix
```
  027074f4
10 Dec, 2021 3 commits
- Fix special character in MDX (#14721) · 5eca742f
  Sylvain Gugger authored Dec 10, 2021
  
  5eca742f
- Prevent style_doc from tempering the config file · 63c284c2
  Sylvain Gugger authored Dec 10, 2021
  
  63c284c2
- Automatically build doc notebooks (#14718) · 1b75d723
  Sylvain Gugger authored Dec 10, 2021
```
* Test workflow

* Build doc

* Make a clean build

* Add doc config

* Restore other workflows

* Final job

* Print something in else statements

* Pull before making changes
```
  1b75d723
09 Dec, 2021 3 commits
- Put back open in colab markers (#14684) · bab15564
  Sylvain Gugger authored Dec 09, 2021
  
  bab15564
- Fix : wrong link in the documentation (ConvBERT vs DistilBERT) (#14705) · 3bc7d70e
  Tikeng Notsawo Pascal Junior authored Dec 09, 2021
  
  3bc7d70e
- Fix typo in toctree (#14704) · 60be4bf8
  Mishig Davaadorj authored Dec 09, 2021
  
  60be4bf8
08 Dec, 2021 5 commits

Move pyctcdecode (#14686) · 13186d71

Sylvain Gugger authored Dec 08, 2021

* Move pyctcdecode dep

* Fix doc and last objects

* Quality

* Style

* Ignore this black

13186d71

[bf16 support] tweaks (#14580) · 12286612

Stas Bekman authored Dec 08, 2021



* [bf16 support] tweaks

* corrections
Co-authored-by: Manuel R. Ciosici <manuelrciosici@gmail.com>

12286612

Revert open-in-colab and add perceiver (#14683) · 01b8cd59
Sylvain Gugger authored Dec 08, 2021

01b8cd59

Convert tutorials (#14665) · cf36f4d7

Sylvain Gugger authored Dec 08, 2021

* Convert a few docs

* And another

* Last tutorials

* New syntax for colab links

* Convert a few docs

* And another

* Last tutorials

* New syntax for colab links

cf36f4d7

Add Perceiver IO (#14487) · 65b20b73

NielsRogge authored Dec 08, 2021

* First draft

* Style and remove mlm

* Make forward pass work

* More improvements

* More improvements

* Fix bug

* More improvements

* More improvements

* Add PerceiverTokenizer first draft

* Improve conversion script

* More improvements

* Make conversion script work for the encoder

* Make conversion script work with local pickle files

* Style & quality, fix-copies

* Add dummy input to conversion script

* Add absolute position embeddings to TextPreProcessor

* Make forward pass of encoder work

* More improvements

* Move text preprocessor to separate script

* More improvements

* More improvements

* Add post processor

* Make MLM model work

* Style

* Add PerceiverForMaskedLM

* Add PerceiverImagePreprocessor

* Make style

* Make PerceiverForImageClassification work

* More improvements

* More improvements

* Use tokenizer in conversion script

* Use PerceiverForMaskedLM in conversion script

* Define custom PerceiverModelOutput

* Improve PerceiverAttention to make it work for both MLM and image classification

* More improvements

* More improvements

* More improvements to the conversion script

* Make conversion script work for both MLM and image classification

* Add PerceiverFeatureExtractor

* More improvements

* Style and quality

* Add center cropping

* Fix bug

* Small fix

* Add print statement

* Fix bug in image preprocessor

* Fix bug with conversion script

* Make output position embeddings an nn.Parameter layer instead of nn.Embedding

* Comment out print statements

* Add position encoding classes

* More improvements

* Use position_encoding_kwargs

* Add PerceiverForImageClassificationFourier

* Make style & quality

* Add PerceiverForImageClassificationConvProcessing

* Style & quality

* Add flow model

* Move processors to modeling file

* Make position encodings modular

* Make basic decoder use modular position encodings

* Add PerceiverForOpticalFlow to conversion script

* Add AudioPreprocessor

* Make it possible for the basic decoder to use Fourier position embeddings

* Add PerceiverForMultimodalAutoencoding

* Improve model for optical flow

* Improve _build_network_inputs method

* Add print statement

* Fix device issue

* Fix device of Fourier embeddings

* Add print statements for debugging

* Add another print statement

* Add another print statement

* Add another print statement

* Add another print statement

* Improve PerceiverAudioPreprocessor

* Improve conversion script for multimodal modal

* More improvements

* More improvements

* Improve multimodal model

* Make forward pass multimodal model work

* More improvements

* Improve tests

* Fix some more tests

* Add output dataclasses

* Make more tests pass

* Add print statements for debuggin

* Add tests for image classification

* Add PerceiverClassifierOutput

* More improvements

* Make more tests pass for the optical flow model

* Make style & quality

* Small improvements

* Don't support training for optical flow model for now

* Fix _prepare_for_class for tests

* Make more tests pass, add some docs

* Add multimodal model to tests

* Minor fixes

* Fix tests

* Improve conversion script

* Make fixup

* Remove pos_dim argument

* Fix device issue

* Potential fix for OOM

* Revert previous commit

* Fix test_initialization

* Add print statements for debugging

* Fix print statement

* Add print statement

* Add print statement

* Add print statement

* Add print statement

* Add print statement

* Add print statement

* Remove need for output_shape

* Comment out output_shape

* Remove unnecessary code

* Improve docs

* Fix make fixup

* Remove PerceiverTextProcessor from init

* Improve docs

* Small improvement

* Apply first batch of suggestions from code review

* Apply more suggestions from code review

* Update docstrings

* Define dicts beforehand for readability

* Rename task to architecture in conversion script, include PerceiverModel in tests

* Add print statements for debugging

* Fix tests on GPU

* Remove preprocessors, postprocessors and decoders from main init

* Add integration test

* Fix docs

* Replace einops by torch

* Update for new docs frontend

* Rename PerceiverForImageClassification

* Improve docs

* Improve docs

* Improve docs of PerceiverModel

* Fix some more tests

* Improve center_crop

* Add PerceiverForSequenceClassification

* Small improvements

* Fix tests

* Add integration test for optical flow model

* Clean up

* Add tests for tokenizer

* Fix tokenizer by adding special tokens properly

* Fix CI

65b20b73