Commits · 4701a1a182462ba0e0b928651b5159a26c28cbad · chenpangpang / transformers

09 Dec, 2021 8 commits
- Patch release script · 4701a1a1
  Lysandre authored Dec 09, 2021
  
  4701a1a1
- Docs for v4.14.0dev0 · ab31b3e4
  Lysandre authored Dec 09, 2021
  
  ab31b3e4
- Release: v4.13.0 · 4da3a696
  Lysandre authored Dec 09, 2021
  
  4da3a696
- Fix typo in toctree (#14704) · 60be4bf8
  Mishig Davaadorj authored Dec 09, 2021
  
  60be4bf8
- add str hub token to repository when provided else fallback to default (#14682) · da7aabf2
  Philipp Schmid authored Dec 09, 2021
```
* add str hub token to repository when provided else fallback to default True

* make style
```
  da7aabf2
- Fix tests (#14703) · 7375758b
  NielsRogge authored Dec 09, 2021
  
  7375758b
- Add a job to test doc building (for realsies this time) (#14662) · 68e53e6f
  Sylvain Gugger authored Dec 09, 2021
  
  68e53e6f
- Add kenlm dep to missing tests · e9800122
  Sylvain Gugger authored Dec 08, 2021
  
  e9800122

08 Dec, 2021 19 commits

Fix doc examples: name '...' is not defined (#14687) · ee6674d4

Yih-Dar authored Dec 08, 2021



* Fix doc examples: name '...' is not defined

* remove >>> and ... in some docstrings in visual_bert
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ee6674d4

Make MLuke tokenizer tests slow (#14690) · e6219320
Sylvain Gugger authored Dec 08, 2021

e6219320

Move pyctcdecode (#14686) · 13186d71

Sylvain Gugger authored Dec 08, 2021

* Move pyctcdecode dep

* Fix doc and last objects

* Quality

* Style

* Ignore this black

13186d71

[trainer] support UserDict inputs (torch-nightly) (#14688) · d104dd46
Stas Bekman authored Dec 08, 2021

d104dd46

[bf16 support] tweaks (#14580) · 12286612

Stas Bekman authored Dec 08, 2021



* [bf16 support] tweaks

* corrections
Co-authored-by: Manuel R. Ciosici <manuelrciosici@gmail.com>

12286612

Fix wrong checkpoint paths in doc examples (#14685) · 16870d11
Yih-Dar authored Dec 08, 2021
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
16870d11
Revert open-in-colab and add perceiver (#14683) · 01b8cd59
Sylvain Gugger authored Dec 08, 2021

01b8cd59
Fixes in init (#14681) · f6b87c5f
Sylvain Gugger authored Dec 08, 2021
```
* Fixes in init

* Style
```
f6b87c5f

Improvements to Comet Integration (#14680) · fe06f8dc

Dhruv Nair authored Dec 09, 2021

* change args to address overwriting issue

* remove project name from args

* remove passing args as kwargs to experiment object

* remove passing args as kwargs to offline experiment

* fix offline directory assignment in experiment kwargs

* log checkpoint folder on training end

* log entire output_dir as asset folder

* log asset folder recursively

* end experiment at the end of training

* clean up

* clean up

* Default to always log training assets to Comet when using CometCallback

* change logging training assets to be true when running callback setup

* fix so that experiment always ends when training ends

* styling and quality fixes

* update docstring for COMET_LOG_ASSETS environment variable

* run styling and quality checks

* clean up to docstring

* remove merge markers

* change asset logging to false to avoid hitting max assets per experiment limit

* update training asset description

* fix styling

fe06f8dc

fix: verify jsonlines file in run_translation (#14660) (#14661) · 4ea19de8

Gaurang Tandon authored Dec 08, 2021

* fix: verify jsonl in run_translation (#14660)

* fix(run_translation.py): json/jsonl validation

Both json and jsonl are to be accepted as valid jsonlines file extension

* fix(run_translation.py): make black happy

* Ran make style

4ea19de8
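
The validation described in this commit (both `json` and `jsonl` accepted as jsonlines extensions) could look roughly like the sketch below. The helper name `validate_jsonlines_file` and the error text are hypothetical and not taken from `run_translation.py`; only the accepted extensions come from the commit message.

```python
import os


def validate_jsonlines_file(path, valid_extensions=("json", "jsonl")):
    """Return the file extension if it is an accepted jsonlines extension.

    Hypothetical helper for illustration; the real script may name and
    word things differently.
    """
    # splitext returns (".jsonl" or ""), so strip the leading dot.
    extension = os.path.splitext(path)[1].lstrip(".")
    if extension not in valid_extensions:
        raise ValueError(
            f"`{path}` should be a jsonlines file with one of these extensions: {valid_extensions}"
        )
    return extension
```

Called with `train.jsonl` or `train.json`, the helper returns the extension; any other extension raises `ValueError`.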

Convert tutorials (#14665) · cf36f4d7

Sylvain Gugger authored Dec 08, 2021

* Convert a few docs

* And another

* Last tutorials

* New syntax for colab links

* Convert a few docs

* And another

* Last tutorials

* New syntax for colab links

cf36f4d7

Revert "Added support for other features for already supported models (#14358)" (#14679) · 0f4e39c5
lewtun authored Dec 08, 2021
```
This reverts commit 0c70f145.
```
0f4e39c5

Added support for other features for already supported models (#14358) · 0c70f145

Michael Benayoun authored Dec 08, 2021

* Added support for other features for already supported models

* Partial support for causal and seq2seq models

* Partial support for causal and seq2seq models

* OnnxSeq2SeqConfigWithPast to support seq2seq models

* Parameterized the onnx tests

* Restored run_mlm.py

* Restored run_mlm.py

* [WIP] BART update

* BART and MBART

* Added comments

* Another sequence length of the past_key_values

0c70f145

[AutoProcessor] Add Wav2Vec2WithLM & small fix (#14675) · ee4fa2e4

Patrick von Platen authored Dec 08, 2021

* [AutoProcessor] Add Wav2Vec2WithLM & small fix

* revert line removal

* Update src/transformers/__init__.py

* add test

* up

* up

* small fix

ee4fa2e4

Fix doc builder (#14676) · 2294071a
Lysandre Debut authored Dec 08, 2021

2294071a
fix deprecated tf method (#14671) · fab3b518
ZOHETH authored Dec 08, 2021
```
tf.matrix_band_part -> tf.linalg.band_part
```
fab3b518
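
For readers unfamiliar with the renamed op, the following is a pure-Python sketch of band-part semantics as commonly documented for `tf.linalg.band_part`. It works on plain nested lists and does not call TensorFlow; the function below is an illustration, not the library implementation.

```python
def band_part(matrix, num_lower, num_upper):
    """Keep a central band of a list-of-lists matrix and zero out the rest.

    Entry (m, n) is kept when it lies within `num_lower` sub-diagonals and
    `num_upper` super-diagonals; a negative bound keeps that whole triangle.
    """
    result = []
    for m, row in enumerate(matrix):
        new_row = []
        for n, value in enumerate(row):
            in_lower = num_lower < 0 or (m - n) <= num_lower
            in_upper = num_upper < 0 or (n - m) <= num_upper
            new_row.append(value if (in_lower and in_upper) else 0)
        result.append(new_row)
    return result


ones = [[1] * 3 for _ in range(3)]
# Keep the full lower triangle and no super-diagonals.
lower_triangular = band_part(ones, -1, 0)
```

With these arguments the sketch yields a lower-triangular mask, which is a typical use of the op in attention-masking code.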

Add Perceiver IO (#14487) · 65b20b73

NielsRogge authored Dec 08, 2021

* First draft

* Style and remove mlm

* Make forward pass work

* More improvements

* More improvements

* Fix bug

* More improvements

* More improvements

* Add PerceiverTokenizer first draft

* Improve conversion script

* More improvements

* Make conversion script work for the encoder

* Make conversion script work with local pickle files

* Style & quality, fix-copies

* Add dummy input to conversion script

* Add absolute position embeddings to TextPreProcessor

* Make forward pass of encoder work

* More improvements

* Move text preprocessor to separate script

* More improvements

* More improvements

* Add post processor

* Make MLM model work

* Style

* Add PerceiverForMaskedLM

* Add PerceiverImagePreprocessor

* Make style

* Make PerceiverForImageClassification work

* More improvements

* More improvements

* Use tokenizer in conversion script

* Use PerceiverForMaskedLM in conversion script

* Define custom PerceiverModelOutput

* Improve PerceiverAttention to make it work for both MLM and image classification

* More improvements

* More improvements

* More improvements to the conversion script

* Make conversion script work for both MLM and image classification

* Add PerceiverFeatureExtractor

* More improvements

* Style and quality

* Add center cropping

* Fix bug

* Small fix

* Add print statement

* Fix bug in image preprocessor

* Fix bug with conversion script

* Make output position embeddings an nn.Parameter layer instead of nn.Embedding

* Comment out print statements

* Add position encoding classes

* More improvements

* Use position_encoding_kwargs

* Add PerceiverForImageClassificationFourier

* Make style & quality

* Add PerceiverForImageClassificationConvProcessing

* Style & quality

* Add flow model

* Move processors to modeling file

* Make position encodings modular

* Make basic decoder use modular position encodings

* Add PerceiverForOpticalFlow to conversion script

* Add AudioPreprocessor

* Make it possible for the basic decoder to use Fourier position embeddings

* Add PerceiverForMultimodalAutoencoding

* Improve model for optical flow

* Improve _build_network_inputs method

* Add print statement

* Fix device issue

* Fix device of Fourier embeddings

* Add print statements for debugging

* Add another print statement

* Add another print statement

* Add another print statement

* Add another print statement

* Improve PerceiverAudioPreprocessor

* Improve conversion script for multimodal model

* More improvements

* More improvements

* Improve multimodal model

* Make forward pass multimodal model work

* More improvements

* Improve tests

* Fix some more tests

* Add output dataclasses

* Make more tests pass

* Add print statements for debugging

* Add tests for image classification

* Add PerceiverClassifierOutput

* More improvements

* Make more tests pass for the optical flow model

* Make style & quality

* Small improvements

* Don't support training for optical flow model for now

* Fix _prepare_for_class for tests

* Make more tests pass, add some docs

* Add multimodal model to tests

* Minor fixes

* Fix tests

* Improve conversion script

* Make fixup

* Remove pos_dim argument

* Fix device issue

* Potential fix for OOM

* Revert previous commit

* Fix test_initialization

* Add print statements for debugging

* Fix print statement

* Add print statement

* Add print statement

* Add print statement

* Add print statement

* Add print statement

* Add print statement

* Remove need for output_shape

* Comment out output_shape

* Remove unnecessary code

* Improve docs

* Fix make fixup

* Remove PerceiverTextProcessor from init

* Improve docs

* Small improvement

* Apply first batch of suggestions from code review

* Apply more suggestions from code review

* Update docstrings

* Define dicts beforehand for readability

* Rename task to architecture in conversion script, include PerceiverModel in tests

* Add print statements for debugging

* Fix tests on GPU

* Remove preprocessors, postprocessors and decoders from main init

* Add integration test

* Fix docs

* Replace einops by torch

* Update for new docs frontend

* Rename PerceiverForImageClassification

* Improve docs

* Improve docs

* Improve docs of PerceiverModel

* Fix some more tests

* Improve center_crop

* Add PerceiverForSequenceClassification

* Small improvements

* Fix tests

* Add integration test for optical flow model

* Clean up

* Add tests for tokenizer

* Fix tokenizer by adding special tokens properly

* Fix CI

65b20b73

[Wav2Vec2] PyCTCDecode Integration to support language model boosted decoding (#14339) · 961732c2

Patrick von Platen authored Dec 08, 2021



* up

* up

* up

* make it cleaner

* correct

* make style

* add more tests

* finish

* small fix

* make style

* up

* try to solve circle ci

* up

* fix more tests

* fix more tests

* apply sylvains suggestions

* fix import

* correct docs

* add pyctcdecode only to speech tests

* fix more tests

* add tf, flax and pt tests

* add pt

* fix last tests

* fix more tests

* Apply suggestions from code review

* change lines

* Apply suggestions from code review
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* correct tests

* correct tests

* add doc string
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

961732c2

Fixing Dataset for TQA + token-classification. (#14658) · 2e12d90b

Nicolas Patry authored Dec 08, 2021

* Fixing Dataset for TQA + token-classification.

* Fixing the tests.

* Making sure `offset_mappings` is a valid argument.

2e12d90b

07 Dec, 2021 5 commits

[trainer] conditional ctx managers into one wrapper (#14663) · fae0b9fa

Stas Bekman authored Dec 07, 2021



* [trainer] conditional ctx managers into one wrapper

* workaround for contextlib.nullcontext for py<3.7

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* one more autocast

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

fae0b9fa
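
The idea named in this commit, folding conditional context managers into one wrapper with a fallback where `contextlib.nullcontext` is unavailable (Python < 3.7), can be sketched as follows. The names `null_context`, `maybe_context` and `recording_context` are hypothetical and are not the identifiers used in `trainer.py`.

```python
import contextlib


@contextlib.contextmanager
def null_context():
    """Do-nothing context manager, standing in for contextlib.nullcontext."""
    yield


def maybe_context(enabled, make_context):
    """Return make_context() when enabled, otherwise a no-op context manager."""
    return make_context() if enabled else null_context()


events = []


@contextlib.contextmanager
def recording_context():
    # Stand-in for an autocast-style context; it only records enter and exit.
    events.append("enter")
    try:
        yield
    finally:
        events.append("exit")


# The call site keeps a single `with` statement regardless of the flag.
with maybe_context(True, recording_context):
    events.append("body")

with maybe_context(False, recording_context):
    events.append("body-no-ctx")
```

The wrapper lets the caller write one `with` block instead of duplicating the body under an `if`/`else` for each optional context.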

Fix a Bug, trainer_seq2seq.py, in the else branch at Line 172,... · 39f1dff5

TranSirius authored Dec 08, 2021

Fix a Bug, trainer_seq2seq.py, in the else branch at Line 172, generation_inputs should be a dict (#14546)

* fix bug, trainer_seq2seq.py, Line 172, generation_inputs must be a dict before feeding into self.model.generation()

* fix bug, trainer_seq2seq.py, Line 172, generation_inputs must be a dict before feeding into self.model.generation()

39f1dff5

quick fix SummarizationPipeline error messages (#14618) · 2171695c

Nouamane Tazi authored Dec 07, 2021

* quick fix SummarizationPipeline error messages

Fix error messages to avoid spam errors, and errors of type:
`Your max_length is set to 50, but you input_length is only 46. You might consider decreasing max_length manually, e.g. summarizer('...', max_length=50)`

* correct SummarizationPipeline error message fixes

2171695c

[deepspeed] fix --load_best_model_at_end (#14652) · b66c5ab2

Stas Bekman authored Dec 06, 2021

* [deepspeed] fix load_best_model_at_end

* try with pull_request_target

* revert: try with pull_request_target

* style

* add test

* cleanup

b66c5ab2

Add mLUKE (#14640) · 30646a0a

Ryokan RI authored Dec 07, 2021

* implement MLukeTokenizer and LukeForMaskedLM

* update tests

* update docs

* add LukeForMaskedLM to check_repo.py

* update README

* fix test and specify the entity pad id in tokenization_(m)luke

* fix EntityPredictionHeadTransform

30646a0a

06 Dec, 2021 8 commits

Use cross_attention_hidden_size in Encoder-Decoder models (#14378) · 4cdb67ca

Yih-Dar authored Dec 07, 2021



* add cross_attention_hidden_size to text-2-text encoder-decoder models (PT/Flax)

* for TFEncoderDecoderModel

* add equivalence test for TFEncoderDecoderModel

* fix

* fix failed equivalence tests

* remove unused import

* add detailed comment

* Fix check_equivalence_tf_to_pt by using encoder/decoder

* cleaning

* Use cross_attention_hidden_size in speech-to-text

* clean fast init logging msg in encoder decoder models

* increase tol from 1e-5 to 1e-3 for tf test

* style

* style

* make sure projection layer can run

* remove type conversion + add check

* fix conflict (config.output_hidden_size)

* Remove TF -> PT in check_pt_tf_equivalence for TFEncoderDecoderModel
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4cdb67ca

Remove nonworking workflow for now · 381b05a3
Sylvain Gugger authored Dec 06, 2021

381b05a3

fix flax examples tests (#14646) · 75ae287a

Suraj Patil authored Dec 07, 2021

* make tensorboard optional

* update test_fetcher for flax examples

* make the tests slow

75ae287a

Add a job to test the documentation build (#14645) · 03fda7b7
Sylvain Gugger authored Dec 06, 2021
```
* Add a job to the documentation build

* Add caching

* Test cache
```
03fda7b7
Fix syntax for class references (#14644) · e513c16e
Sylvain Gugger authored Dec 06, 2021

e513c16e

Auto processor fix (#14623) · e9688875

Lysandre Debut authored Dec 06, 2021



* Add AutoProcessor class
Init and tests
Add doc
Fix init
Update src/transformers/models/auto/processing_auto.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Reverts to tokenizer or feature extractor when available
Adapt test

* Revert "Adapt test"

This reverts commit bbdde5fab02465f24b54b227390073082cb32093.

* Revert "Reverts to tokenizer or feature extractor when available"

This reverts commit 77659ff5d21b6cc0baf6f443017e35e056a525bb.

* Don't revert everything Lysandre!
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

e9688875

fix flax example tests (#14643) · cbe60265
Suraj Patil authored Dec 06, 2021

cbe60265

doc: mismatch between pooler/d_output (#14641) · df085d8e

guhur authored Dec 06, 2021

The model outputs a pooler_output whereas the doctype examples were using a pooled_output.

df085d8e