Commits · 95b3ec3bc9e8fa135bd9adde5bbdd6cc7ee01618 · chenpangpang / transformers

09 Nov, 2021 2 commits

Add FlaxVisionEncoderDecoderModel (#13359) · 95b3ec3b

Yih-Dar authored Nov 09, 2021



* Start the work on FlaxVisionEncoderDecoderModel

* Add FlaxVisionEncoderDecoderModel

* Add VisionEncoderDecoderConfig

* Make FlaxVisionEncoderDecoderModel visible to transformers

* Add test

* Fix wrong getattr usage

* Fix tests

* Add FlaxAutoModelForVision2Seq

* Expose FLAX_MODEL_FOR_VISION_2_SEQ_MAPPING

* clean-up

* add integration test

* update expected logits

* update expected scores

* Add ViT2GPT2ModelIntegrationTest + some cleaning

* Add projection layer + PT/Flax equivalence tests

* Fix import

* minor changes

* make test slow again

* Apply suggestions

* Add modeling_flax_vision_encoder_decoder to _ignore_modules in get_model_modules()

* fix copies

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* split long strings in multiple lines

* decoder_input_ids can't be None

* Add back test_configuration_tie

* Remove attention_mask parameter

* fix test - encoder_last_hidden_state should be encoder_outputs.last_hidden_state instead of the projected vector

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Remove more encoder_attention_mask

* remove encoder_attention_mask when calling self.decode (in FlaxVisionEncoderDecoderModule)

* Fix style + pass 1s instead of None as encoder_attention_mask

* fix init_weights

* pass None for encoder_attention_mask

* pass 1s instead of None as encoder_attention_mask

* Fix doc style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

95b3ec3b

Small change to Wav2Vec2 model to support Tensor-Parallelism with DeepSpeed (#14298) · a5030122

Reza Yazdani authored Nov 08, 2021

* minor modification to the wav2vec2 modeling file to support tensor-parallelism with DeepSpeed on this HuggingFace model

* refine the comments

* synch changes

* fix comments

* refine comments

* fix format

a5030122

08 Nov, 2021 8 commits
- [deepspeed] Enable multiple test runs on single box, defer to DS_TEST_PORT if set (#14331) · d0e96c6d
  Jeff Rasley authored Nov 08, 2021
```
* defer to DS_TEST_PORT if set

* style
Co-authored-by: Stas Bekman <stas@stason.org>
```
  d0e96c6d
- Expand dynamic supported objects to configs and tokenizers (#14296) · dfb00bf6
  Sylvain Gugger authored Nov 08, 2021
```
* Dynamic configs

* Add config test

* Better tests

* Add tokenizer and test

* Add to from_config

* With save
```
  dfb00bf6
- Changed relative imports to absolute to allow convert_graph_to_onnx.py to run as a script. (#14325) · de635af3
  nbertagnolli authored Nov 08, 2021
```
* Changed relative imports to absolute to allow convert_graph_to_onnx.py to be run as a script

* isorted code
```
  de635af3
- Fixing mutable default argument in `pipeline`. (#14316) · a3ded170
  Nicolas Patry authored Nov 08, 2021
```
* Fixing mutable default argument.

* XX.

* Revert "XX."

This reverts commit 61d4bb333f6d39a7fbe31d161b8bd14787ceec2e.
```
  a3ded170
- Fixing tests on master. (#14317) · 9b78b070
  Nicolas Patry authored Nov 08, 2021
```
* Fixing tests on master.

* Better fix.

* Lxmert doesn't have feature extractor but is bimodal.
```
  9b78b070
- [TFWav2Vec2Model] Fix input shapes in TFWav2Vec2WeightNormConv1D (#14319) · df1f94eb
  Anton Lozhkov authored Nov 08, 2021
```
* Add paddings to input shapes

* Add padding comment
```
  df1f94eb
- [Tests] Update audio classification tests to support torch 1.10 (#14318) · e30078b5
  Anton Lozhkov authored Nov 08, 2021
  
  e30078b5
- [Marian Conversion] Fix eos_token_id conversion in conversion script (#14320) · b48faae3
  Patrick von Platen authored Nov 08, 2021
  
  b48faae3
06 Nov, 2021 4 commits
- Fix execution PATH for PPLM Example (#14287) · c016dbdb
  Junbum Lee authored Nov 06, 2021
  
  c016dbdb
- Fix tests (#14289) · 34307bb3
  NielsRogge authored Nov 06, 2021
  
  34307bb3
- Handle long answer needs to be updated. (#14279) · 24b30d4d
  Nicolas Patry authored Nov 06, 2021
```
`start_` and `end_` tensors now contain a batch_size at this point.
```
  24b30d4d
- Update dpr.rst (#14300) · 843c326e
  Xing Han Lu authored Nov 06, 2021
  
  843c326e
05 Nov, 2021 3 commits
- Add new LFS prune API (#14294) · 08a5f575
  Sylvain Gugger authored Nov 05, 2021
  
  08a5f575
- [Hubert Docs] Make sure example uses a fine-tuned model (#14291) · 4be78c22
  Patrick von Platen authored Nov 05, 2021
  
  4be78c22
- Pin TF until tests are fixed (#14283) · a14d62b0
  Sylvain Gugger authored Nov 04, 2021
```
* Pin TF until tests are fixed

* Also pin TF CPU
```
  a14d62b0
04 Nov, 2021 4 commits
- Removing Keras version pinning (#14280) · b90a48f6
  Matt authored Nov 04, 2021
```
* Removing Keras version pinning

* make fixup
```
  b90a48f6
- improve rewrite state_dict missing _metadata (#14276) · fd8136fa
  Chang Wang authored Nov 04, 2021
  
  fd8136fa
- Fixing mishandling of `ignore_labels`. (#14274) · d29baf69
  Nicolas Patry authored Nov 04, 2021
```
Fixes #14272
```
  d29baf69
- Fixing slow pipeline tests (#14260) · 68427c9b
  Nicolas Patry authored Nov 04, 2021
```
* Fiixng slow pipeline tests

* Remove the image-segmentaiton override.

* Fixing clamping only in training.

* Wav2vec2.

* Remove last mention of `no_grad`.

* Fixing copies.

* Rename.
```
  68427c9b
03 Nov, 2021 11 commits

Add more instructions to the release guide (#14263) · 1a674ce6

Sylvain Gugger authored Nov 03, 2021



* Add more instructions to the release guide

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comment
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

1a674ce6

Quality explain (#14264) · f0d6e952

Sylvain Gugger authored Nov 03, 2021



* Start PR doc

* Cleanup the quality checks and document them

* Add reference in the contributing guide

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename file as per review suggestion
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

f0d6e952

Pin Keras cause they messed their release (#14262) · a1c15ea8

Sylvain Gugger authored Nov 03, 2021

* Pin Keras cause they messed their release

* Put != instead of <

* Try this way

* Back to the beginning but more agressive

a1c15ea8

Fixing typo in error message. (#14226) · 11492431
Nicolas Patry authored Nov 03, 2021

11492431

Fix of issue #13327: Wrong weight initialization for TF t5 model (#14241) · 2c8957fe

Dan Shirron authored Nov 03, 2021



* Fix of issue #13327: Wrong weight initialization for TF t5 model

* run black formatter

* fix typo

* remove my name tag from comments
Co-authored-by: Shirron <dan.shirron@intel.com>

2c8957fe

Adding support for `truncation` parameter on `feature-extraction` pipeline. (#14193) · dec759e7

Nicolas Patry authored Nov 03, 2021

* Adding support for `truncation` parameter on `feature-extraction`
pipeline.

Fixes #14183

* Fixing tests on ibert, longformer, and roberta.

* Rebase fix.

dec759e7

minimal fixes to run DataCollatorForWholeWordMask with return_tensors="np" and... · 27b1516d

Dean Wyatte authored Nov 03, 2021

minimal fixes to run DataCollatorForWholeWordMask with return_tensors="np" and return_tensors="tf" (#13891)

* minimal fixes to run DataCollatorForWholeWordMask with return_tensors="np" and return_tensors="tf"

* more consinstent implementation for numpy_mask_tokens

27b1516d

Put `load_image` function in `image_utils.py` & fix image rotation issue (#14062) · 671569dd

Mishig Davaadorj authored Nov 03, 2021

* Fix img load rotation

* Add `load_image` to `image_utils.py`

* Implement LoadImageTester

* Use hf-internal-testing dataset

* Add img utils comments

* Refactor LoadImageTester

* Import load_image under is_vision_available

671569dd

up (#14258) · 89766b3d
Patrick von Platen authored Nov 03, 2021

89766b3d

Add cross attentions to TFGPT2Model (#14038) · bd21ed40

Yih-Dar authored Nov 03, 2021



* Add cross attentions to TFGPT2Model

* change to is_pt_tf_cross_test

* A minor correction to a comment

* Remove n_ctx when creating self.crossattention
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

bd21ed40

Add LayoutXLMProcessor (and LayoutXLMTokenizer, LayoutXLMTokenizerFast) (#14115) · 5f789a68

NielsRogge authored Nov 03, 2021



* Add LayoutXLMTokenizer and LayoutXLMTokenizerFast

* Fix styling issues

* Fix more styling issues

* Fix more styling issues

* Fix docstring

* Fix unit tests

* Fix docs

* Fix unit tests

* Fix typos and styling issues

* Fix styling issues

* Fix docstring

* Make all tests of test_tokenization_layoutxlm pass

* Add LayoutXLMProcessor

* Make fixup

* Make all LayoutXLMProcessor tests pass

* Minor fixes

* Leave LayoutLMv2Processor tests unchanged

* Fix code quality

* Move LayoutXLM tokenizers and processor to separate folder

* Fix code quality

* Apply suggestions from code review

* Replace assertions by value errors

* Remove methods from fast tokenizer
Co-authored-by: King Yiu Suen <kingyiusuen@gmail.com>

5f789a68

02 Nov, 2021 7 commits
- Update Transformers to huggingface_hub >= 0.1.0 (#14251) · 558f8543
  Sylvain Gugger authored Nov 02, 2021
```
* Update Transformers to huggingface_hub >= 0.1.0

* Forgot to save...

* Style

* Fix test
```
  558f8543
- Added Beit model output class (#14133) · 519a677e
  lumliolum authored Nov 02, 2021
```
* add Beit model ouput class

* inherting from BaseModelOuputWithPooling

* updated docs if use_mean_pooling is False

* added beit specific outputs in model docs

* changed the import path

* Fix docs
Co-authored-by: Niels Rogge <niels.rogge1@gmail.com>
```
  519a677e
- Fixes Beit training for PyTorch 1.10+ (#14249) · bbaa3eff
  Sylvain Gugger authored Nov 02, 2021
  
  bbaa3eff
- Add PushToHubCallback in main init (#14246) · ad3e560b
  Sylvain Gugger authored Nov 02, 2021
  
  ad3e560b
- [Tests] Fix DistilHubert path (#14245) · ce01122a
  Anton Lozhkov authored Nov 02, 2021
```
* Add audio-classification benchmarking results

* fix distilhubert path
```
  ce01122a
- Fix test_configuration_tie in FlaxEncoderDecoderModelTest (#14076) · 4a394cf5
  Yih-Dar authored Nov 02, 2021
```
* check test_configuration_tie

* Fix test_configuration_tie

* make test slow again

* Remove property and use model.module.bind

* revert to slow test
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  4a394cf5
- Fix generation docstring (#14216) · a767276f
  Li-Huai (Allan) Lin authored Nov 02, 2021
```
* Fix generation docstring

* Style
```
  a767276f
01 Nov, 2021 1 commit

Add BeitForSemanticSegmentation (#14096) · e20faa6f

NielsRogge authored Nov 01, 2021



* Add first draft

* Make forward pass work

* Improve conversion script

* Add notebook that checks if it works

* Add BeitForSemanticSegmentation to the tests

* More improvements

* Make BeitForSemanticSegmentation consistent with Segformer

* Small bug fix

* Add BeitForSemanticSegmentation to docs

* Make sure model doesn't output hidden states when the user doesn't want to

* Make it possible to convert the large model

* Fix issue

* Fix conversion script for large model

* Add auxiliary_head option to semantic segmentation model

* Apply suggestions from @sgugger's review

* Apply suggestions from code review

* Fix failing test
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

e20faa6f