Commits · b7bb2b59f72504fbabe3de24c84b5e282c4870e8 · chenpangpang / transformers

06 Feb, 2023 2 commits
- Generate: TF can now accept custom logits processors (#21454) · 49433310
  Joao Gante authored Feb 06, 2023
  
  49433310
- Fix `SpeechT5ForSpeechToSpeechIntegrationTests` device issue (#21460) · 0db5d911
  Yih-Dar authored Feb 06, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  0db5d911
03 Feb, 2023 4 commits

Avoid flaky generation sampling tests (#21445) · 59d5edef
Yih-Dar authored Feb 03, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
59d5edef

[WIP] add SpeechT5 model (#18922) · e4bacf66

Matthijs Hollemans authored Feb 03, 2023

* make SpeechT5 model by copying Wav2Vec2

* add paper to docs

* whoops added docs in wrong file

* remove SpeechT5Tokenizer + put CTC back in the name

* remove deprecated class

* remove unused docstring

* delete SpeechT5FeatureExtractor, use Wav2Vec2FeatureExtractor instead

* remove classes we don't need right now

* initial stab at speech encoder prenet

* add more speech encoder prenet stuff

* improve SpeechEncoderPrenet

* add encoder (not finished yet)

* add relative position bias to self-attention

* add encoder CTC layers

* fix formatting

* add decoder from BART, doesn't work yet

* make it work with generate loop

* wrap the encoder into a speech encoder class

* wrap the decoder in a text decoder class

* changed my mind

* changed my mind again ;-)

* load decoder weights, make it work

* add weights for text decoder postnet

* add SpeechT5ForCTC model that uses only the encoder

* clean up EncoderLayer and DecoderLayer

* implement _init_weights in SpeechT5PreTrainedModel

* cleanup config + Encoder and Decoder

* add head + cross attention masks

* improve doc comments

* fixup

* more cleanup

* more fixup

* TextDecoderPrenet works now, thanks Kendall

* add CTC loss

* add placeholders for other pre/postnets

* add type annotation

* fix freeze_feature_encoder

* set padding tokens to 0 in decoder attention mask

* encoder attention mask downsampling

* remove features_pen calculation

* disable the padding tokens thing again

* fixup

* more fixup

* code review fixes

* rename encoder/decoder wrapper classes

* allow checkpoints to be loaded into SpeechT5Model

* put encoder into wrapper for CTC model

* clean up conversion script

* add encoder for TTS model

* add speech decoder prenet

* add speech decoder post-net

* attempt to reconstruct the generation loop

* add speech generation loop

* clean up generate_speech

* small tweaks

* fix forward pass

* enable always dropout on speech decoder prenet

* sort declaration

* rename models

* fixup

* fix copies

* more fixup

* make consistency checker happy

* add Seq2SeqSpectrogramOutput class

* doc comments

* quick note about loss and labels

* add HiFi-GAN implementation (from Speech2Speech PR)

* rename file

* add vocoder to TTS model

* improve vocoder

* working on tokenizer

* more better tokenizer

* add CTC tokenizer

* fix decode and batch_code in CTC tokenizer

* fix processor

* two processors and feature extractors

* use SpeechT5WaveformFeatureExtractor instead of Wav2Vec2

* cleanup

* more cleanup

* even more fixup

* notebooks

* fix log-mel spectrograms

* support reduction factor

* fixup

* shift spectrograms to right to create decoder inputs

* return correct labels

* add labels for stop token prediction

* fix doc comments

* fixup

* remove SpeechT5ForPreTraining

* more fixup

* update copyright headers

* add usage examples

* add SpeechT5ProcessorForCTC

* fixup

* push unofficial checkpoints to hub

* initial version of tokenizer unit tests

* add slow test

* fix failing tests

* tests for CTC tokenizer

* finish CTC tokenizer tests

* processor tests

* initial test for feature extractors

* tests for spectrogram feature extractor

* fixup

* more fixup

* add decorators

* require speech for tests

* modeling tests

* more tests for ASR model

* fix imports

* add fake tests for the other models

* fixup

* remove jupyter notebooks

* add missing SpeechT5Model tests

* add missing tests for SpeechT5ForCTC

* add missing tests for SpeechT5ForTextToSpeech

* sort tests by name

* fix Hi-Fi GAN tests

* fixup

* add speech-to-speech model

* refactor duplicate speech generation code

* add processor for SpeechToSpeech model

* add usage example

* add tests for speech-to-speech model

* fixup

* enable gradient checkpointing for SpeechT5FeatureEncoder

* code review

* push_to_hub now takes repo_id

* improve doc comments for HiFi-GAN config

* add missing test

* add integration tests

* make number of layers in speech decoder prenet configurable

* rename variable

* rename variables

* add auto classes for TTS and S2S

* REMOVE CTC!!!

* S2S processor does not support save/load_pretrained

* fixup

* these models are now in an auto mapping

* fix doc links

* rename HiFiGAN to HifiGan, remove separate config file

* REMOVE auto classes

* there can be only one

* fixup

* replace assert

* reformat

* feature extractor can process input and target at same time

* update checkpoint names

* fix commit hash

e4bacf66

Fix device issue in a `ConvBertModelTest` test (#21438) · 197e7ce9
Yih-Dar authored Feb 03, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
197e7ce9
🚨🚨 Generate: standardize beam search behavior across frameworks (#21368) · f21af262
Joao Gante authored Feb 03, 2023

f21af262

02 Feb, 2023 4 commits

Fix some pipeline tests (#21401) · a6d8a149
Yih-Dar authored Feb 02, 2023
```
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a6d8a149

[`bnb`] Fine-tuning HF 8-bit models (#21290) · 8298e4ec

Younes Belkada authored Feb 02, 2023



* force `memory_efficient_backward=True`

* enhancements

- trainer support
- add new flag

* some changes

- internal changes in `Trainer`
- small refactor

* make quality

* Fixes

- add new testing util
- add new test
- change test in Trainer

* fix CI test

* educate users on how to ft 8bit models

* more checks

* fix `logger` error

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* adapt from review

* fix

* add comment

* use return instead

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8298e4ec

Fix Graphormer test suite (#21419) · 67a3920d

Clémentine Fourrier authored Feb 02, 2023

* [FIX] path for Graphormer checkpoint

* [FIX] Test suite for graphormer

* [FIX] Update graphormer default num_classes

67a3920d

Add the GeLU activation from pytorch with the tanh approximation (#21345) · e006ab51
Joel Lamy-Poirier authored Feb 02, 2023
```
* gelu_python_tanh

* rename

* Version check, add test

* Pr comment
```
e006ab51

01 Feb, 2023 3 commits

Generate: decoder-only models can generate with `inputs_embeds` (#21405) · 92ce53aa
Joao Gante authored Feb 01, 2023

92ce53aa

Fix the issue of using only inputs_embeds in convbert model (#21398) · 77db257e

raghavanone authored Feb 01, 2023

* Fix the input embeds issue with tests

* Fix black and isort issue

* Clean up tests

* Add slow tag to the test introduced

* Incorporate PR feedbacks

77db257e

Add variant to transformers (#21332) · 90cddfa8

Patrick von Platen authored Feb 01, 2023

* Bump onnx in /examples/research_projects/decision_transformer

Bumps [onnx](https://github.com/onnx/onnx) from 1.11.0 to 1.13.0.
- [Release notes](https://github.com/onnx/onnx/releases)
- [Changelog](https://github.com/onnx/onnx/blob/main/docs/Changelog.md)
- [Commits](https://github.com/onnx/onnx/compare/v1.11.0...v1.13.0

)

---
updated-dependencies:
- dependency-name: onnx
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>

* adapt

* finish

* Update examples/research_projects/decision_transformer/requirements.txt

* up

* add tests

* Apply suggestions from code review
Co-authored-by: Lucain <lucainp@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* fix test

---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Lucain <lucainp@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

90cddfa8

31 Jan, 2023 4 commits

Update `Graphormer` and fix its `torchscript` test failures (#21380) · bc44e947
Yih-Dar authored Jan 31, 2023
```
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
bc44e947
Generate: fix TF XLA tests on models with `max_position_embeddings` or... · 19d67bfe
Joao Gante authored Jan 31, 2023
```
Generate: fix TF XLA tests on models with `max_position_embeddings` or `max_target_positions` (#21389)
```
19d67bfe
Template for framework-agnostic tests (#21348) · 623346ab
Joao Gante authored Jan 31, 2023

623346ab

Add DETA (#20983) · 5451f889

NielsRogge authored Jan 31, 2023

* First draft

* Add initial draft of conversion script

* Convert all weights

* Fix config

* Add image processor

* Fix DetaImageProcessor

* Run make fix copies

* Remove timm dependency

* Fix dummy objects

* Improve loss function

* Remove conv_encoder attribute

* Update conversion scripts

* Improve postprocessing + docs

* Fix copied from statements

* Add tests

* Improve postprocessing

* Improve postprocessing

* Update READMEs

* More improvements

* Fix rebase

* Add is_torchvision_available

* Add torchvision dependency

* Fix typo and README

* Fix bug

* Add copied from

* Fix style

* Apply suggestions

* Fix thanks to @ydshieh

* Fix another dependency check

* Simplify image processor

* Add scipy

* Improve code

* Add threshold argument

* Fix bug

* Set default threshold

* Improve integration test

* Add another integration test

* Update setup.py

* Address review

* Improve deformable attention function

* Improve copied from

* Use relative imports

* Address review

* Replace assertions

* Address review

* Update dummies

* Remove dummies

* Address comments, update READMEs

* Remove custom kernel code

* Add image processor tests

* Add requires_backends

* Add minor comment

* Update scripts

* Update organization name

* Fix defaults, add doc tests

* Add id2label for object 365

* Fix tests

* Update task guide

5451f889

30 Jan, 2023 4 commits
- Fixes path for Graphormer checkpoint (#21367) · 14d989a9
  Clémentine Fourrier authored Jan 30, 2023
```
[FIX] path for Graphormer checkpoint
```
  14d989a9
- Generate: Relaxed `max_length` and `max_new_tokens` coexistence (#21347) · 42b60f8b
  Joao Gante authored Jan 30, 2023
```
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  42b60f8b
- Pipeline testing - using tiny models on Hub (#20426) · c749bd40
  Yih-Dar authored Jan 30, 2023
```
* rework pipeline tests

* run pipeline tests

* fix

* fix

* fix

* revert the changes in get_test_pipeline() parameter list

* fix expected error message

* skip a test

* clean up

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  c749bd40
- Fix `GitModelIntegrationTest.test_batched_generation` device issue (#21362) · a582cfce
  Yih-Dar authored Jan 30, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  a582cfce
27 Jan, 2023 1 commit

[Whisper] another patch (#21324) · 0dff407d

Arthur authored Jan 27, 2023

* another patch

* fix timestamp test modeling

* let it be negative when the token is None

0dff407d

26 Jan, 2023 3 commits
- Fix `TFEncoderDecoder` tests (#21301) · 449df41f
  Yih-Dar authored Jan 26, 2023
```
remove max_length=None
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  449df41f
- Use `model_class.__name__` and compare against `XXX_MAPPING_NAMES` (#21304) · 4e41b87e
  Yih-Dar authored Jan 26, 2023
```
* update

* update all

* clean up

* make quality

* clean up
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  4e41b87e
- Accept batched tensor of images as input to image processor (#21144) · d18a1cba
  amyeroberts authored Jan 26, 2023
```
* Accept a batched tensor of images as input

* Add to all image processors

* Update oneformer
```
  d18a1cba
25 Jan, 2023 7 commits

[WHISPER] Small patch (#21307) · 6f3faf38

Arthur authored Jan 25, 2023

* add small patch

* update tests, forced decoder ids is not prioritary against generation config

* fix two new tests

6f3faf38

Add BridgeTower model (#20775) · 3a6e4a22

Anahita Bhiwandiwalla authored Jan 25, 2023



* Commit with BTModel and latest HF code

* Placeholder classes for BTForMLM and BTForITR

* Importing Bert classes from transformers

* Removed objectives.py and dist_utils.py

* Removed swin_transformer.py

* Add image normalization, BridgeTowerForImageAndTextRetrieval

* Add center_crop

* Removing bert tokenizer and LCI references

* Tested config loading from HF transformers hub

* Removed state_dict updates and added path to hub

* Enable center crop

* Getting image_size from config, renaming num_heads and num_layers

* Handling max_length in BridgeTowerProcessor

* Add BridgeTowerForMaskedLM

* Add doc string for BridgeTowerConfig

* Add doc strings for BT config, processor, image processor

* Adding docs, removed swin

* Removed convert_bridgetower_original_to_pytorch.py

* Added doc files for bridgetower, removed is_vision

* Add support attention_mask=None and BridgeTowerModelOutput

* Fix formatting

* Fixes with 'make style', 'make quality', 'make fixup'

* Remove downstream tasks from BridgeTowerModel

* Formatting fixes, add return_dict to BT models

* Clean up after doc_test

* Update BTModelOutput return type, fix todo in doc

* Remove loss_names from init

* implement tests and update tuples returned by models

* Add image reference to bridgetower.mdx

* after make fix-copies, make fixup, make style, make quality, make repo-consistency

* Rename class names with BridgeTower prefix

* Fix for image_size in BTImageProcessor

* implement feature extraction bridgetower tests

* Update image_mean and image_std to be list

* remove unused import

* Removed old comments

* Rework CLIP

* update config in tests followed config update

* Formatting fixes

* Add copied from for BridgeTowerPredictionHeadTransform

* Update bridgetower.mdx

* Update test_feature_extraction_bridgetower.py

* Update bridgetower.mdx

* BridgeTowerForMaskedLM is conditioned on image too

* Add BridgeTowerForMaskedLM

* Fixes

* Call post_init to init weights

* Move freeze layers into method

* Remove BTFeatureExtractor, add BT under multimodal models

* Remove BTFeatureExtractor, add BT under multimodal models

* Code review feedback - cleanup

* Rename variables

* Formatting and style to PR review feedback

* Move center crop after resize

* Use named parameters

* Style fix for modeling_bridgetower.py

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/bridgetower/modeling_bridgetower.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/bridgetower/modeling_bridgetower.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/models/bridgetower/modeling_bridgetower.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Rename config params, copy BERT classes, clean comments

* Cleanup irtr

* Replace Roberta imports, add BTTextConfig and Model

* Update docs, add visionconfig, consistent arg names

* make fixup

* Comments for forward in BTModel and make fixup

* correct tests

* Remove inconsistent roberta copied from

* Add BridgeTowerTextModel to dummy_pt_objects.py

* Add BridgeTowerTextModel to IGNORE_NON_TESTED

* Update docs for BT Text and Vision Configs

* Treat BridgeTowerTextModel as a private model

* BridgeTowerTextModel as private

* Run make fix-copies

* Adding BTTextModel to PRIVATE_MODELS

* Fix for issue with BT Text and Image configs

* make style changes

* Update README_ja.md

Add から to BridgeTower's description

* Clean up config, .mdx and arg names

* Fix init_weights. Remove nn.Sequential

* Formatting and style fixes

* Re-add tie_word_embeddings in config

* update test implementation

* update style

* remove commented out

* fix style

* Update README with abs for BridgeTower

* fix style

* fix mdx file

* Update bridgetower.mdx

* Update img src in bridgetower.mdx

* Update README.md

* Update README.md

* resolve style failed

* Update _toctree.yml

* Update README_ja.md

* Removed mlp_ratio, rename feats, rename BTCLIPModel

* Replace BTCLIP with BTVisionModel,pass in vision_config to BTVisionModel

* Add test_initialization support

* Add support for output_hidden_states

* Update support for output_hidden_states

* Add support for output_attentions

* Add docstring for output_hidden_states

* update tests

* add bridgetowervisionmodel as private model

* rerun the PR test

* Remove model_type, pass configs to classes, renames

* Change self.device to use weight device

* Remove image_size

* Style check fixes

* Add hidden_size and num_hidden_layers to BridgeTowerTransformer

* Update device setting

* cosmetic update

* trigger test again

* trigger tests again

* Update test_modeling_bridgetower.py

trigger tests again

* Update test_modeling_bridgetower.py

* minor update

* re-trigger tests

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Remove pad, update max_text_len, doc cleanup, pass eps to LayerNorm

* Added copied to, some more review feedback

* make fixup

* Use BridgeTowerVisionEmbeddings

* Code cleanup

* Fixes for BridgeTowerVisionEmbeddings

* style checks

* re-tests

* fix embedding

* address comment on init file

* retrigger tests

* update import prepare_image_inputs

* update test_image_processing_bridgetower.py to reflect test_image_processing_common.py

* retrigger tests
Co-authored-by: Shaoyen Tseng <shao-yen.tseng@intel.com>
Co-authored-by: Tiep Le <tiep.le@intel.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Tiep Le <97980157+tileintel@users.noreply.github.com>

3a6e4a22

Update `OneFormerModelIntegrationTest` expected values (#21295) · cc714d74

Yih-Dar authored Jan 25, 2023



* update values

* update values

* update values

* Update tests/models/oneformer/test_modeling_oneformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

cc714d74

Moving to cleaner tokenizer version or `oneformer`. (#21292) · 8788fd0c
Nicolas Patry authored Jan 25, 2023
```
Moving to cleaner tokenizer version.
```
8788fd0c

[Whisper] Refactor whisper (#21252) · 255257f3

Arthur authored Jan 25, 2023

* update whisper logit processor

* add generate for whisper

* remove part of the whisper specific code from pipeline

* update logit processes

* major update

* enforce first timestamp

* update generate

* add more tests

* update new decoding strategy

* Apply suggestions from code review

* update docstring

* fixup

* default config will not have multilingual ar

* update expected tokenizer size, see pull on the hub for whisper-tiny

255257f3

Supporting `ImageProcessor` in place of `FeatureExtractor` for pipelines (#20851) · 99e79054

Nicolas Patry authored Jan 25, 2023



* Fixing the pipeline with image processor.

* Update the slow test.

* Using only the first image processor.

* Include exclusion mecanism for Image processor.

* Do not handle Gitconfig, deemed as a bug.

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Remove `conversational` changes. They are not supposed to be here.

* Address first row of comments.

* Remove OneFormer modifications.
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

99e79054

[GIT] Add test for batched generation (#21282) · efdbad56

NielsRogge authored Jan 25, 2023



* Add test

* Apply suggestions
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

efdbad56

24 Jan, 2023 6 commits

[W2V2 with LM] Fix decoder test with params (#21277) · 14d058b9
Sanchit Gandhi authored Jan 24, 2023

14d058b9

[GenerationConfig] add additional kwargs handling (#21269) · 94a7edd9

Arthur authored Jan 24, 2023



* add additional kwargs handling

* fix issue when serializing

* correct order of kwargs removal for serialization in from dict

* add `dict_torch_dtype_to_str` in case a dtype is needed for generation

* add condition when adding the kwargs : not from config

* Add comment based on review
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* add test function

* default None when poping arg
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

94a7edd9

[examples/deepspeed] fix renamed api (#21283) · 9286039c
Stas Bekman authored Jan 24, 2023

9286039c

[`t5`] Fix T5 inference in `float16` + `bnb` error (#21281) · e2e393c6

Younes Belkada authored Jan 24, 2023

* attempts to fix:

- upcast input for `T5DenseActDense`
- add the condition `self.wo.weight.dtype != torch.int8`
- added tests on `test/mixed_int8`
- `make fixup`

* fix ci test

e2e393c6

Fix MaskFormerImageProcessor.post_process_instance_segmentation (#21256) · f424b094
Alara Dirik authored Jan 24, 2023
```
* fix instance segmentation post processing

* add Mask2FormerImageProcessor
```
f424b094
Skip `test_multi_gpu_data_parallel_forward` for `UperNetModelTest` (#21216) · bde7378b
Yih-Dar authored Jan 24, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
bde7378b

23 Jan, 2023 2 commits

Add class properties with warnings (#21195) · c18b4fbe

amyeroberts authored Jan 23, 2023

* Replace reduce_labels with do_reduce_labels

* Replace only for __init__ and preprocess

* Add class properties with warnings

* Update tests

c18b4fbe

[ci-daily] Fix pipeline tests (#21257) · b80b2218

Arthur authored Jan 23, 2023

* use streaming dataset

* fix whisper's test

* add rescale argument to chunk_iter

b80b2218