"tests/utils/test_utils_check_copies.py" did not exist on "c89bdfbe720bc8f41c7dc6db5473a2cb0955f224"
  1. 13 Feb, 2023 5 commits
  2. 10 Feb, 2023 9 commits
  3. 09 Feb, 2023 7 commits
    • Added with torch.no_grad() to XLM-Roberta integration test (#21547) · 23c146c3
      Katie Le authored
      
      
      * added with torch.no_grad() to the integration tests and applied make style
      
      * added with torch.no_grad() to xlm roberta forward pass
      
      ---------
      Co-authored-by: Bibi <Bibi@katies-mac.local>
      23c146c3
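      The pattern this commit applies, as a hedged sketch (the checkpoint name, input ids, and shape check below are illustrative, not lifted from the actual test): the integration test's forward pass is wrapped in torch.no_grad() so no autograd graph or gradient buffers are built.

      import torch
      from transformers import XLMRobertaModel

      def test_xlm_roberta_inference_no_grad():
          # Illustrative checkpoint; the real test pins its own checkpoint and expected values.
          model = XLMRobertaModel.from_pretrained("xlm-roberta-base")
          model.eval()
          input_ids = torch.tensor([[0, 581, 10269, 83, 99942, 136, 60742, 23, 70, 80583, 18, 2]])

          # The change in #21547: run the forward pass under no_grad so the
          # integration test neither builds a graph nor allocates gradients.
          with torch.no_grad():
              last_hidden_state = model(input_ids).last_hidden_state

          assert last_hidden_state.shape == (1, input_ids.shape[1], model.config.hidden_size)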
    • 🚨🚨🚨 Enforce single model initialization (#21431) · 04b2f13c
      Sylvain Gugger authored
      * Enforce single model initialization
      
      * Add OneFormer example for problem 3
      
      * Do it the Stas way
      
      * Actually rename the uses...
      
      * Rewrite test
      
      * Try to change the test this way
      
      * Fix all init slow/fast tests
      
      * Break connection
      
      * Fix more tests
      
      * Fix test for initialization
      
      * Remove custom test
      
      * Quality
      
      * Fix last failing tests
      
      * The end?
      04b2f13c
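      A hedged sketch of the convention this change enforces (class names below are illustrative, not code from the PR): all weight initialization for a model lives in the _init_weights hook of its PreTrainedModel subclass and is triggered once via post_init(), rather than being repeated inside submodule constructors.

      import torch.nn as nn
      from transformers import PretrainedConfig, PreTrainedModel

      class ToyConfig(PretrainedConfig):
          model_type = "toy"

          def __init__(self, hidden_size=64, initializer_range=0.02, **kwargs):
              super().__init__(**kwargs)
              self.hidden_size = hidden_size
              self.initializer_range = initializer_range

      class ToyPreTrainedModel(PreTrainedModel):
          config_class = ToyConfig
          base_model_prefix = "toy"

          def _init_weights(self, module):
              # The single place where parameters are initialized; submodules do not
              # re-initialize themselves in __init__, so each weight is set exactly once.
              if isinstance(module, nn.Linear):
                  module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)
                  if module.bias is not None:
                      module.bias.data.zero_()

      class ToyModel(ToyPreTrainedModel):
          def __init__(self, config):
              super().__init__(config)
              self.dense = nn.Linear(config.hidden_size, config.hidden_size)
              # post_init() runs _init_weights over the module tree once, at the end of __init__.
              self.post_init()

          def forward(self, hidden_states):
              return self.dense(hidden_states)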
    • Add BLIP-2 (#21441) · d7f1e7c0
      NielsRogge authored
      
      
      * First draft
      
      * More improvements
      
      * More improvements
      
      * Improve conversion script
      
      * Convert all weights
      
      * Make forward pass work
      
      * Make logits match
      
      * More improvements
      
      * More improvements
      
      * More improvements
      
      * Use get_input_embeddings
      
      * Improve some more
      
      * Improve model tests
      
      * Improve model tests
      
      * More improvements
      
      * Fix processor
      
      * Update files
      
      * Update prepare_inputs_for_generation
      
      * More improvements
      
      * Fix copies
      
      * More fixes
      
      * Make fixup
      
      * More improvements
      
      * Add support for seq2seq language model
      
      * More improvements
      
      * Fix test
      
      * More improvements
      
      * Improve conversion script
      
      * Remove some todo's
      
      * Fix README's
      
      * Improve conversion script
      
      * Fix generation
      
      * Fix style and remove Blip2Model
      
      * Fix model outputs
      
      * More improvements
      
      * Set eos_token_id in config
      
      * Fix quality
      
      * Small improvements
      
      * Add processor tests
      
      * More improvements
      
      * Apply suggestions
      
      * Apply suggestions
      
      * Add integration test
      
      * Update image URL
      
      * Add integration test
      
      * Fix model_type
      
      * Update style
      
      * Improve docs
      
      * Add doc tests
      
      * Fix copies
      
      * Remove tests which are passing
      
      * Improve some more
      
      * Add tests for seq2seq language models
      
      * Minor fix
      
      * Convert more checkpoints
      
      * finalize CI
      
      * Fix blip and blip2 processors
      
      * add `accelerate` support for `blip2`
      
      * clean up
      
      * make style
      
      * Update conversion script
      
      * Update conversion script some more
      
      * Update organization
      
      * revert toc file
      
      * add blip-2 to toc file
      
      * Some more improvements
      
      * Fix docstring
      
      * Improve docs
      
      ---------
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      Co-authored-by: younesbelkada <younesbelkada@gmail.com>
      d7f1e7c0
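      A hedged usage sketch for the model added here, assuming the Blip2Processor / Blip2ForConditionalGeneration classes and a Salesforce/blip2-opt-2.7b checkpoint of the kind the conversion script targets; see the BLIP-2 docs for the exact API.

      import requests
      import torch
      from PIL import Image
      from transformers import Blip2ForConditionalGeneration, Blip2Processor

      # Checkpoint name is an assumption based on the converted weights mentioned above.
      processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
      model = Blip2ForConditionalGeneration.from_pretrained(
          "Salesforce/blip2-opt-2.7b", torch_dtype=torch.float16
      ).to("cuda")

      url = "http://images.cocodataset.org/val2017/000000039769.jpg"
      image = Image.open(requests.get(url, stream=True).raw)

      # Image captioning: with no text prompt, the language model generates a caption.
      inputs = processor(images=image, return_tensors="pt").to("cuda", torch.float16)
      generated_ids = model.generate(**inputs, max_new_tokens=20)
      print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0].strip())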
    • Tag tests as slow (#21537) · 0d33381f
      Joao Gante authored
      begone slow tests
      0d33381f
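      For context, a minimal sketch of how a test is tagged as slow in this code base: the slow decorator from transformers.testing_utils skips the test unless RUN_SLOW=1 is set, so it runs in the scheduled slow CI job instead of the default one.

      import unittest

      from transformers.testing_utils import require_torch, slow

      class ExampleIntegrationTests(unittest.TestCase):
          @slow
          @require_torch
          def test_expensive_end_to_end(self):
              # Skipped unless RUN_SLOW=1 is set in the environment,
              # which keeps it out of the default (fast) CI run.
              ...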
  4. 08 Feb, 2023 5 commits
  5. 07 Feb, 2023 7 commits
  6. 06 Feb, 2023 3 commits
  7. 03 Feb, 2023 4 commits
    • Avoid flaky generation sampling tests (#21445) · 59d5edef
      Yih-Dar authored
      
      
      * fix
      
      * fix
      
      ---------
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      59d5edef
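      A hedged sketch of the usual way such sampling tests are de-flaked (not the exact diff of #21445): fix the random seed immediately before calling generate with do_sample=True, or relax the assertion so it does not depend on exact sampled token ids.

      import torch

      def sample_reproducibly(model, input_ids):
          # Seeding right before sampling makes do_sample=True reproducible
          # for a fixed device and library version.
          torch.manual_seed(0)
          return model.generate(input_ids, do_sample=True, max_new_tokens=10)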
    • [WIP] add SpeechT5 model (#18922) · e4bacf66
      Matthijs Hollemans authored
      * make SpeechT5 model by copying Wav2Vec2
      
      * add paper to docs
      
      * whoops added docs in wrong file
      
      * remove SpeechT5Tokenizer + put CTC back in the name
      
      * remove deprecated class
      
      * remove unused docstring
      
      * delete SpeechT5FeatureExtractor, use Wav2Vec2FeatureExtractor instead
      
      * remove classes we don't need right now
      
      * initial stab at speech encoder prenet
      
      * add more speech encoder prenet stuff
      
      * improve SpeechEncoderPrenet
      
      * add encoder (not finished yet)
      
      * add relative position bias to self-attention
      
      * add encoder CTC layers
      
      * fix formatting
      
      * add decoder from BART, doesn't work yet
      
      * make it work with generate loop
      
      * wrap the encoder into a speech encoder class
      
      * wrap the decoder in a text decoder class
      
      * changed my mind
      
      * changed my mind again ;-)
      
      * load decoder weights, make it work
      
      * add weights for text decoder postnet
      
      * add SpeechT5ForCTC model that uses only the encoder
      
      * clean up EncoderLayer and DecoderLayer
      
      * implement _init_weights in SpeechT5PreTrainedModel
      
      * cleanup config + Encoder and Decoder
      
      * add head + cross attention masks
      
      * improve doc comments
      
      * fixup
      
      * more cleanup
      
      * more fixup
      
      * TextDecoderPrenet works now, thanks Kendall
      
      * add CTC loss
      
      * add placeholders for other pre/postnets
      
      * add type annotation
      
      * fix freeze_feature_encoder
      
      * set padding tokens to 0 in decoder attention mask
      
      * encoder attention mask downsampling
      
      * remove features_pen calculation
      
      * disable the padding tokens thing again
      
      * fixup
      
      * more fixup
      
      * code review fixes
      
      * rename encoder/decoder wrapper classes
      
      * allow checkpoints to be loaded into SpeechT5Model
      
      * put encoder into wrapper for CTC model
      
      * clean up conversion script
      
      * add encoder for TTS model
      
      * add speech decoder prenet
      
      * add speech decoder post-net
      
      * attempt to reconstruct the generation loop
      
      * add speech generation loop
      
      * clean up generate_speech
      
      * small tweaks
      
      * fix forward pass
      
      * enable always dropout on speech decoder prenet
      
      * sort declaration
      
      * rename models
      
      * fixup
      
      * fix copies
      
      * more fixup
      
      * make consistency checker happy
      
      * add Seq2SeqSpectrogramOutput class
      
      * doc comments
      
      * quick note about loss and labels
      
      * add HiFi-GAN implementation (from Speech2Speech PR)
      
      * rename file
      
      * add vocoder to TTS model
      
      * improve vocoder
      
      * working on tokenizer
      
      * more better tokenizer
      
      * add CTC tokenizer
      
      * fix decode and batch_code in CTC tokenizer
      
      * fix processor
      
      * two processors and feature extractors
      
      * use SpeechT5WaveformFeatureExtractor instead of Wav2Vec2
      
      * cleanup
      
      * more cleanup
      
      * even more fixup
      
      * notebooks
      
      * fix log-mel spectrograms
      
      * support reduction factor
      
      * fixup
      
      * shift spectrograms to right to create decoder inputs
      
      * return correct labels
      
      * add labels for stop token prediction
      
      * fix doc comments
      
      * fixup
      
      * remove SpeechT5ForPreTraining
      
      * more fixup
      
      * update copyright headers
      
      * add usage examples
      
      * add SpeechT5ProcessorForCTC
      
      * fixup
      
      * push unofficial checkpoints to hub
      
      * initial version of tokenizer unit tests
      
      * add slow test
      
      * fix failing tests
      
      * tests for CTC tokenizer
      
      * finish CTC tokenizer tests
      
      * processor tests
      
      * initial test for feature extractors
      
      * tests for spectrogram feature extractor
      
      * fixup
      
      * more fixup
      
      * add decorators
      
      * require speech for tests
      
      * modeling tests
      
      * more tests for ASR model
      
      * fix imports
      
      * add fake tests for the other models
      
      * fixup
      
      * remove jupyter notebooks
      
      * add missing SpeechT5Model tests
      
      * add missing tests for SpeechT5ForCTC
      
      * add missing tests for SpeechT5ForTextToSpeech
      
      * sort tests by name
      
      * fix Hi-Fi GAN tests
      
      * fixup
      
      * add speech-to-speech model
      
      * refactor duplicate speech generation code
      
      * add processor for SpeechToSpeech model
      
      * add usage example
      
      * add tests for speech-to-speech model
      
      * fixup
      
      * enable gradient checkpointing for SpeechT5FeatureEncoder
      
      * code review
      
      * push_to_hub now takes repo_id
      
      * improve doc comments for HiFi-GAN config
      
      * add missing test
      
      * add integration tests
      
      * make number of layers in speech decoder prenet configurable
      
      * rename variable
      
      * rename variables
      
      * add auto classes for TTS and S2S
      
      * REMOVE CTC!!!
      
      * S2S processor does not support save/load_pretrained
      
      * fixup
      
      * these models are now in an auto mapping
      
      * fix doc links
      
      * rename HiFiGAN to HifiGan, remove separate config file
      
      * REMOVE auto classes
      
      * there can be only one
      
      * fixup
      
      * replace assert
      
      * reformat
      
      * feature extractor can process input and target at same time
      
      * update checkpoint names
      
      * fix commit hash
      e4bacf66
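      A hedged end-to-end sketch of the text-to-speech path added here, assuming the SpeechT5Processor, SpeechT5ForTextToSpeech, and SpeechT5HifiGan classes plus microsoft/speecht5_tts and microsoft/speecht5_hifigan checkpoints; the random speaker embedding is a placeholder for a real x-vector.

      import torch
      from transformers import SpeechT5ForTextToSpeech, SpeechT5HifiGan, SpeechT5Processor

      # Checkpoint names are assumptions; see the SpeechT5 docs for the published ones.
      processor = SpeechT5Processor.from_pretrained("microsoft/speecht5_tts")
      model = SpeechT5ForTextToSpeech.from_pretrained("microsoft/speecht5_tts")
      vocoder = SpeechT5HifiGan.from_pretrained("microsoft/speecht5_hifigan")

      inputs = processor(text="Hello, my dog is cute.", return_tensors="pt")

      # A real application would use speaker x-vectors; a random 512-dim vector
      # keeps this sketch self-contained.
      speaker_embeddings = torch.randn(1, 512)

      # generate_speech returns a mono 16 kHz waveform tensor when a vocoder is passed.
      speech = model.generate_speech(inputs["input_ids"], speaker_embeddings, vocoder=vocoder)
      print(speech.shape)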
    • 197e7ce9
      Yih-Dar authored