1. 07 Feb, 2023 7 commits
  2. 06 Feb, 2023 14 commits
  3. 03 Feb, 2023 11 commits
• Avoid flaky generation sampling tests (#21445) · 59d5edef
Yih-Dar authored
      * fix
      
      * fix
      
      ---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
• For IterableDataset, return DataLoader using self._train_batch_size (#21447) · 31c351c4
      agossard authored
      For IterableDataset, return DataLoader using self._train_batch_size. This is consistent with how we generate a regular DataLoader, and leads to the correct args.per_device_train_batch_size eventually ending up on each GPU.
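The fix above can be sketched without torch; `TinyTrainer` and `batch_iterable` are hypothetical stand-ins for `Trainer.get_train_dataloader` and for how a `DataLoader` batches an `IterableDataset` (no sampler, no shuffling):

```python
from itertools import islice

def batch_iterable(stream, batch_size):
    """Group an iterable into lists of up to batch_size items, the way a
    DataLoader batches an IterableDataset (sequential, no sampler)."""
    it = iter(stream)
    while True:
        chunk = list(islice(it, batch_size))
        if not chunk:
            return
        yield chunk

class TinyTrainer:
    """Hypothetical stand-in for the Trainer: _train_batch_size is the
    per-device batch size derived from args.per_device_train_batch_size."""
    def __init__(self, train_batch_size):
        self._train_batch_size = train_batch_size

    def get_train_batches(self, dataset):
        # The change: iterable datasets use self._train_batch_size too,
        # consistent with the map-style DataLoader branch, so each device
        # sees per_device_train_batch_size samples.
        return batch_iterable(dataset, self._train_batch_size)

batches = list(TinyTrainer(4).get_train_batches(range(10)))
# batches -> [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]
```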
• Add tutorial doc for TF + TPU (#21429) · 833174c9
Matt authored
      * Add tutorial doc for TF + TPU
      
      * Fix all those extra asterisks in the markdown
      
      * Use the actual Tip formatting
      
      * Remove unnecessary spaces
      
      * Reformat checklist
      
      * Fix checklist and reformat tips slightly
      
      * Update docs/source/en/perf_train_tpu_tf.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update docs/source/en/perf_train_tpu_tf.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update docs/source/en/perf_train_tpu_tf.mdx
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * Update docs/source/en/perf_train_tpu_tf.mdx
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * Add link to TPU notebook in the notebooks list
      
      * Add links to the TPU notebook in the tutorial doc
      
      * Make the markdown table a bit less wild
      
      * Fix notebook link
      
      * More notebook links
      
      * More fixes to wild tables
      
      ---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
• exclude deleted files in the fixup script (#21436) · 6c62cfb2
      Darren Tuit authored
      exclude deleted files from fixup script
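A minimal sketch of the idea (hypothetical file names, not the repo's actual script): a fixup script that lints "changed" files must first drop paths that no longer exist on disk. With git, the same effect comes from `git diff --name-only --diff-filter=d`, where lowercase `d` excludes deleted files.

```shell
tmpdir=$(mktemp -d)
touch "$tmpdir/a.py" "$tmpdir/c.py"            # b.py plays the deleted file
changed="$tmpdir/a.py $tmpdir/b.py $tmpdir/c.py"

# Keep only files that still exist before handing them to a linter.
existing=""
for f in $changed; do
  [ -e "$f" ] && existing="$existing $f"
done
echo "$existing"
```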
• [WIP] add SpeechT5 model (#18922) · e4bacf66
      Matthijs Hollemans authored
      * make SpeechT5 model by copying Wav2Vec2
      
      * add paper to docs
      
      * whoops added docs in wrong file
      
      * remove SpeechT5Tokenizer + put CTC back in the name
      
      * remove deprecated class
      
      * remove unused docstring
      
      * delete SpeechT5FeatureExtractor, use Wav2Vec2FeatureExtractor instead
      
      * remove classes we don't need right now
      
      * initial stab at speech encoder prenet
      
      * add more speech encoder prenet stuff
      
      * improve SpeechEncoderPrenet
      
      * add encoder (not finished yet)
      
      * add relative position bias to self-attention
      
      * add encoder CTC layers
      
      * fix formatting
      
      * add decoder from BART, doesn't work yet
      
      * make it work with generate loop
      
      * wrap the encoder into a speech encoder class
      
      * wrap the decoder in a text decoder class
      
      * changed my mind
      
      * changed my mind again ;-)
      
      * load decoder weights, make it work
      
      * add weights for text decoder postnet
      
      * add SpeechT5ForCTC model that uses only the encoder
      
      * clean up EncoderLayer and DecoderLayer
      
      * implement _init_weights in SpeechT5PreTrainedModel
      
      * cleanup config + Encoder and Decoder
      
      * add head + cross attention masks
      
      * improve doc comments
      
      * fixup
      
      * more cleanup
      
      * more fixup
      
      * TextDecoderPrenet works now, thanks Kendall
      
      * add CTC loss
      
      * add placeholders for other pre/postnets
      
      * add type annotation
      
      * fix freeze_feature_encoder
      
      * set padding tokens to 0 in decoder attention mask
      
      * encoder attention mask downsampling
      
      * remove features_pen calculation
      
      * disable the padding tokens thing again
      
      * fixup
      
      * more fixup
      
      * code review fixes
      
      * rename encoder/decoder wrapper classes
      
      * allow checkpoints to be loaded into SpeechT5Model
      
      * put encoder into wrapper for CTC model
      
      * clean up conversion script
      
      * add encoder for TTS model
      
      * add speech decoder prenet
      
      * add speech decoder post-net
      
      * attempt to reconstruct the generation loop
      
      * add speech generation loop
      
      * clean up generate_speech
      
      * small tweaks
      
      * fix forward pass
      
      * enable always dropout on speech decoder prenet
      
      * sort declaration
      
      * rename models
      
      * fixup
      
      * fix copies
      
      * more fixup
      
      * make consistency checker happy
      
      * add Seq2SeqSpectrogramOutput class
      
      * doc comments
      
      * quick note about loss and labels
      
      * add HiFi-GAN implementation (from Speech2Speech PR)
      
      * rename file
      
      * add vocoder to TTS model
      
      * improve vocoder
      
      * working on tokenizer
      
      * more better tokenizer
      
      * add CTC tokenizer
      
      * fix decode and batch_code in CTC tokenizer
      
      * fix processor
      
      * two processors and feature extractors
      
      * use SpeechT5WaveformFeatureExtractor instead of Wav2Vec2
      
      * cleanup
      
      * more cleanup
      
      * even more fixup
      
      * notebooks
      
      * fix log-mel spectrograms
      
      * support reduction factor
      
      * fixup
      
      * shift spectrograms to right to create decoder inputs
      
      * return correct labels
      
      * add labels for stop token prediction
      
      * fix doc comments
      
      * fixup
      
      * remove SpeechT5ForPreTraining
      
      * more fixup
      
      * update copyright headers
      
      * add usage examples
      
      * add SpeechT5ProcessorForCTC
      
      * fixup
      
      * push unofficial checkpoints to hub
      
      * initial version of tokenizer unit tests
      
      * add slow test
      
      * fix failing tests
      
      * tests for CTC tokenizer
      
      * finish CTC tokenizer tests
      
      * processor tests
      
      * initial test for feature extractors
      
      * tests for spectrogram feature extractor
      
      * fixup
      
      * more fixup
      
      * add decorators
      
      * require speech for tests
      
      * modeling tests
      
      * more tests for ASR model
      
      * fix imports
      
      * add fake tests for the other models
      
      * fixup
      
      * remove jupyter notebooks
      
      * add missing SpeechT5Model tests
      
      * add missing tests for SpeechT5ForCTC
      
      * add missing tests for SpeechT5ForTextToSpeech
      
      * sort tests by name
      
      * fix Hi-Fi GAN tests
      
      * fixup
      
      * add speech-to-speech model
      
      * refactor duplicate speech generation code
      
      * add processor for SpeechToSpeech model
      
      * add usage example
      
      * add tests for speech-to-speech model
      
      * fixup
      
      * enable gradient checkpointing for SpeechT5FeatureEncoder
      
      * code review
      
      * push_to_hub now takes repo_id
      
      * improve doc comments for HiFi-GAN config
      
      * add missing test
      
      * add integration tests
      
      * make number of layers in speech decoder prenet configurable
      
      * rename variable
      
      * rename variables
      
      * add auto classes for TTS and S2S
      
      * REMOVE CTC!!!
      
      * S2S processor does not support save/load_pretrained
      
      * fixup
      
      * these models are now in an auto mapping
      
      * fix doc links
      
      * rename HiFiGAN to HifiGan, remove separate config file
      
      * REMOVE auto classes
      
      * there can be only one
      
      * fixup
      
      * replace assert
      
      * reformat
      
      * feature extractor can process input and target at same time
      
      * update checkpoint names
      
      * fix commit hash
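Two of the steps above ("shift spectrograms to right to create decoder inputs" and "support reduction factor") can be sketched as one dependency-free helper. This is a hypothetical stand-in mirroring teacher forcing for spectrogram decoders, not the PR's exact code: keep every `reduction_factor`-th target frame, then shift the sequence one step right by prepending an all-zero frame.

```python
def shift_spectrograms_right(labels, reduction_factor=1):
    """Build decoder inputs from target spectrogram frames.
    labels: list of frames, each a list of mel-bin values."""
    num_bins = len(labels[0])
    if reduction_factor > 1:
        # Keep every reduction_factor-th frame to shorten the decoder sequence.
        labels = labels[reduction_factor - 1 :: reduction_factor]
    zero_frame = [0.0] * num_bins
    # Shift right: prepend a zero frame, drop the last frame.
    return [zero_frame] + [list(frame) for frame in labels[:-1]]

spec = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0], [6.0, 7.0]]
decoder_inputs = shift_spectrograms_right(spec, reduction_factor=2)
# decoder_inputs -> [[0.0, 0.0], [2.0, 3.0]]
```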
• do not scale gradient in bf16 mode (#21428) · fb13a7df
      Kashif Rasul authored
* do not scale gradient in bf16 mode
      
      * fix since args.fp16 might be none
      
      * fixed typo
      
      * typo
      
      * only do if grad scaling is true
      
      * self.amp_dtype == torch.float16 is true
      
      * put back prop when fsdp is not none
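The rule this commit implements can be sketched as follows. `configure_grad_scaling` is a hypothetical helper, and dtypes are plain strings to keep the sketch torch-free; the real Trainer compares against `torch.float16`. The rationale: float16 has a 5-bit exponent, so small gradients underflow to zero unless the loss is scaled up before `backward()`; bfloat16 keeps float32's 8-bit exponent, so its gradients do not underflow and no GradScaler is needed.

```python
def configure_grad_scaling(amp_dtype, use_amp=True):
    """Return whether a gradient scaler should be created for this
    mixed-precision dtype: only float16 needs loss/gradient scaling."""
    if amp_dtype not in ("float16", "bfloat16", "float32"):
        raise ValueError(f"unknown dtype: {amp_dtype}")
    return use_amp and amp_dtype == "float16"

configure_grad_scaling("float16")   # True  -> create a GradScaler
configure_grad_scaling("bfloat16")  # False -> plain loss.backward()
```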
• Yih-Dar · 197e7ce9
• Added model resources for LayoutLM Issue#19848 (#21377) · 0df80282
Avi Singhal authored
      * updated resources for LayoutLM
      
      * Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * fixed formatting, removed extra section
      
      ---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
• Remove more unused attributes in config classes (#21392) · f726d53e
Yih-Dar authored
* Remove unused type_vocab_size
      
      * Remove unused initializer_factor
      
      * Remove unused n_embd
      
      * Remove unused scale_embedding
      
      * Remove unused scale_attn_weights
      
      * fix
      
      * fix
      
      * Remove unused head_hidden_scale
      
      * Remove unused activation_dropout
      
      ---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
• Add `inputs_embeds` support for `.generate()` with BLOOM models (#21430) · 3560ae6d
      Pavel Denisov authored
      Add accepting `.generate()` calls with `inputs_embeds` on BLOOM models
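A hedged sketch of the pattern such a change typically adds to a model's `prepare_inputs_for_generation` (simplified stand-alone function, not BLOOM's exact code): on the first decoding step, if the caller supplied `inputs_embeds`, forward those instead of `input_ids`; once `past_key_values` exist, later steps fall back to the freshly generated token ids.

```python
def prepare_inputs_for_generation(input_ids, past_key_values=None,
                                  inputs_embeds=None, **kwargs):
    """Choose between inputs_embeds (first step only) and input_ids."""
    if inputs_embeds is not None and past_key_values is None:
        model_inputs = {"inputs_embeds": inputs_embeds}
    else:
        model_inputs = {"input_ids": input_ids}
    model_inputs["past_key_values"] = past_key_values
    return model_inputs

# First step: embeddings win; later steps (cache present): ids win.
first = prepare_inputs_for_generation([1, 2], inputs_embeds=[[0.1, 0.2]])
later = prepare_inputs_for_generation([3], past_key_values="cache",
                                      inputs_embeds=[[0.1, 0.2]])
```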
• Joao Gante
  4. 02 Feb, 2023 8 commits