Commits · 62d71f4083acccca1c1c9b0eea68db69d9ef759a · chenpangpang / transformers

16 Jun, 2023 2 commits

Fix functional TF Whisper and modernize tests (#24301) · 62d71f40

Matt authored Jun 16, 2023

* Revert whisper change and modify the test_compile_tf_model test

* make fixup

* Tweak test slightly

* Add functional model saving to test

* Ensure TF can infer shapes for data2vec

* Add override for efficientformer

* Mark test as slow

62d71f40

[`SwitchTransformers`] Fix return values (#24300) · ba3fb4b8
Arthur authored Jun 16, 2023
```
* clean history

* remove other changes

* fix

* fix copies
```
ba3fb4b8

15 Jun, 2023 16 commits

Update test versions on README.md (#24307) · 0b7b4429
Sayed Qaiser Ali authored Jun 15, 2023
```
Update README.md

Updated the tested versions
```
0b7b4429
Make `can_generate` as class method (#24299) · 6134b9b4
Yih-Dar authored Jun 15, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6134b9b4

Beam search type (#24288) · e45bc143

jprivera44 authored Jun 15, 2023

* test check in

* adding in type hint fix on beam search

* fixed code quality issue

e45bc143

Update tokenizer_summary.mdx (grammar) (#24286) · 1a113fcf
Belladore authored Jun 15, 2023

1a113fcf
[Docs] Fix the paper URL for MMS model (#24302) · c3ca346b
hitchhicker authored Jun 15, 2023
```
Fix the paper URL for MMS model
```
c3ca346b

[EnCodec] Changes for 32kHz ckpt (#24296) · 4124a09f

Sanchit Gandhi authored Jun 15, 2023

* [EnCodec] Changes for 32kHz ckpt

* Update src/transformers/models/encodec/convert_encodec_checkpoint_to_pytorch.py

* Update src/transformers/models/encodec/convert_encodec_checkpoint_to_pytorch.py

4124a09f

deepspeed init during eval fix (#24298) · 01b55779

Sourab Mangrulkar authored Jun 15, 2023



* deepspeed init during eval fix

* commit suggestions
Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

01b55779

Update README_zh-hans.md (#24181) · 6a081c51

Cooper authored Jun 15, 2023



* Update README_zh-hans.md

update document link

* Update README_zh-hans.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

6a081c51

[Docs] Improve docs for MMS loading of other languages (#24292) · 604a21b1

Patrick von Platen authored Jun 15, 2023



* Improve docs

* Apply suggestions from code review

* upload readme

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

604a21b1

Fix image segmentation tool bug (#23897) · e6122c3f
amyeroberts authored Jun 15, 2023
```
* Image segmentation tool bug

* Remove resizing in the tests
```
e6122c3f
[fix] bug in BatchEncoding.__getitem__ (#24293) · 6cd34d45
jiangmingyan authored Jun 15, 2023
```
Co-authored-by: luchen <luchen@luchendeMBP.lan>
```
6cd34d45
Split common test from core tests (#24284) · 372f5003
Sylvain Gugger authored Jun 15, 2023

372f5003

remove unused is_decoder parameter in DetrAttention (#24226) · a611ac9b

JayL0321 authored Jun 15, 2023

* issue#24161 remove unused is_decoder parameter in DetrAttention

* #24161 fix check_repository_consistency fail

a611ac9b

Fix LLaMa beam search when using parallelize (#24224) · 33196b45

Fei Wang authored Jun 15, 2023

* Fix LLaMa beam search when using parallelize

same issue as T5 #11717

* fix code format in modeling_llama.py

* fix format of _reorder_cache in modeling_llama.py

33196b45

Fix `check_config_attributes`: check all configuration classes (#24231) · 7504be35
Yih-Dar authored Jun 15, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
7504be35

Fix bug in slow tokenizer conversion, make it a lot faster (#24266) · 6793f0cf

Stephan Tulkens authored Jun 15, 2023



* Make conversion faster, fix None vs 0 bug

* Add second sort for consistency

* Update src/transformers/convert_slow_tokenizer.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

6793f0cf
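
The "None vs 0" item in the commit above names a common Python pitfall: a truthiness check silently discards a legitimate value of 0. The sketch below is a generic illustration of that class of bug, not the code from the PR. The function names, the `pairs` list, and the `scores` mapping are all hypothetical.

```python
from typing import Dict, List, Tuple

Merge = Tuple[str, str, int]


def collect_merges_buggy(pairs: List[Tuple[str, str]], scores: Dict[str, int]) -> List[Merge]:
    """Hypothetical helper: keeps a pair only if its score is truthy."""
    merges = []
    for left, right in pairs:
        score = scores.get(left + right)  # None when the merged piece is unknown
        if score:  # bug: a valid score of 0 is falsy and gets dropped
            merges.append((left, right, score))
    return merges


def collect_merges_fixed(pairs: List[Tuple[str, str]], scores: Dict[str, int]) -> List[Merge]:
    """Hypothetical helper: keeps a pair whenever a score exists, including 0."""
    merges = []
    for left, right in pairs:
        score = scores.get(left + right)
        if score is not None:  # only a missing entry is skipped
            merges.append((left, right, score))
    return merges


# Made-up data: "ab" has a real score of 0, "ef" has no score at all.
scores = {"ab": 0, "cd": 3}
pairs = [("a", "b"), ("c", "d"), ("e", "f")]

buggy = collect_merges_buggy(pairs, scores)
fixed = collect_merges_fixed(pairs, scores)
```

With this made-up data, the buggy version loses the `("a", "b", 0)` entry while the fixed version keeps it; both skip `("e", "f")` because it has no score.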

14 Jun, 2023 11 commits

Add MMS CTC Fine-Tuning (#24281) · 1609a436

Patrick von Platen authored Jun 15, 2023

* Add mms ctc fine tuning

* make style

* More fixes that are needed

* make fix-copies

* make draft for README

* add new file

* move to new file

* make style

* make style

* add quick test

* make style

* make style

1609a436

[WIP] add EnCodec model (#23655) · 0c3fdccf

Matthijs Hollemans authored Jun 14, 2023



* boilerplate stuff

* messing around with the feature extractor

* fix feature extractor

* unit tests for feature extractor

* rename speech to audio

* quick-and-dirty import of Meta's code

* import weights (sort of)

* cleaning up

* more cleaning up

* move encoder/decoder args into config

* cleanup model

* rename EnCodec -> Encodec

* RVQ parameters in config

* add slow test

* add lstm init and test_init

* Add save & load

* finish EncodecModel

* remove decoder_input_values as they are not used anywhere (not removed from doc yet)

* fix test feature extraction model name

* Add better slow test

* Fix tests

* some fixup and cleaning

* Improve further

* cleaning up quantizer

* fix up conversion script

* tests don't pass, _encode_frame does not work

* update tests with output per encode and decode

* more cleanup

* rename _codebook

* remove old config cruft

* ratios & hop_length

* use ModuleList instead of Sequential

* clean up resnet block

* update types

* update tests

* fixup

* quick cleanup

* fix padding

* more styling

* add patrick feedback

* fix copies

* fixup

* fix lstm

* fix shape issues

* fixup

* rename conv layers

* fixup

* fix decoding

* small conv refactoring

* remove norm_params

* simplify conv layers

* rename conv layers

* stuff

* Clean up

* Add padding logic

use padding mask

small conv refactoring

remove norm_params

simplify conv layers

rename conv layers

stuff

add batched test

update

Clean up

merge and update for padding

fix padding

fixup

* clean up more

* clean up more

* More clean ups

* cleanup convolutions

* typo

* fix typos

* fixup

* build PR doc?

* start refactoring docstring

* fix: don't pad when no stride and chunk

* update docstring

* update docstring

* nits

* update going to lunch

* update config and model

* fix broken tests (because of the config changes)

* fix scale computation

* fixup

* only return dict if specified or if config returns it

* remove todos

* update defaults in config

* update conversion script

* fix doctest

* more docstring + fixup

* nits on batched_tests

* more nits

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update based on review

* fix update

* update tests

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fixup

* add overlap and chunk_length_s

* cleanup feature extraction

* test edge cases: truncation and padding

* correct processor values

* update config encodec, nits

* fix tests

* fixup

* fix 24Hz test

* all tests are green

* fix fixup

* Apply suggestions from code review

* revert readme changes

* fixup

* add example

* use facebook checkpoints

* fix typo

* no pipeline tests

* use self.pad everywhere we can

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* update based on review

* update

* update mdx

* fix bug and tests

* fixup

* fix doctest

* remove comment

* more nits

* add more coverage for `test_truncation_and_padding`

* fixup

* add last test

* fix text

* nits

* Update tests/models/encodec/test_modeling_encodec.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* take care of the last comments

* typo

* fix test

* nits

* fixup

* Update src/transformers/models/encodec/feature_extraction_encodec.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: arthur.zucker@gmail.com <arthur.zucker@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

0c3fdccf

Clean up old Accelerate checks (#24279) · 26a2ec56
Sylvain Gugger authored Jun 14, 2023
```
* Clean up old Accelerate checks

* Put back imports
```
26a2ec56

Fix Debertav2 embed_proj (#24205) · 860d11ff

Wissam Antoun authored Jun 14, 2023

* MLM prediction head output size from embed_size

Take the output size of the dense projection layer from embedding_size instead of hidden_size since there could be a projection of the input embedding into hidden_size if they are different

* project TFDebertaV2 mlm output to embedding size

The embedding size can be different than hidden_size, so the final layer needs to project back to the embedding size, as in ELECTRA or DeBERTaV3-style pretraining.

This should solve an error that occurs when loading models like "almanach/camemberta-base-generator".

* fix the same issue for reshaping after projection

* fix layernorm size

* add self.embedding_size to scope

* fix embed_proj scope name

* apply the same changes to TF Deberta

* add the changes to deberta

* added self.embedding_size instead of config.embedding_size

* added the same change to debertav2

* added copied from deberta to deberta2 model

* config.embedding_size fix

* black

* fix deberta config name

860d11ff

`Pix2StructImageProcessor` requires `torch>=1.11.0` (#24270) · a04ebc8b
Yih-Dar authored Jun 14, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a04ebc8b
Update check of core deps (#24277) · 8978b696
Sylvain Gugger authored Jun 14, 2023

8978b696
Adapt Wav2Vec2 conversion for MMS lang identification (#24234) · c4fec38b
Patrick von Platen authored Jun 14, 2023
```
* Add conversion for mms lid

* make style
```
c4fec38b
TF: CTRL with native embedding layers (#23456) · 4626df50
Joao Gante authored Jun 14, 2023

4626df50
Skip some `TQAPipelineTests` tests in past CI (#24267) · eac8dede
Yih-Dar authored Jun 14, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
eac8dede

QA doc: import torch before it is used (#24228) · 91b62f5a

ByronHsu authored Jun 14, 2023



* import torch before it is used

* style
Signed-off-by: byhsu <byhsu@linkedin.com>

---------
Signed-off-by: byhsu <byhsu@linkedin.com>
Co-authored-by: byhsu <byhsu@linkedin.com>

91b62f5a

Fix URL in comment for contrastive loss function (#24271) · 6ab045d6

TAE YOUNGDON authored Jun 14, 2023

* Update language_modeling.py

In "class TextDatasetForNextSentencePrediction(Dataset)", "self.tokenizer.num_special_tokens_to_add(pair=True)" is accounted for twice.

So I removed self.block_size and added a parameter to "def create_examples_from_document", as "class LineByLineWithSOPTextDataset" does.

* Update language_modeling.py

* Fix URL in comment for contrastive loss function

6ab045d6

13 Jun, 2023 11 commits

update FSDP save and load logic (#24249) · b89fcccd
Sourab Mangrulkar authored Jun 14, 2023
```
* update fsdp save and load logic

* fix

* see if this resolves the failing tests
```
b89fcccd

docs wrt using accelerate launcher with trainer (#24250) · e0603d89

Sourab Mangrulkar authored Jun 14, 2023



* update docs

* missing part

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address comments

* address Zach's comment

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e0603d89

Skip `GPT-J` fx tests for torch < 1.12 (#24256) · 23311314
Yih-Dar authored Jun 13, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
23311314

Stop storing references to bound methods via tf.function (#24146) · 3bd1fe43

Matt authored Jun 13, 2023

* Stop storing references to bound methods in tf.functions

* Remove the gc.collect calls now that we resolved the underlying problem

* Remove the default signature from model.serving entirely, big cleanup

* Remove _prune_signature as self.input_signature can prune itself

* Restore serving docstring

* Update int support test to check the input signature

* Make sure other tests also use model.input_signature and not serving.input_signature

* Restore _prune_signature

* Remove the doctest GC now it's no longer needed

* Correct core tests to use the pruned sig

* order lines correctly in core tests

* Add eager_serving back with a deprecation warning

3bd1fe43

Fix how we detect the TF package (#24255) · b979a206

Matt authored Jun 13, 2023

* Fix how we detect the TF package

* Add a comment as a talisman warding against future harm

* Actually put the comment in the right place

b979a206

Update urls in warnings for rich rendering (#24136) · e64d99fa

Ivan Reznikov authored Jun 13, 2023



* fixing typo in url in warnings

* fixing typo in url in warnings

* multi-line fix

* multi-line fix

* Update src/transformers/generation/utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/generation/flax_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/generation/tf_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

e64d99fa

Add `torch >=1.12` requirement for `Tapas` (#24251) · cf561d7c

Yih-Dar authored Jun 13, 2023



* fix

* fix

* fix

* Update src/transformers/models/tapas/modeling_tapas.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

cf561d7c

Generate: GenerationConfig can overwrite attributes at from_pretrained time (#24238) · b1ea6b4b
Joao Gante authored Jun 13, 2023
```
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
```
b1ea6b4b
TF: standardize `test_model_common_attributes` for language models (#23457) · 7bb6933b
Joao Gante authored Jun 13, 2023

7bb6933b
[Time Series] use mean scaler when scaling is a boolean True (#24237) · 4ed07528
Kashif Rasul authored Jun 13, 2023
```
* use mean scaler when scaling is boolean True

* remove debug
```
4ed07528
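
The commit above states that the mean scaler is used when `scaling` is the boolean True. A minimal sketch of that kind of dispatch follows, assuming a setting that may be either a string or a boolean. The function name `select_scaler`, the returned labels, and the "std" and fallback branches are assumptions for illustration, not the code from the PR.

```python
from typing import Optional, Union


def select_scaler(scaling: Optional[Union[str, bool]]) -> str:
    """Hypothetical helper: map a `scaling` setting to a scaler label."""
    # Per the commit title, a boolean True selects the mean scaler,
    # the same as the explicit string "mean".
    if scaling == "mean" or scaling is True:
        return "mean"
    # Assumed for illustration: a second named option.
    if scaling == "std":
        return "std"
    # Assumed for illustration: anything else (False, None) means no scaling.
    return "none"


choices = {repr(value): select_scaler(value) for value in (True, "mean", "std", False, None)}
```

Using `scaling is True` rather than a plain truthiness check keeps a non-empty string such as "std" from being mistaken for the boolean case.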

Tied params cleanup (#24211) · 695928e1

Sylvain Gugger authored Jun 13, 2023

* First test

* Add info for all models

* style

* Repo consistency

* Fix last model and cleanup prints

* Repo consistency

* Use consistent function for detecting tied weights

695928e1