Commits · 881c0df952fbe1e166033806f9128b3c52dc507d · chenpangpang / transformers

19 Jun, 2023 1 commit
- error bug on saving distributed optim state when using data parallel (#24108) · 881c0df9
  Xiaoyang Sun authored Jun 19, 2023
```
Update checkpoint_reshaping_and_interoperability.py
```
  881c0df9
16 Jun, 2023 9 commits

Adding ddp_broadcast_buffers argument to Trainer (#24326) · ee88ae59
Teven authored Jun 16, 2023
```
adding ddp_broadcast_buffers argument
```
ee88ae59

Add test for proper TF input signatures (#24320) · 91389950

Matt authored Jun 16, 2023

* Add test for proper input signatures

* No more signature pruning

* Test the dummy inputs are valid too

* fine-tine -> fine-tune

* Fix indent in test_dataset_conversion

91389950

Fix ImageGPT doc example (#24317) · bdfd57d1

amyeroberts authored Jun 16, 2023

* Fix ImageGPT doc example

* Update src/transformers/models/imagegpt/image_processing_imagegpt.py

* Fix types

bdfd57d1

Tied weights load (#24310) · 096f2cf1

Sylvain Gugger authored Jun 16, 2023

* Use tied weight keys

* More

* Fix tied weight missing warning

* Only give info on unexpected keys with different classes

* Deal with empty archs

* Fix tests

* Refine test

096f2cf1

Fix ner average grouping with no groups (#24319) · 61ffdeba
Nicolas Patry authored Jun 16, 2023
```
Fixes #https://github.com/huggingface/transformers/issues/24314
```
61ffdeba

Big TF test cleanup (#24282) · 34037129

Matt authored Jun 16, 2023

* Fix one BLIP arg not being optional, remove misspelled arg

* Remove the lxmert test overrides and just use the base test_saved_model_creation

* saved_model_creation fixes and re-enabling tests across the board

* Remove unnecessary skip

* Stop caching sinusoidal embeddings in speech_to_text

* Fix transfo_xl compilation

* Fix transfo_xl compilation

* Fix the conditionals in xglm

* Set the save spec only when building

* Clarify comment

* Move comment correctly

* Correct embeddings generation for speech2text

* Mark RAG generation tests as @slow

* Remove redundant else:

* Add comment to clarify the save_spec line in build()

* Fix size tests for XGLM at last!

* make fixup

* Remove one band_part operation

* Mark test_keras_fit as @slow

34037129

Byebye pytorch 1.9 (#24080) · 896a58de

Yih-Dar authored Jun 16, 2023



byebye

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

896a58de

Fix functional TF Whisper and modernize tests (#24301) · 62d71f40

Matt authored Jun 16, 2023

* Revert whisper change and modify the test_compile_tf_model test

* make fixup

* Tweak test slightly

* Add functional model saving to test

* Ensure TF can infer shapes for data2vec

* Add override for efficientformer

* Mark test as slow

62d71f40

[`SwitchTransformers`] Fix return values (#24300) · ba3fb4b8
Arthur authored Jun 16, 2023
```
* clean history

* remove other changes

* fix

* fix coipes
```
ba3fb4b8

15 Jun, 2023 16 commits

Update test versions on README.md (#24307) · 0b7b4429
Sayed Qaiser Ali authored Jun 15, 2023
```
Update README.md

Updated the tested versions
```
0b7b4429
Make `can_generate` as class method (#24299) · 6134b9b4
Yih-Dar authored Jun 15, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6134b9b4

Beam search type (#24288) · e45bc143

jprivera44 authored Jun 15, 2023

* test check in

* adding in type hint fix on beam search

* fixed code quality issue

e45bc143

Update tokenizer_summary.mdx (grammar) (#24286) · 1a113fcf
Belladore authored Jun 15, 2023

1a113fcf
[Docs] Fix the paper URL for MMS model (#24302) · c3ca346b
hitchhicker authored Jun 15, 2023
```
Fix the paper URL for MMS model
```
c3ca346b

[EnCodec] Changes for 32kHz ckpt (#24296) · 4124a09f

Sanchit Gandhi authored Jun 15, 2023

* [EnCodec] Changes for 32kHz ckpt

* Update src/transformers/models/encodec/convert_encodec_checkpoint_to_pytorch.py

* Update src/transformers/models/encodec/convert_encodec_checkpoint_to_pytorch.py

4124a09f

deepspeed init during eval fix (#24298) · 01b55779

Sourab Mangrulkar authored Jun 15, 2023



* deepspeed init during eval fix

* commit suggestions
Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

01b55779

Update README_zh-hans.md (#24181) · 6a081c51

Cooper authored Jun 15, 2023



* Update README_zh-hans.md

update document link

* Update README_zh-hans.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

6a081c51

[Docs] Improve docs for MMS loading of other languages (#24292) · 604a21b1

Patrick von Platen authored Jun 15, 2023



* Improve docs

* Apply suggestions from code review

* upload readme

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

604a21b1

Fix image segmentation tool bug (#23897) · e6122c3f
amyeroberts authored Jun 15, 2023
```
* Image segmentation tool bug

* Remove resizing in the tests
```
e6122c3f
[fix] bug in BatchEncoding.__getitem__ (#24293) · 6cd34d45
jiangmingyan authored Jun 15, 2023
```
Co-authored-by: luchen <luchen@luchendeMBP.lan>
```
6cd34d45
Split common test from core tests (#24284) · 372f5003
Sylvain Gugger authored Jun 15, 2023

372f5003

remove unused is_decoder parameter in DetrAttention (#24226) · a611ac9b

JayL0321 authored Jun 15, 2023

* issue#24161 remove unused is_decoder parameter in DetrAttention

* #24161 fix check_repository_consistency fail

a611ac9b

Fix LLaMa beam search when using parallelize (#24224) · 33196b45

Fei Wang authored Jun 15, 2023

* Fix LLaMa beam search when using parallelize

same issue as T5 #11717

* fix code format in modeling_llama.py

* fix format of _reorder_cache in modeling_llama.py

33196b45

Fix `check_config_attributes`: check all configuration classes (#24231) · 7504be35
Yih-Dar authored Jun 15, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
7504be35

Fix bug in slow tokenizer conversion, make it a lot faster (#24266) · 6793f0cf

Stephan Tulkens authored Jun 15, 2023



* Make conversion faster, fix None vs 0 bug

* Add second sort for consistency

* Update src/transformers/convert_slow_tokenizer.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

6793f0cf

14 Jun, 2023 11 commits

Add MMS CTC Fine-Tuning (#24281) · 1609a436

Patrick von Platen authored Jun 15, 2023

* Add mms ctc fine tuning

* make style

* More fixes that are needed

* make fix-copies

* make draft for README

* add new file

* move to new file

* make style

* make style

* add quick test

* make style

* make style

1609a436

[WIP] add EnCodec model (#23655) · 0c3fdccf

Matthijs Hollemans authored Jun 14, 2023



* boilerplate stuff

* messing around with the feature extractor

* fix feature extractor

* unit tests for feature extractor

* rename speech to audio

* quick-and-dirty import of Meta's code

* import weights (sort of)

* cleaning up

* more cleaning up

* move encoder/decoder args into config

* cleanup model

* rename EnCodec -> Encodec

* RVQ parameters in config

* add slow test

* add lstm init and test_init

* Add save & load

* finish EncodecModel

* remove decoder_input_values as they are ont used anywhere (not removed from doc yet)

* fix test feature extraction model name

* Add better slow test

* Fix tests

* some fixup and cleaning

* Improve further

* cleaning up quantizer

* fix up conversion script

* test don't pass, _encode_fram does not work

* update tests with output per encode and decode

* more cleanup

* rename _codebook

* remove old config cruft

* ratios & hop_length

* use ModuleList instead of Sequential

* clean up resnet block

* update types

* update tests

* fixup

* quick cleanup

* fix padding

* more styl,ing

* add patrick feedback

* fix copies

* fixup

* fix lstm

* fix shape issues

* fixup

* rename conv layers

* fixup

* fix decoding

* small conv refactoring

* remove norm_params

* simplify conv layers

* rename conv layers

* stuff

* Clean up

* Add padding logic

use padding mask

small conv refactoring

remove norm_params

simplify conv layers

rename conv layers

stuff

add batched test

update

Clean up

merge and update for padding

fix padding

fixup

* clean up more

* clean up more

* More clean ups

* cleanup convolutions

* typo

* fix typos

* fixup

* build PR doc?

* start refactoring docstring

* fix don't pad when no strid and chunk

* update docstring

* update docstring

* nits

* update going to lunch

* update config and model

* fix broken testse (becaue of the config changes)

* fix scale computation

* fixu[

* only return dict if speciefied or if config returns it

* remove todos

* update defaults in config

* update conversion script

* fix doctest

* more docstring + fixup

* nits on batched_tests

* more nits

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update basxed on review

* fix update

* updaet tests

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fixup

* add overlap and chunl_length_s

* cleanup feature extraction

* teste edge cases truncation and padding

* correct processor values

* update config encodec, nits

* fix tests

* fixup

* fix 24Hz test

* elle tests are green

* fix fixup

* Apply suggestions from code review

* revert readme changes

* fixup

* add example

* use facebook checkpoints

* fix typo

* no pipeline tests

* use slef.pad everywhere we can

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* update based on review

* update

* update mdx

* fix bug and tests

* fixup

* fix doctest

* remove comment

* more nits

* add more coverage for `test_truncation_and_padding`

* fixup

* add last test

* fix text

* nits

* Update tests/models/encodec/test_modeling_encodec.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* take care of the last comments

* typo

* fix test

* nits

* fixup

* Update src/transformers/models/encodec/feature_extraction_encodec.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: arthur.zucker@gmail.com <arthur.zucker@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

0c3fdccf

Clean up old Accelerate checks (#24279) · 26a2ec56
Sylvain Gugger authored Jun 14, 2023
```
* Clean up old Accelerate checks

* Put back imports
```
26a2ec56

Fix Debertav2 embed_proj (#24205) · 860d11ff

Wissam Antoun authored Jun 14, 2023

* MLM prediction head output size from embed_size

Take the output size of the dense projection layer from embedding_size instead of hidden_size since there could be a projection of the input embedding into hidden_size if they are different

* project TFDebertaV2 mlm output to embedding size

embedding size can be different that hidden_size, so the final layer needs to project back to embedding size. like in ELECTRA or DeBERTaV3 style pertaining.

This should solve an error that occurs when loading models like "almanach/camemberta-base-generator".

* fix the same issue for reshaping after projection

* fix layernorm size

* add self.embedding_size to scope

* fix embed_proj scope name

* apply the same changes to TF Deberta

* add the changes to deberta

* added self.embedding_size instead of config.embedding_size

* added the same change to debertav2

* added coppied from deberta to deberta2 model

* config.embedding_size fix

* black

* fix deberta config name

860d11ff

`Pix2StructImageProcessor` requires `torch>=1.11.0` (#24270) · a04ebc8b
Yih-Dar authored Jun 14, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a04ebc8b
Update check of core deps (#24277) · 8978b696
Sylvain Gugger authored Jun 14, 2023

8978b696
Adapt Wav2Vec2 conversion for MMS lang identification (#24234) · c4fec38b
Patrick von Platen authored Jun 14, 2023
```
* Add conversion for mms lid

* make style
```
c4fec38b
TF: CTRL with native embedding layers (#23456) · 4626df50
Joao Gante authored Jun 14, 2023

4626df50
Skip some `TQAPipelineTests` tests in past CI (#24267) · eac8dede
Yih-Dar authored Jun 14, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
eac8dede

QA doc: import torch before it is used (#24228) · 91b62f5a

ByronHsu authored Jun 14, 2023



* import torch before it is used

* style
Signed-off-by: byhsu <byhsu@linkedin.com>

---------
Signed-off-by: byhsu <byhsu@linkedin.com>
Co-authored-by: byhsu <byhsu@linkedin.com>

91b62f5a

Fix URL in comment for contrastive loss function (#24271) · 6ab045d6

TAE YOUNGDON authored Jun 14, 2023

* Update language_modeling.py

in "class TextDatasetForNextSentencePrediction(Dataset)", double considering "self.tokenizer.num_special_tokens_to_add(pair=True)" 

so, i remove self.block_size, and add parameter for "def create_examples_from_document". like "class LineByLineWithSOPTextDataset" do

* Update language_modeling.py

* Fix URL in comment for contrastive loss function

6ab045d6

13 Jun, 2023 3 commits

update FSDP save and load logic (#24249) · b89fcccd
Sourab Mangrulkar authored Jun 14, 2023
```
* update fsdp save and load logic

* fix

* see if this resolves the failing tests
```
b89fcccd

docs wrt using accelerate launcher with trainer (#24250) · e0603d89

Sourab Mangrulkar authored Jun 14, 2023



* update docs

* missing part

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address comments

* address Zach's comment

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e0603d89

Skip `GPT-J` fx tests for torch < 1.12 (#24256) · 23311314
Yih-Dar authored Jun 13, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
23311314