Commits · 232c898f9fd1eec956e7d3f9fbc78999992c5b4a · chenpangpang / transformers

29 Jun, 2023 12 commits

MS Kim(tony9402) authored Jun 30, 2023

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

* fix annotations

232c898f

Check all objects are equally in the main `__init__` file (#24573) · c817bc44
Yih-Dar authored Jun 29, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
c817bc44

Fix ESM models buffers (#24576) · 8c4471d1

Sylvain Gugger authored Jun 29, 2023

* Fix ESM models buffers

* Remove modifs

* Tied weights keys are needed silly

* quality

8c4471d1

Removal of deprecated vision methods and specify deprecation versions (#24570) · b324557a
amyeroberts authored Jun 29, 2023
```
* Removal of deprecated methods and specify versions

* Fix tests
```
b324557a
Update some torchscript tests after #24505 (#24566) · 77db28dc
Yih-Dar authored Jun 29, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
77db28dc

Add Musicgen (#24109) · 1c1c9075

Sanchit Gandhi authored Jun 29, 2023



* Add Audiocraft

* add cross attention

* style

* add for lm

* convert and verify

* introduce t5

* split configs

* load t5 + lm

* clean conversion

* copy from t5

* style

* start pattern provider

* make generation work

* style

* fix pos embs

* propagate shape changes

* propagate shape changes

* style

* delay pattern: pad tokens at end

* audiocraft -> musicgen

* fix inits

* add mdx

* style

* fix pad token in processor

* override generate and add todos

* add init to test

* undo pattern delay mask after gen

* remove cfg logits processor

* remove cfg logits processor

* remove logits processor in favour of mask

* clean pos embs

* make fix copies

* update readmes

* clean pos emb

* refactor encoder/decoder

* make fix copies

* update conversion

* fix config imports

* update config docs

* make style

* send pattern mask to device

* pattern mask with delay

* recover prompted audio tokens

* fix docstrings

* laydown test file

* pattern edge case

* remove t5 ref

* add processing class

* config refactor

* better pattern comment

* check if mask is not present

* check if mask is not present

* refactor to auto class

* remove encoder configs

* fix processor

* processor import

* start updating conversion

* start updating tests

* make style

* convert t5, encodec, lm

* convert as composite

* also convert processor

* run generate

* classifier free gen

* comments and clean up

* make style

* docs for logit proc

* docstring for uncond gen

* start lm tests

* work tests

* let the lm generate

* refactor: reshape inside forward

* undo greedy loop changes

* from_enc_dec -> from_sub_model

* fix input id shapes in docstrings

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* undo generate changes

* from sub model config

* Update src/transformers/models/musicgen/modeling_musicgen.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* make generate work again

* generate uncond -> get uncond inputs

* remove prefix allowed tokens fn

* better error message

* logit proc checks

* Apply suggestions from code review
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* make decoder only tests work

* composite fast tests

* make style

* uncond generation

* feat extr padding

* make audio prompt work

* fix inputs docstrings

* unconditional inputs: dict -> model output

* clean up tests

* more clean up tests

* make style

* t5 encoder -> auto text encoder

* remove comments

* deal with frames

* fix auto text

* slow tests

* nice mdx

* remove can generate

* todo - hub id

* convert m/l

* make fix copies

* only import generation with torch

* ignore decoder from tests

* don't wrap uncond inputs

* make style

* cleaner uncond inputs

* add example to musicgen forward

* fix docs

* ignore MusicGen Model/ForConditionalGeneration in auto mapping

* add doc section to toctree

* add to doc tests

* add processor tests

* fix push to hub in conversion

* tips for decoder only loading

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix conversion for s / m / l checkpoints

* import stopping criteria from module

* remove from pipeline tests

* fix uncond docstring

* decode audio method

* fix docs

* org: sanchit-gandhi -> facebook

* fix max pos embeddings

* remove auto doc (not compatible with shapes)

* bump max pos emb

* make style

* fix doc

* fix config doc

* fix config doc

* ignore musicgen config from docstring

* make style

* fix config

* fix config for doctest

* consistent from_sub_models

* don't automap decoder

* fix mdx save audio file

* fix mdx save audio file

* processor batch decode for audio

* remove keys to ignore

* update doc md

* update generation config

* allow changes for default generation config

* update tests

* make style

* fix docstring for uncond

* fix processor test

* fix processor test

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1c1c9075

Revert "Fix typing annotations for FSDP and DeepSpeed in TrainingArguments" (#24574) · 2dc5e1a1
Sylvain Gugger authored Jun 29, 2023
```
Revert "Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (#24549)"

This reverts commit c5e29d43.
```
2dc5e1a1
Docs: 4 bit doc corrections (#24572) · 4f1b31c2
Joao Gante authored Jun 29, 2023
```
4 bit doc corrections
```
4f1b31c2
Fix annotations (#24571) · 1fd52e6e
MS Kim(tony9402) authored Jun 29, 2023
```
* fix annotations

* fix copies
```
1fd52e6e
Fix Typo (#24559) · 63cc30e7
MS Kim(tony9402) authored Jun 29, 2023

63cc30e7

Update old existing feature extractor references (#24552) · ae454f41

amyeroberts authored Jun 29, 2023

* Update old existing feature extractor references

* Typo

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* Address comments from review - update 'feature extractor'
Co-authored by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

ae454f41

Fixed OwlViTModel inplace operations (#24529) · 10c2ac7b
Pasquale De Marinis authored Jun 29, 2023
```
* fixed OwlViTModel inplace operations

* fixed operands order in owlvit
```
10c2ac7b

28 Jun, 2023 14 commits
- Update masked_language_modeling.md (#24560) · 66954ea2
  condor-cp authored Jun 28, 2023
```
See https://github.com/huggingface/transformers/issues/24546
```
  66954ea2
- Make PT/Flax tests could be run on GPU (#24557) · fd673510
  Yih-Dar authored Jun 28, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  fd673510
- Update PT/Flax weight conversion after #24030 (#24556) · faae8d82
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  faae8d82
- [`InstructBlip`] Add instruct blip int8 test (#24555) · 33b5ef5c
  Younes Belkada authored Jun 28, 2023
```
* add 8bit instructblip test

* update tests
```
  33b5ef5c
- Fix processor __init__ bug if image processor undefined (#24554) · c70c88a2
  amyeroberts authored Jun 28, 2023
```
Make sure feature_extractor is defined in all cases
```
  c70c88a2
- [`gpt2-int8`] Add gpt2-xl int8 test (#24543) · 903b97d8
  Younes Belkada authored Jun 28, 2023
```
add gpt2-xl test
```
  903b97d8
- Update `EncodecIntegrationTest` (#24553) · b0651655
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  b0651655
- Update PT/TF weight conversion after #24030 (#24547) · 6c57ce15
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  6c57ce15
- Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (#24549) · c5e29d43
  Max Ryabinin authored Jun 28, 2023
```
* Fix typing annotations for FSDP and DeepSpeed in TrainingArguments

* Change dict to Dict
```
  c5e29d43
- Allow for warn_only selection in enable_full_determinism (#24496) · daccde14
  Frank995 authored Jun 28, 2023
```
* Warn only in enable full determinism

* Add option in the function definition
```
  daccde14
- Unpin DeepSpeed and require DS >= 0.9.3 (#24541) · 11cb6e0f
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  11cb6e0f
- ⚠️ Time to say goodbye to py37 (#24091) · e84bf1f7
  Yih-Dar authored Jun 28, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e84bf1f7
- Add bitsandbytes support for gpt2 models (#24504) · 12240925
  Dario Sučić authored Jun 28, 2023
```
* Add bitsandbytes support for gpt2 models

* Guard Conv1D import to pass tensorflow test

* Appease ruff linter

* Fix 4bit test and remove int8 test boilerplate

* Update tests/bnb/test_mixed_int8.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
```
  12240925
- Finishing tidying keys to ignore on load (#24535) · 89b6ee49
  Sylvain Gugger authored Jun 27, 2023
  
  89b6ee49
27 Jun, 2023 14 commits

Fix Typo (#24530) · 04f46a22
MS Kim(tony9402) authored Jun 28, 2023
```
* Fix Typo

* Fix all copies
```
04f46a22
Allow backbones not in backbones_supported - Maskformer Mask2Former (#24532) · 462f77cb
amyeroberts authored Jun 27, 2023
```
Allow backbones not in backbones_supported
```
462f77cb

Clean load keys (#24505) · 8e5d1619

Sylvain Gugger authored Jun 27, 2023

* Preliminary work on some models

* Fix test load missing and make sure nonpersistent buffers are tested

* Always ignore nonpersistent buffers if in state_dict

* Treat models

* More models

* Treat remaining models

* Fix quality

* Fix tests

* Remove draft

* This test is not needed anymore

* Fix copies

* Fix last test

* Newly added models

* Fix last tests

* Address review comments

8e5d1619

[Mask2Former] Remove SwinConfig (#24259) · 53194991
NielsRogge authored Jun 27, 2023
```
Remove SwinConfig
```
53194991
Fix LR scheduler based on bs from auto bs finder (#24521) · fb6a6276
Zach Mueller authored Jun 27, 2023
```
* One solution

* args -> self
```
fb6a6276
Find module name in an OS-agnostic fashion (#24526) · 38db04ec
Sylvain Gugger authored Jun 27, 2023
```
* Find module name in an OS-agnostic fashion

* address review comment
```
38db04ec
Update `huggingface_hub` commit sha (#24527) · 7d150d68
Yih-Dar authored Jun 27, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
7d150d68
set model to training mode before accelerate.prepare (#24520) · 4e8929dc
Wang, Yi authored Jun 27, 2023

4e8929dc

[`T5`] Add T5ForQuestionAnswering and MT5ForQuestionAnswering (#24481) · 06910f5a

Sebastian authored Jun 27, 2023



* Adding T5ForQuestionAnswering

* Changed weight initialization that results in better initial loss when fine-tuning

* Update to class variables

* Running make fixup

* Running make fix-copies

* Remove model_parallel

* Adding MT5ForQuestionAnswering

* Adding docs

* Fix wrong doc

* Update src/transformers/models/mt5/modeling_mt5.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/models/t5/modeling_t5.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* File formatting

* Undoing change

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

06910f5a

Update hyperparameter_search.py (#24515) · bcf02ec7
Sourab Mangrulkar authored Jun 27, 2023
```
* Update hyperparameter_search.py

* resolve comments
```
bcf02ec7

use accelerate autocast in jit eval path, since mix precision logic is… (#24460) · 6fe8d198

Wang, Yi authored Jun 27, 2023



use accelerate autocast in jit eval path, since mix precision logic is in accelerator currently
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

6fe8d198

🌐

[i18n-KO] Translated `tflite.mdx` to Korean (#24435) · 0863436b

Hyeonseo Yun authored Jun 27, 2023



* docs: ko: tflite.mdx

* feat: nmt and manual edit `tflite.mdx`

* revised: resolve suggestions tflite.mdx
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* revised: resolve suggestions and new line tflite.mdx
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>

---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

0863436b

Fix poor past ci (#24485) · 4abd3ee4

Yih-Dar authored Jun 27, 2023



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4abd3ee4

Fix TypeError: Object of type int64 is not JSON serializable (#24340) · 239ace15

Xiaoli Wang authored Jun 27, 2023

* Fix TypeError: Object of type int64 is not JSON serializable

* Convert numpy.float64 and numpy.int64 to float and int for json serialization

* Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py

* * make style

239ace15