- 01 Jul, 2023 1 commit
-
-
ydshieh authored
-
- 30 Jun, 2023 8 commits
-
-
Serge Matveenko authored
* Limit Pydantic to V1 in dependencies Pydantic is about to release V2, which will break a lot of things. This change prevents `transformers` from being used with Pydantic V2 to avoid breaking things. * more --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
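A minimal sketch of what such a dependency pin can look like; the file name and list layout below are assumptions for illustration, not taken from this log:

```python
# setup.py (sketch) -- keep Pydantic on the 1.x series so the upcoming
# Pydantic V2 release cannot be installed alongside transformers.
install_requires = [
    # ... other runtime dependencies ...
    "pydantic<2",  # assumption: a plain upper bound is enough to exclude V2
]
```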
-
Yih-Dar authored
* fix * fix * fix * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Stas Bekman authored
* [modeling_clip.py] improve readability * apply to other models * fix
-
Matt authored
* hidden layers, huh, what are they good for (absolutely nothing) * Some tests break with 1 hidden layer, use 2 * Use 1 hidden layer in a few slow models * Use num_hidden_layers=2 everywhere * Slightly higher tol for groupvit * Slightly higher tol for groupvit
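The pattern described here can be sketched with a deliberately tiny configuration; the model class and sizes below are illustrative assumptions, not the exact test code:

```python
from transformers import BertConfig, BertModel

# Tiny model for fast unit tests: keep num_hidden_layers at 2 because some
# tests break with a single hidden layer. All sizes here are illustrative.
tiny_config = BertConfig(
    hidden_size=32,
    num_hidden_layers=2,
    num_attention_heads=4,
    intermediate_size=37,
)
tiny_model = BertModel(tiny_config)
print(sum(p.numel() for p in tiny_model.parameters()))  # small parameter count
```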
-
Yih-Dar authored
* fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
JB (Don) authored
* Adding warning messages to BERT for missing attention masks These warning messages are shown when there are pad tokens within the input ids and no attention masks are given. The warning message should only show up once. * Adding warning messages to BERT for missing attention masks These warning messages are shown when the pad_token_id is not None and no attention masks are given. The warning message should only show up once. * Ran fix copies to copy over the changes to some of the other models * Add logger.warning_once.cache_clear() to the test * Shows warning when there are no attention masks and input_ids start/end with pad tokens * Using warning_once() instead and fix indexing in input_ids check --------- Co-authored-by: JB Lau <hckyn@voyager2.local>
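A rough sketch of the check this commit describes; the helper name, condition, and message wording are assumptions, and the real logic lives in the BERT modeling code:

```python
from transformers.utils import logging

logger = logging.get_logger(__name__)

def warn_if_padding_and_no_attention_mask(input_ids, attention_mask, pad_token_id):
    # Warn (once, via logger.warning_once) when input_ids start or end with the
    # pad token but no attention_mask was supplied. Names/conditions are assumptions.
    if attention_mask is not None or pad_token_id is None:
        return
    if (input_ids[:, 0] == pad_token_id).any() or (input_ids[:, -1] == pad_token_id).any():
        logger.warning_once(
            "Padding tokens were detected in input_ids, but no attention_mask was passed. "
            "Results may be unexpected; please provide an attention_mask."
        )
```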
-
Jeroen Van Goey authored
* Update link to RunHouse hardware setup documentation. * Fix link to hardware setup in other location as well
-
Arthur authored
* don't add space before single letter chars that don't have a merge * fix the fix * fixup * add a test * more testing * fixup * hack to make sure fast is also fixed * update switch transformers test * revert convert slow * Update src/transformers/models/t5/tokenization_t5.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add typechecking * quality --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
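A small sketch of the kind of slow/fast consistency check this fix targets; the checkpoint and example string are assumptions:

```python
from transformers import AutoTokenizer

# Compare the slow (SentencePiece) and fast T5 tokenizers on a string with
# single-letter pieces; after the fix the two outputs are expected to match.
slow = AutoTokenizer.from_pretrained("t5-small", use_fast=False)
fast = AutoTokenizer.from_pretrained("t5-small", use_fast=True)

text = "A B C d"
print(slow.tokenize(text))
print(fast.tokenize(text))
```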
-
- 29 Jun, 2023 13 commits
-
-
Sourab Mangrulkar authored
* fix push to hub for peft ckpts * oops
-
MS Kim(tony9402) authored
* fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations * fix annotations
-
Yih-Dar authored
* fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Fix ESM models buffers * Remove modifs * Tied weights keys are needed silly * quality
-
amyeroberts authored
* Removal of deprecated methods and specify versions * Fix tests
-
Yih-Dar authored
* fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sanchit Gandhi authored
* Add Audiocraft * add cross attention * style * add for lm * convert and verify * introduce t5 * split configs * load t5 + lm * clean conversion * copy from t5 * style * start pattern provider * make generation work * style * fix pos embs * propagate shape changes * propagate shape changes * style * delay pattern: pad tokens at end * audiocraft -> musicgen * fix inits * add mdx * style * fix pad token in processor * override generate and add todos * add init to test * undo pattern delay mask after gen * remove cfg logits processor * remove cfg logits processor * remove logits processor in favour of mask * clean pos embs * make fix copies * update readmes * clean pos emb * refactor encoder/decoder * make fix copies * update conversion * fix config imports * update config docs * make style * send pattern mask to device * pattern mask with delay * recover prompted audio tokens * fix docstrings * laydown test file * pattern edge case * remove t5 ref * add processing class * config refactor * better pattern comment * check if mask is not present * check if mask is not present * refactor to auto class * remove encoder configs * fix processor * processor import * start updating conversion * start updating tests * make style * convert t5, encodec, lm * convert as composite * also convert processor * run generate * classifier free gen * comments and clean up * make style * docs for logit proc * docstring for uncond gen * start lm tests * work tests * let the lm generate * refactor: reshape inside forward * undo greedy loop changes * from_enc_dec -> from_sub_model * fix input id shapes in docstrings * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* undo generate changes * from sub model config * Update src/transformers/models/musicgen/modeling_musicgen.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* make generate work again * generate uncond -> get uncond inputs * remove prefix allowed tokens fn * better error message * logit proc checks * Apply suggestions from code review Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* make decoder only tests work * composite fast tests * make style * uncond generation * feat extr padding * make audio prompt work * fix inputs docstrings * unconditional inputs: dict -> model output * clean up tests * more clean up tests * make style * t5 encoder -> auto text encoder * remove comments * deal with frames * fix auto text * slow tests * nice mdx * remove can generate * todo - hub id * convert m/l * make fix copies * only import generation with torch * ignore decoder from tests * don't wrap uncond inputs * make style * cleaner uncond inputs * add example to musicgen forward * fix docs * ignore MusicGen Model/ForConditionalGeneration in auto mapping * add doc section to toctree * add to doc tests * add processor tests * fix push to hub in conversion * tips for decoder only loading * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* fix conversion for s / m / l checkpoints * import stopping criteria from module * remove from pipeline tests * fix uncond docstring * decode audio method * fix docs * org: sanchit-gandhi -> facebook * fix max pos embeddings * remove auto doc (not compatible with shapes) * bump max pos emb * make style * fix doc * fix config doc * fix config doc * ignore musicgen config from docstring * make style * fix config * fix config for doctest * consistent from_sub_models * don't automap decoder * fix mdx save audio file * fix mdx save audio file * processor batch decode for audio * remove keys to ignore * update doc md * update generation config * allow changes for default generation config * update tests * make style * fix docstring for uncond * fix processor test * fix processor test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
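The new model can be exercised with a short generation script; a minimal usage sketch, where the checkpoint name and token budget are assumptions:

```python
from transformers import AutoProcessor, MusicgenForConditionalGeneration

# Text-to-music with the MusicGen model added in this commit. Checkpoint name
# and generation length below are assumptions, not taken from the log.
processor = AutoProcessor.from_pretrained("facebook/musicgen-small")
model = MusicgenForConditionalGeneration.from_pretrained("facebook/musicgen-small")

inputs = processor(text=["80s pop track with bassy drums and synth"], return_tensors="pt")
audio_values = model.generate(**inputs, max_new_tokens=256)
print(audio_values.shape)  # (batch_size, num_channels, num_samples)
```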
-
Sylvain Gugger authored
Revert "Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (#24549)" This reverts commit c5e29d43.
-
Joao Gante authored
4 bit doc corrections
-
MS Kim(tony9402) authored
* fix annotations * fix copies
-
MS Kim(tony9402) authored
-
amyeroberts authored
* Update old existing feature extractor references * Typo * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review * Address comments from review - update 'feature extractor' Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
-
Pasquale De Marinis authored
* fixed OwlViTModel inplace operations * fixed operands order in owlvit
-
- 28 Jun, 2023 14 commits
-
-
-
Yih-Dar authored
fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Younes Belkada authored
* add 8bit instructblip test * update tests
-
amyeroberts authored
Make sure feature_extractor is defined in all cases
-
Younes Belkada authored
add gpt2-xl test
-
Yih-Dar authored
* fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Max Ryabinin authored
* Fix typing annotations for FSDP and DeepSpeed in TrainingArguments * Change dict to Dict
-
Frank995 authored
* Warn only in enable full determinism * Add option in the function definition
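A usage sketch of the option described here; the keyword name is an assumption:

```python
from transformers import enable_full_determinism

# Seed everything for reproducibility, but only warn (instead of raising)
# when an op has no deterministic implementation.
enable_full_determinism(seed=42, warn_only=True)
```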
-
Yih-Dar authored
* fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Dario Sučić authored
* Add bitsandbytes support for gpt2 models * Guard Conv1D import to pass tensorflow test * Appease ruff linter * Fix 4bit test and remove int8 test boilerplate * Update tests/bnb/test_mixed_int8.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
--------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
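A loading sketch for the GPT-2 quantization support added here; the checkpoint, compute dtype, and prompt are assumptions:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Load a Conv1D-based GPT-2 checkpoint in 4-bit via bitsandbytes.
quant_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)
model = AutoModelForCausalLM.from_pretrained("gpt2-xl", quantization_config=quant_config, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained("gpt2-xl")

inputs = tokenizer("Hello, my name is", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```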
-
Sylvain Gugger authored
-
- 27 Jun, 2023 4 commits
-
-
MS Kim(tony9402) authored
* Fix Typo * Fix all copies
-
amyeroberts authored
Allow backbones not in backbones_supported
-
Sylvain Gugger authored
* Preliminary work on some models * Fix test load missing and make sure nonpersistent buffers are tested * Always ignore nonpersistent buffers if in state_dict * Treat models * More models * Treat remaining models * Fix quality * Fix tests * Remove draft * This test is not needed anymore * Fix copies * Fix last test * Newly added models * Fix last tests * Address review comments
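The buffer behaviour this commit standardizes can be shown with a plain PyTorch sketch (the module and buffer names are illustrative):

```python
import torch
from torch import nn

# A buffer registered with persistent=False never appears in the state_dict,
# so loading a checkpoint must not report it as a missing key.
class ToyModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.register_buffer("position_ids", torch.arange(16), persistent=False)

module = ToyModule()
print("position_ids" in module.state_dict())  # False: nonpersistent buffers are skipped
```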
-
NielsRogge authored
Remove SwinConfig
-