Commits · ae454f41d472617f3af0a184482c576c722b7e7e · chenpangpang / transformers

29 Jun, 2023 2 commits

Update old existing feature extractor references (#24552) · ae454f41

amyeroberts authored Jun 29, 2023

* Update old existing feature extractor references

* Typo

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* Address comments from review - update 'feature extractor'
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

ae454f41

Fixed OwlViTModel inplace operations (#24529) · 10c2ac7b
Pasquale De Marinis authored Jun 29, 2023
```
* fixed OwlViTModel inplace operations

* fixed operands order in owlvit
```
10c2ac7b

28 Jun, 2023 14 commits
- Update masked_language_modeling.md (#24560) · 66954ea2
  condor-cp authored Jun 28, 2023
```
See https://github.com/huggingface/transformers/issues/24546
```
  66954ea2
- Make PT/Flax tests could be run on GPU (#24557) · fd673510
  Yih-Dar authored Jun 28, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  fd673510
- Update PT/Flax weight conversion after #24030 (#24556) · faae8d82
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  faae8d82
- [`InstructBlip`] Add instruct blip int8 test (#24555) · 33b5ef5c
  Younes Belkada authored Jun 28, 2023
```
* add 8bit instructblip test

* update tests
```
  33b5ef5c
- Fix processor __init__ bug if image processor undefined (#24554) · c70c88a2
  amyeroberts authored Jun 28, 2023
```
Make sure feature_extractor is defined in all cases
```
  c70c88a2
- [`gpt2-int8`] Add gpt2-xl int8 test (#24543) · 903b97d8
  Younes Belkada authored Jun 28, 2023
```
add gpt2-xl test
```
  903b97d8
- Update `EncodecIntegrationTest` (#24553) · b0651655
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  b0651655
- Update PT/TF weight conversion after #24030 (#24547) · 6c57ce15
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  6c57ce15
- Fix typing annotations for FSDP and DeepSpeed in TrainingArguments (#24549) · c5e29d43
  Max Ryabinin authored Jun 28, 2023
```
* Fix typing annotations for FSDP and DeepSpeed in TrainingArguments

* Change dict to Dict
```
  c5e29d43
- Allow for warn_only selection in enable_full_determinism (#24496) · daccde14
  Frank995 authored Jun 28, 2023
```
* Warn only in enable full determinism

* Add option in the function definition
```
  daccde14
- Unpin DeepSpeed and require DS >= 0.9.3 (#24541) · 11cb6e0f
  Yih-Dar authored Jun 28, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  11cb6e0f
- ⚠️ Time to say goodbye to py37 (#24091) · e84bf1f7
  Yih-Dar authored Jun 28, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e84bf1f7
- Add bitsandbytes support for gpt2 models (#24504) · 12240925
  Dario Sučić authored Jun 28, 2023
```
* Add bitsandbytes support for gpt2 models

* Guard Conv1D import to pass tensorflow test

* Appease ruff linter

* Fix 4bit test and remove int8 test boilerplate

* Update tests/bnb/test_mixed_int8.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
```
  12240925
- Finishing tidying keys to ignore on load (#24535) · 89b6ee49
  Sylvain Gugger authored Jun 27, 2023
  
  89b6ee49
27 Jun, 2023 17 commits

Fix Typo (#24530) · 04f46a22
MS Kim(tony9402) authored Jun 28, 2023
```
* Fix Typo

* Fix all copies
```
04f46a22
Allow backbones not in backbones_supported - Maskformer Mask2Former (#24532) · 462f77cb
amyeroberts authored Jun 27, 2023
```
Allow backbones not in backbones_supported
```
462f77cb

Clean load keys (#24505) · 8e5d1619

Sylvain Gugger authored Jun 27, 2023

* Preliminary work on some models

* Fix test load missing and make sure nonpersistent buffers are tested

* Always ignore nonpersistent buffers if in state_dict

* Treat models

* More models

* Treat remaining models

* Fix quality

* Fix tests

* Remove draft

* This test is not needed anymore

* Fix copies

* Fix last test

* Newly added models

* Fix last tests

* Address review comments

8e5d1619

[Mask2Former] Remove SwinConfig (#24259) · 53194991
NielsRogge authored Jun 27, 2023
```
Remove SwinConfig
```
53194991
Fix LR scheduler based on bs from auto bs finder (#24521) · fb6a6276
Zach Mueller authored Jun 27, 2023
```
* One solution

* args -> self
```
fb6a6276
Find module name in an OS-agnostic fashion (#24526) · 38db04ec
Sylvain Gugger authored Jun 27, 2023
```
* Find module name in an OS-agnostic fashion

* address review comment
```
38db04ec
Update `huggingface_hub` commit sha (#24527) · 7d150d68
Yih-Dar authored Jun 27, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
7d150d68
set model to training mode before accelerate.prepare (#24520) · 4e8929dc
Wang, Yi authored Jun 27, 2023

4e8929dc

[`T5`] Add T5ForQuestionAnswering and MT5ForQuestionAnswering (#24481) · 06910f5a

Sebastian authored Jun 27, 2023



* Adding T5ForQuestionAnswering

* Changed weight initialization that results in better initial loss when fine-tuning

* Update to class variables

* Running make fixup

* Running make fix-copies

* Remove model_parallel

* Adding MT5ForQuestionAnswering

* Adding docs

* Fix wrong doc

* Update src/transformers/models/mt5/modeling_mt5.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/models/t5/modeling_t5.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* File formatting

* Undoing change

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

06910f5a

Update hyperparameter_search.py (#24515) · bcf02ec7
Sourab Mangrulkar authored Jun 27, 2023
```
* Update hyperparameter_search.py

* resolve comments
```
bcf02ec7

use accelerate autocast in jit eval path, since mix precision logic is… (#24460) · 6fe8d198

Wang, Yi authored Jun 27, 2023



use accelerate autocast in jit eval path, since mix precision logic is in accelerator currently
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

6fe8d198

🌐 [i18n-KO] Translated `tflite.mdx` to Korean (#24435) · 0863436b

Hyeonseo Yun authored Jun 27, 2023



* docs: ko: tflite.mdx

* feat: nmt and manual edit `tflite.mdx`

* revised: resolve suggestions tflite.mdx
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* revised: resolve suggestions and new line tflite.mdx
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>

---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

0863436b

Fix poor past ci (#24485) · 4abd3ee4

Yih-Dar authored Jun 27, 2023



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4abd3ee4

Fix TypeError: Object of type int64 is not JSON serializable (#24340) · 239ace15

Xiaoli Wang authored Jun 27, 2023

* Fix TypeError: Object of type int64 is not JSON serializable

* Convert numpy.float64 and numpy.int64 to float and int for json serialization

* Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py

* * make style

239ace15
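
For context on #24340: `json` rejects `numpy.int64` values because they are not instances of the builtin `int`, which is what produces the `TypeError` named in the commit title. The sketch below is not the code from the commit. It shows one common way to get the same effect, a `default=` hook that unwraps any scalar exposing `.item()`, as NumPy scalars do. The names `numpy_scalar_default` and `dump_metrics` are hypothetical.

```python
import json


def numpy_scalar_default(value):
    """`default=` hook for json.dumps: unwrap NumPy-style scalars via .item()."""
    item = getattr(value, "item", None)
    if callable(item):
        # NumPy scalars such as numpy.int64 return the matching builtin here.
        return item()
    # Mirror the standard error for anything we cannot convert.
    raise TypeError(f"Object of type {type(value).__name__} is not JSON serializable")


def dump_metrics(metrics):
    """Serialize a metrics dict whose values may be NumPy scalars."""
    return json.dumps(metrics, default=numpy_scalar_default)
```

With NumPy installed, `dump_metrics({"n": numpy.int64(3)})` would go through the hook, since `json` only calls `default` for objects it cannot serialize itself.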

Generate: `min_tokens_to_keep` has to be `>= 1` (#24453) · ac19871c
Joao Gante authored Jun 27, 2023

ac19871c
Generate: `group_beam_search` requires `diversity_penalty>0.0` (#24456) · 5f3efdf7
Joao Gante authored Jun 27, 2023
```
* add exception

* update docs
```
5f3efdf7

🚨 Fix group beam search (#24407) · 43479ef9

hukuda222 authored Jun 27, 2023



* group_beam_search now works correctly

* add argument descriptions

* add a comment

* format

* make style

* change comment

* Update src/transformers/generation/beam_search.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

---------
Co-authored-by: shogo.fujita <shogo.fujita@legalontech.jp>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

43479ef9

26 Jun, 2023 7 commits
- Fix link in utils (#24501) · 68c92981
  Gema Parreño authored Jun 26, 2023
```
* fix link

* new link

---------
Co-authored-by: Gema <gema@mbp-de-gema-2.lan>
```
  68c92981
- Compute `dropout_probability` only in training mode (SpeechT5) (#24498) · 7b4e3b5b
  Yih-Dar authored Jun 26, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  7b4e3b5b
- Fix 'local_rank' AttiributeError in Trainer class (#24297) · c9fd4985
  Tomoko Uchida authored Jun 27, 2023
```
fix attribute error
```
  c9fd4985
- Compute `dropout_probability` only in training mode (#24486) · 850cf4af
  Yih-Dar authored Jun 26, 2023
```
* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  850cf4af
- [`InstructBlip`] Add accelerate support for instructblip (#24488) · 9895670e
  Younes Belkada authored Jun 26, 2023
```
* add accelerate support for instructblip

* add `_keep_in_fp32_modules`

* dynamically adapt `_no_split_modules`

* better fix

* same logic for `_keep_in_fp32_modules`
```
  9895670e
- Add support for for loops in python interpreter (#24429) · 57579238
  Sylvain Gugger authored Jun 26, 2023
```
Add support for for loops
```
  57579238
- Update token_classification.md (#24484) · c2aa5e17
  condor-cp authored Jun 26, 2023
```
Add link to pytorch CrossEntropyLoss so that one understand why '-100' is ignore by the loss function.
```
  c2aa5e17
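
For context on #24484: PyTorch's `CrossEntropyLoss` has an `ignore_index` parameter that defaults to `-100`, so positions labelled `-100` are left out of the loss. The pure-Python sketch below only illustrates that masking with mean reduction. It is not PyTorch's implementation, and the function name `cross_entropy` is a hypothetical stand-in.

```python
import math

IGNORE_INDEX = -100  # default `ignore_index` of torch.nn.CrossEntropyLoss


def cross_entropy(logits, targets, ignore_index=IGNORE_INDEX):
    """Mean cross-entropy over positions whose target is not `ignore_index`."""
    losses = []
    for row, target in zip(logits, targets):
        if target == ignore_index:
            continue  # masked position: contributes nothing to the loss
        # log-sum-exp with max subtraction for numerical stability
        peak = max(row)
        log_norm = peak + math.log(sum(math.exp(x - peak) for x in row))
        losses.append(log_norm - row[target])
    # In this sketch, an all-masked batch yields NaN rather than raising.
    return sum(losses) / len(losses) if losses else float("nan")
```

In token classification this is why special tokens and non-first subword pieces are typically labelled `-100`: they stay in the batch but do not affect the averaged loss.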