Commits · a04ebc8b33314c42349c3e12885960a292c9c9dd · chenpangpang / transformers

14 Jun, 2023 7 commits

`Pix2StructImageProcessor` requires `torch>=1.11.0` (#24270) · a04ebc8b
Yih-Dar authored Jun 14, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a04ebc8b
Update check of core deps (#24277) · 8978b696
Sylvain Gugger authored Jun 14, 2023

8978b696
Adapt Wav2Vec2 conversion for MMS lang identification (#24234) · c4fec38b
Patrick von Platen authored Jun 14, 2023
```
* Add conversion for mms lid

* make style
```
c4fec38b
TF: CTRL with native embedding layers (#23456) · 4626df50
Joao Gante authored Jun 14, 2023

4626df50
Skip some `TQAPipelineTests` tests in past CI (#24267) · eac8dede
Yih-Dar authored Jun 14, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
eac8dede

QA doc: import torch before it is used (#24228) · 91b62f5a

ByronHsu authored Jun 14, 2023



* import torch before it is used

* style
Signed-off-by: byhsu <byhsu@linkedin.com>

---------
Signed-off-by: byhsu <byhsu@linkedin.com>
Co-authored-by: byhsu <byhsu@linkedin.com>

91b62f5a

Fix URL in comment for contrastive loss function (#24271) · 6ab045d6

TAE YOUNGDON authored Jun 14, 2023

* Update language_modeling.py

in "class TextDatasetForNextSentencePrediction(Dataset)", double considering "self.tokenizer.num_special_tokens_to_add(pair=True)" 

so, i remove self.block_size, and add parameter for "def create_examples_from_document". like "class LineByLineWithSOPTextDataset" do

* Update language_modeling.py

* Fix URL in comment for contrastive loss function

6ab045d6

13 Jun, 2023 17 commits

update FSDP save and load logic (#24249) · b89fcccd
Sourab Mangrulkar authored Jun 14, 2023
```
* update fsdp save and load logic

* fix

* see if this resolves the failing tests
```
b89fcccd

docs wrt using accelerate launcher with trainer (#24250) · e0603d89

Sourab Mangrulkar authored Jun 14, 2023



* update docs

* missing part

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address comments

* address Zach's comment

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e0603d89

Skip `GPT-J` fx tests for torch < 1.12 (#24256) · 23311314
Yih-Dar authored Jun 13, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
23311314

Stop storing references to bound methods via tf.function (#24146) · 3bd1fe43

Matt authored Jun 13, 2023

* Stop storing references to bound methods in tf.functions

* Remove the gc.collect calls now that we resolved the underlying problem

* Remove the default signature from model.serving entirely, big cleanup

* Remove _prune_signature as self.input_signature can prune itself

* Restore serving docstring

* Update int support test to check the input signature

* Make sure other tests also use model.input_signature and not serving.input_signature

* Restore _prune_signature

* Remove the doctest GC now it's no longer needed

* Correct core tests to use the pruned sig

* order lines correctly in core tests

* Add eager_serving back with a deprecation warning

3bd1fe43

Fix how we detect the TF package (#24255) · b979a206

Matt authored Jun 13, 2023

* Fix how we detect the TF package

* Add a comment as a talisman warding against future harm

* Actually put the comment in the right place

b979a206

Update urls in warnings for rich rendering (#24136) · e64d99fa

Ivan Reznikov authored Jun 13, 2023



* fixing typo in url in warnings

* fixing typo in url in warnings

* multi-line fix

* multi-line fix

* Update src/transformers/generation/utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/generation/flax_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/generation/tf_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

e64d99fa

Add `torch >=1.12` requirement for `Tapas` (#24251) · cf561d7c

Yih-Dar authored Jun 13, 2023



* fix

* fix

* fix

* Update src/transformers/models/tapas/modeling_tapas.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

cf561d7c

Generate: GenerationConfig can overwrite attributes at from_pretrained time (#24238) · b1ea6b4b
Joao Gante authored Jun 13, 2023
```
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
```
b1ea6b4b
TF: standardize `test_model_common_attributes` for language models (#23457) · 7bb6933b
Joao Gante authored Jun 13, 2023

7bb6933b
[Time Series] use mean scaler when scaling is a boolean True (#24237) · 4ed07528
Kashif Rasul authored Jun 13, 2023
```
* use mean scaler when scaling is boolean True

* remove debug
```
4ed07528

Tied params cleanup (#24211) · 695928e1

Sylvain Gugger authored Jun 13, 2023

* First test

* Add info for all models

* style

* Repo consistency

* Fix last model and cleanup prints

* Repo consistency

* Use consistent function for detecting tied weights

695928e1

deprecate `use_mps_device` (#24239) · 3723329d
Sourab Mangrulkar authored Jun 13, 2023

3723329d

fix overflow when training mDeberta in fp16 (#24116) · 3e142cb0

Sebastian authored Jun 13, 2023

* Porting changes from https://github.com/microsoft/DeBERTa/ that hopefully allows for fp16 training of mdeberta

* Updates to deberta modeling from microsoft repo

* Performing some cleanup

* Undoing changes that weren't necessary

* Undoing float calls

* Minimally change the p2c block

* Fix error

* Minimally changing the c2p block

* Switch to torch sqrt

* Remove math

* Adding back the to calls to scale

* Undoing attention_scores change

* Removing commented out code

* Updating modeling_sew_d.py to satisfy utils/check_copies.py

* Missed changed

* Further reduce changes needed to get fp16 working

* Reverting changes to modeling_sew_d.py

* Make same change in TF

3e142cb0

Safely import pytest in testing_utils.py (#24241) · f91810da
amyeroberts authored Jun 13, 2023

f91810da
Improving error message when using `use_safetensors=True`. (#24232) · fdd78d91
Nicolas Patry authored Jun 13, 2023

fdd78d91
Update `(TF)SamModelIntegrationTest` (#24199) · 74b846ca
Yih-Dar authored Jun 13, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
74b846ca

fix: TextIteratorStreamer cannot work with pipeline (#23641) · d7389cd2

yuanwu2017 authored Jun 13, 2023



* fix: TextIteratorStreamer cannot work with pipeline

Deepcopying the TextIteratorStreamer object causes the exception.
Signed-off-by: yuanwu <yuan.wu@intel.com>

* Update src/transformers/pipelines/text_generation.py

Got it. I will update the patch.
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/pipelines/text_generation.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update text_generation.py

---------
Signed-off-by: yuanwu <yuan.wu@intel.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

d7389cd2

12 Jun, 2023 16 commits

Fix README copies · 70c79940
Sylvain Gugger authored Jun 12, 2023

70c79940
Add the number of `model` test failures to slack CI report (#24207) · 41a8fa4e
Yih-Dar authored Jun 12, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
41a8fa4e
Finish dataloader integration (#24201) · 4da84008
Zach Mueller authored Jun 12, 2023

4da84008
Update `WhisperForAudioClassification` doc example (#24188) · 0675600a
Yih-Dar authored Jun 12, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
0675600a
Remove unnecessary aten::to overhead in llama (#24203) · e5dd7432
fxmarty authored Jun 13, 2023
```
* fix dtype init

* fix copies

* fix fixcopies mess

* edit forward as well

* copy
```
e5dd7432

Skip RWKV test in past CI (#24204) · 4fe9716a

Yih-Dar authored Jun 12, 2023



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4fe9716a

Fix steps bugs in no trainer examples (#24197) · f7d80cb3
Ethan authored Jun 12, 2023
```
Fix step bugs in no trainer + load checkpoint + grad acc
```
f7d80cb3
Fix `_load_pretrained_model` (#24200) · 08ae37c8
Marc Sun authored Jun 12, 2023
```
Fix test
```
08ae37c8
🚨🚨🚨 Replace DataLoader logic for Accelerate in Trainer, remove unneeded tests 🚨🚨🚨 (#24028) · ebd94b0f
Zach Mueller authored Jun 12, 2023
```
* Working integration

* Fix failing test

* Revert label host logic

* Bring it back!
```
ebd94b0f

🌐

[i18n-KO] Translated tasks_summary.mdx to Korean (#23977) · dc42a9d7

Kihoon Son authored Jun 13, 2023

* 🌐

 [i18n-KO] Translated tasks_summary.mdx to Korean
Co-Authored-By: Hyeonseo Yun <0525yhs@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>

* Apply suggestions from code review
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

* Update _toctree.yml

* Delete generation_strategies.mdx

* Delete tasks_explained.mdx

---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>

dc42a9d7

Generate: detect special architectures when loaded from PEFT (#24198) · 60b69f7d
Joao Gante authored Jun 12, 2023

60b69f7d

typo: fix typos in CONTRIBUTING.md and deepspeed.mdx (#24184) · 97527898

Jacob authored Jun 12, 2023



* typo: fix typos in CONTRIBUTING.md and deepspeed.mdx

* Update CONTRIBUTING.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

97527898

Update `GPTNeoXLanguageGenerationTest` (#24193) · dadc9fb4

Yih-Dar authored Jun 12, 2023



* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

dadc9fb4

Fix device issue in `OpenLlamaModelTest::test_model_parallelism` (#24195) · a9cdb059
Yih-Dar authored Jun 12, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a9cdb059
Generate: force caching on the main model, in assisted generation (#24177) · 9f81f4f6
Joao Gante authored Jun 12, 2023

9f81f4f6

[i18n]Translated "attention.mdx" to korean (#23878) · 535f92ae

Kihoon Son authored Jun 12, 2023



* [i18n]Translated "attention.mdx" to korean
Co-Authored-By: Hyeonseo Yun <0525yhs@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

* Update _toctree.yml

---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

535f92ae