Commits · 0b5024ce725a0f6b6d8cfe740e7a2a6021257c37 · chenpangpang / transformers

20 Sep, 2023 9 commits

[`Trainer`] Refactor trainer + bnb logic (#26248) · 0b5024ce
Younes Belkada authored Sep 20, 2023
```
* refactor trainer + bnb logic

* remove logger.info

* oops
```
include changes from llama (#26260) · f94c9b3d
Arthur authored Sep 20, 2023
```
* include changes from llama

* add a test
```
add bbox input validation (#26294) · 00247ea0
Jinho Park authored Sep 20, 2023

fix deepspeed available detection (#26252) · 24553206
fxmarty authored Sep 20, 2023

Rewrite for custom code warning messages (#26291) · f29fe745
Matt authored Sep 20, 2023
```
Quick britpicking for some warning messages!
```

Integrate AMD GPU in CI/CD environment (#26007) · 2d71307d

Funtowicz Morgan authored Sep 20, 2023

* Add a Dockerfile for PyTorch + ROCm based on official AMD released artifact

* Add a new artifact single-amdgpu testing on main

* Attempt to test the workflow without merging.

* Changed BERT to check if things are triggered

* Meet the dependencies graph on workflow

* Revert BERT changes

* Add check_runners_amdgpu to correctly mount and check availability

* Rename setup to setup_gpu for CUDA and add setup_amdgpu for AMD

* Fix all the needs.setup -> needs.setup_[gpu|amdgpu] dependencies

* Fix setup dependency graph to use check_runner_amdgpu

* Let's do the runner status check only on AMDGPU target

* Update the Dockerfile.amd to put ourselves in / rather than /var/lib

* Restore the whole setup for CUDA too.

* Let's redisable them

* Change BERT to trigger tests

* Restore BERT

* Add torchaudio with rocm 5.6 to AMD Dockerfile (#26050)

fix dockerfile
Co-authored-by: Felix Marty <felix@hf.co>

* Place AMD GPU ...

Update bros checkpoint (#26277) · 37c205eb
Jinho Park authored Sep 20, 2023
```
* fix bros integration test

* update bros checkpoint
```
fix name error when accelerate is not available (#26278) · 86ffd5ff
Sourab Mangrulkar authored Sep 20, 2023
```
* fix name error when accelerate is not available

* fix `is_fsdp_available`
```

FSDP tests and checkpointing fixes (#26180) · 382ba670

Sourab Mangrulkar authored Sep 20, 2023

* add fsdp tests

* Update test_fsdp.py

* Update test_fsdp.py

* fixes

* checks

* Update trainer.py

* fix

* fixes for saving/resuming checkpoints

* fixes

* add tests and delete debug statements

* fixing tests

* Update test_fsdp.py

* fix tests

* fix tests

* minor nits

* fix code style and quality

* refactor and modularize test code

* reduce the time of tests

* reduce the test time

* fix test

* reduce test time

* reduce test time

* fix failing tests

* fix

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* resolve comments

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

19 Sep, 2023 6 commits

[FIX] resize_token_embeddings (#26102) · 8e3980a2

Sam Passaglia authored Sep 20, 2023

* fix roundup command

* add test for resize_token_embeddings

* Update tests/test_modeling_common.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* style

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

DeepSpeed ZeRO-3 handling when resizing embedding layers (#26259) · ffbf989f
Sourab Mangrulkar authored Sep 20, 2023
```
* fix failing deepspeed slow tests

* fixes
```

Fix `Error` not captured in PR doctesting (#26215) · 39df4eca

Yih-Dar authored Sep 19, 2023

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Add ViTMatte (#25843) · 7d6354e0

NielsRogge authored Sep 19, 2023

* First draft

* Simplify image processor

* Fix rebase

* Address comments

* Address more comments

* Address more comments

* Address more comments

* Address more comments

* Improve pad_image

* Add tests

* Update integration test

* Fix image processor tests

* Fix model tests

* Convert checkpoints

* Fix doc tests

* Remove file

* Apply suggestions

* Address comments

* Fix typing hint

* Add batch_norm_eps

* Address comments

* Fix style

Fix gated repo tests (#26257) · 04191ea1
Lucain authored Sep 19, 2023
```
* Fix gated repo tests

* Apply suggestions from code review
```
Fix some docstring in image processors (#26235) · eb848997
Yih-Dar authored Sep 19, 2023
```
Fix doc
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```

18 Sep, 2023 18 commits

Fix the gitlab user mention in issue templates to the correct user (#26237) · e469be34
Ralf Müller-Zimmermann authored Sep 19, 2023

[docs] Fix model reference in zero shot image classification example (#26206) · 373d0d99
Aleksandar Ivanovski authored Sep 19, 2023

Update add_new_pipeline.md (#26197) · 500dfb5b
Nino Risteski authored Sep 19, 2023
```
fixed a few typos
```
Update README.md (#26198) · 7d4e0c23
Nino Risteski authored Sep 19, 2023
```
Fixed a few typos
```
[AutoBackbone] Add test (#26094) · de8bec6d
NielsRogge authored Sep 18, 2023
```
* Add test

* Add config_class
```
Create the return value on device to avoid unnecessary copying from CPU (#26151) · 97f439ae
mksit authored Sep 18, 2023

🌐 [i18n-KO] Translated `whisper.md` to Korean (#26002) · 42791a57

SeongWooChoi authored Sep 19, 2023

* docs: ko-whisper.md

* fix: chatgpt draft

* feat: manual edits

* Feat: manual edits

* fix: resolve suggestions
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

---------
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

🚨 [`Tokenizer`] attempt to fix add_token issues 🚨 (#23909) · 2da88537

Arthur authored Sep 18, 2023

* fix test for bart. Order is correct now let's skip BPEs

* ouf

* styling

* fix bert....

* slow refactoring

* current updates

* massive refactoring

* update

* NICE!

* update to see where I am at

* updates

* update

* update

* revert

* updates

* updates

* start supporting legacy_save

* styling

* big update

* revert some changes

* nits

* nniiiiiice

* small fixes

* kinda fix t5 with new behaviour

* major update

* fixup

* fix copies

* today's updates

* fix byt5

* update

* update

* update

* updates

* update vocab size test

* Barthez does not need the fairseq offset ids

* super call must be after

* call super

* move all super init

* move other super init

* fixup

* nits

* more fixes

* nits

* more fixes

* nits

* more fix

* remove useless files

* ouch all of them are affected
...

[Check] Fix config docstring (#26222) · 835b0a05
Sanchit Gandhi authored Sep 18, 2023

[Persimmon] Style fix (#26228) · e5f7e03b
Sanchit Gandhi authored Sep 18, 2023
```
fix copies
```
[Wav2Vec2-Conf / LLaMA] Style fix (#26188) · e4e55af7
Sanchit Gandhi authored Sep 18, 2023
```
* torch.nn -> nn

* fix llama

* copies
```
refactor: change default block_size in block size > max position embeddings (#26069) · 8b5da9fc
Phuc Van Phan authored Sep 18, 2023
```
* refactor: change default block_size when not initialized

* reformat: add the min of block size
```
refactor decay_parameters production into its own function (#26152) · c63e2701
Shijie Wu authored Sep 18, 2023

[FSMT] Fix non-shared weights (#26187) · 77ed9fa1
Lysandre Debut authored Sep 18, 2023
```
* Fix non-shared weights

* Add tests

* Edit tied weights keys
```

Fix ConversationalPipeline tests (#26217) · f0a6057f

Matt authored Sep 18, 2023

Add BlenderbotSmall templates and correct handling for conversation.past_user_inputs

moved `ctrl` to `Salesforce/ctrl` (#26183) · bc7ce180

Julien Chaumond authored Sep 18, 2023

* moved `ctrl` to `Salesforce/ctrl`

redirects should theoretically work, but still updating those repo references for clarity

* Fixup

* Slow doc tests

* Add modeling file

---------
Co-authored-by: Lysandre <lysandre@huggingface.co>

Remove `utils/documentation_tests.txt` (#26213) · f02b915b

Yih-Dar authored Sep 18, 2023

* update

* update

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

No doctest for `convert_bros_to_pytorch.py` (#26212) · d020a2b8
Yih-Dar authored Sep 18, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```

15 Sep, 2023 7 commits

[PEFT] Allow PEFT model dict to be loaded (#25721) · 0a55d9f7

Patrick von Platen authored Sep 15, 2023

* Allow PEFT model dict to be loaded

* make style

* make style

* Apply suggestions from code review

* address comments

* fixup

* final change

* added tests

* fix test

* better logic for handling if adapter has been loaded

* Update tests/peft_integration/test_peft_integration.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

[docs] IDEFICS guide and task guides restructure (#26035) · 8b134714

Maria Khalusova authored Sep 15, 2023

* initial commit for the IDEFICS task guide

* conversational example

* updated TOC

* fixed typos

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* addressed feedback

* bad_words_ids

* Apply suggestions from code review
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* rank classification note

* feedback addressed

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Victor SANH <victorsanh@gmail.com>

Fix pad to multiple of (#25732) · eb644980

Arthur authored Sep 15, 2023

* nits

* update the test

* nits

* update

* fix bark

* fix bark tests and allow padding to multiple of without new tokens

Update notebook.py to support multi eval datasets (#25796) · ebd21e90

Matrix authored Sep 15, 2023

* Update notebook.py

fix multi eval datasets

* Update notebook.py

* Update notebook.py

using `black` to reformat

* Update notebook.py

support Validation Loss

* Update notebook.py

reformat

* Update notebook.py

[Whisper] Check length of prompt + max new tokens (#26164) · c7b4d0b4
Sanchit Gandhi authored Sep 15, 2023

Tweaks to Chat Templates docs (#26168) · 2518e368

Matt authored Sep 15, 2023

* Put tokenizer methods in the right alphabetical order in the docs

* Quick tweak to ConversationalPipeline

* Typo fixes in the developer doc

* make fixup

[TTA Pipeline] Test MusicGen and VITS (#26146) · d70fab8b
Sanchit Gandhi authored Sep 15, 2023
