Commits · bd50402b56980ff17e957342ef69bd9b0dd45a7b · chenpangpang / transformers

28 Nov, 2023 11 commits

[docs] Quantization (#27641) · bd50402b
Steven Liu authored Nov 28, 2023
```
* first draft

* benchmarks

* feedback
```
bd50402b
Docs: Fix broken cross-references, i.e. `~transformer.` -> `~transformers.` (#27740) · f2ad4b53
Tom Aarsen authored Nov 28, 2023
```
~transformer. -> ~transformers.
```
f2ad4b53
CLVP Fixes (#27547) · dfbd209c
Susnato Dhar authored Nov 28, 2023
```
* fixes

* more fixes

* style fix

* more fix

* comments
```
dfbd209c
Trigger corresponding pipeline tests if `tests/utils/tiny_model_summary.json` is modified (#27693) · 30e92ea3
Yih-Dar authored Nov 28, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
30e92ea3
Enforce pin memory disabling when using cpu only (#27745) · 0b9c9345
Quentin Gallouédec authored Nov 28, 2023
```
if use_cpu: dataloader_pin_memory = False
```
0b9c9345

Add madlad-400 MT models (#27471) · fdd86eed

Juarez Bochi authored Nov 28, 2023



* Add madlad-400 models

* Add madlad-400 to the doc table

* Update docs/source/en/model_doc/madlad-400.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Fill missing details in documentation

* Update docs/source/en/model_doc/madlad-400.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Do not doctest madlad-400

Tests are timing out.

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

fdd86eed

Log a warning in `TransfoXLTokenizer.__init__` (#27721) · 6336a7f7
Yih-Dar authored Nov 28, 2023
```
* log

* log

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6336a7f7
Update tiny model creation script (#27674) · 93170298
Yih-Dar authored Nov 28, 2023
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
93170298

Add BeitBackbone (#25952) · 1fb3c23b

NielsRogge authored Nov 28, 2023



* First draft

* Add backwards compatibility

* More improvements

* More improvements

* Improve error message

* Address comment

* Add conversion script

* Fix style

* Update code snippet

* Adddress comment

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

1fb3c23b

Fix AMD Push CI not triggered (#27732) · 7a757bb6

Yih-Dar authored Nov 28, 2023



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

7a757bb6

Fixed passing scheduler-specific kwargs via TrainingArguments lr_scheduler_kwargs (#27595) · 2ca73e5e
Charbel Abi Daher authored Nov 28, 2023
```
* Fix passing scheduler-specific kwargs through TrainingArguments `lr_scheduler_kwargs`

* Added test for lr_scheduler_kwargs
```
2ca73e5e

27 Nov, 2023 13 commits

Translate `en/model_doc` to JP (#27264) · 0864dd3b

Rockerz authored Nov 28, 2023



* Add `model_docs`

* Add

* Update Model adoc

* Update docs/source/ja/model_doc/bark.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/model_doc/beit.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/model_doc/bit.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/model_doc/blenderbot.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/model_doc/blenderbot-small.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* update reiew-1

* Update toctree.yml

* translating docs and fixes of PR #27401

* Update docs/source/ja/model_doc/bert.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ja/model_doc/bert-generation.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update the model docs

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

0864dd3b

translation main-class files to chinese (#27588) · cad1b119

jiaqiw09 authored Nov 28, 2023



* translate work

* update

* update

* update [[autodoc]]

* Update callback.md

---------
Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>

cad1b119

Update chat template warnings/guides (#27634) · 74a3cebf

Matt authored Nov 27, 2023



* Update default ChatML template

* Update docs/warnings

* Update docs/source/en/chat_templating.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Slight rework

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

74a3cebf

docs: replace torch.distributed.run by torchrun (#27528) · ce315081

Peter Pan authored Nov 28, 2023



* docs: replace torch.distributed.run by torchrun

 `transformers` now officially support pytorch >= 1.10.
 The entrypoint `torchrun`` is present from 1.10 onwards.
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>

* Update src/transformers/trainer.py

with @ArthurZucker's suggestion
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

ce315081

Fix owlv2 code snippet (#27698) · c832bcb8
NielsRogge authored Nov 27, 2023
```
* Fix code snippet

* Improve code snippet
```
c832bcb8
Modify group_sub_entities in TokenClassification Pipeline to support label with "-" (#27325) · 334a6d18
Yixiao Yuan authored Nov 27, 2023
```
* fix group_sub_entities bug

* add space
```
334a6d18
Update forward signature test for vision models (#27681) · 59499bbe
NielsRogge authored Nov 27, 2023
```
* Update forward signature

* Empty-Commit
```
59499bbe

fix assisted decoding assistant model inputs (#27503) · 1d7f406e

jiqing-feng authored Nov 27, 2023

* fix assisted decoding attention_cat

* fix attention_mask for assisted decoding

* fix attention_mask len

* fix attn len

* Use a more clean way to prepare assistant models inputs

* fix param meaning

* fix param name

* fix assistant model inputs

* update token type ids

* fix assistant kwargs copy

* add encoder-decoder tests of assisted decoding

* check if assistant kwargs contains updated keys

* revert test

* fix whisper tests

* fix assistant kwargs

* revert whisper test

* delete _extend funcs

1d7f406e

Fix oneformer instance segmentation RuntimeError (#27725) · 307cf3a2
yhshin11 authored Nov 27, 2023

307cf3a2

Fix mistral generate for long prompt / response (#27548) · b09912c8

Yanan Xie authored Nov 27, 2023

* Fix mistral generate for long prompt / response

* Add unit test

* fix linter

* fix linter

* fix test

* add assisted generation test for mistral and load the model in 4 bit + fa2

b09912c8

Reorder the code on the Hub to explicit that sharing on the Hub isn't a requirement (#27691) · 27b752bc
Lysandre Debut authored Nov 27, 2023
```
Reorder
```
27b752bc
fix warning (#27689) · 5c30dd40
Arthur authored Nov 27, 2023

5c30dd40

Fix Past CI (#27696) · e11e26df

Yih-Dar authored Nov 27, 2023



fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e11e26df

26 Nov, 2023 1 commit

Fix sliding_window hasattr in Mistral (#27041) · f70db283

Ilya Gusev authored Nov 26, 2023



* Fix sliding_window hasattr in Mistral

* hasattr -> getattr for sliding_window in Mistral

---------
Co-authored-by: Ilya Gusev <ilya.gusev@booking.com>

f70db283

24 Nov, 2023 10 commits

Fix `TVPModelTest` (#27695) · 35551f9a

Yih-Dar authored Nov 24, 2023



* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

35551f9a

Successfully Resolved The ZeroDivisionError Exception. (#27524) · 29c94808

Chi authored Nov 24, 2023

* Successfully resolved the ZeroDivisionError exception in the utils.notebook.y file.

* Now I update little code mentioned by Peter

* Using Black package to reformat my file

* Now I using ruff libary to reformated my file

29c94808

Reflect RoCm support in the documentation (#27636) · c13a43aa

fxmarty authored Nov 24, 2023



* reflect RoCm support in the documentation

* Update docs/source/en/main_classes/trainer.md
Co-authored-by: Lysandre Debut <hi@lysand.re>

* fix review comments

* use ROCm instead of RoCm

---------
Co-authored-by: Lysandre Debut <hi@lysand.re>

c13a43aa

[`DocString`] Support a revision in the docstring `add_code_sample_docstrings`... · a6d178e2

Arthur authored Nov 24, 2023


[`DocString`] Support a revision in the docstring `add_code_sample_docstrings` to facilitate integrations (#27645)

* initial commit

* dummy changes

* style

* Update src/transformers/utils/doc.py
Co-authored-by: Alex McKinney <44398246+vvvm23@users.noreply.github.com>

* nits

* nit use ` if re.match(r'^refs/pr/\d*', revision):`

* restrict

* nit

* test the doc vuilder

* wow

* oke the order was wrong

---------
Co-authored-by: Alex McKinney <44398246+vvvm23@users.noreply.github.com>

a6d178e2

Fix semantic error in evaluation section (#27675) · 2098d343

Anirudh Haritas Murali authored Nov 24, 2023

Change "convert predictions to logits" to "convert logits to
predictions" to fix semantic error in the evaluation section. Logits
need to be converted to predictions to evaluate the accuracy, not the
other way round

2098d343

Docs/Add conversion code to the musicgen docs (#27665) · 181f85da
yoinked authored Nov 24, 2023
```
* Update musicgen.md

please make it less hidden

* Add cleaner formatting
```
181f85da

Fix typo in warning message (#27055) · 80e9f768

liuxueyang authored Nov 24, 2023

* Fix typo in warning message

The path of `default_cache_path` is hf_cache_home/hub. There is no
directory named transformers under hf_cache_home

* Fix a typo in comment

* Update the version number

v4.22.0 is the earlist version that contains those changes in PR #18492

80e9f768

Deprecate `TransfoXL` (#27607) · 7293fdc5

Yih-Dar authored Nov 24, 2023



* fix

* fix

* trigger

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <hi@lysand.re>

* tic

* revert

* revert

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Lysandre Debut <hi@lysand.re>

7293fdc5

Skip pipeline tests for 2 models for now (#27687) · 623432dc
Yih-Dar authored Nov 24, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
623432dc

Refactoring Trainer, adds `save_only_model` arg and simplifying FSDP integration (#27652) · a761d6e9

Sourab Mangrulkar authored Nov 24, 2023



* add code changes

1. Refactor FSDP
2. Add `--save_only_model` option: When checkpointing, whether to only save the model, or also the optimizer, scheduler & rng state.
3. Bump up the minimum `accelerate` version to `0.21.0`

* quality

* fix quality?

* Revert "fix quality?"

This reverts commit 149330a6abc078827be274db84c8a2d26a76eba1.

* fix fsdp doc strings

* fix quality

* Update src/transformers/training_args.py
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

* please fix the quality issue 😅



* Apply suggestions from code review
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

* address comment

* simplify conditional check as per the comment

* update documentation

---------
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

a761d6e9

23 Nov, 2023 5 commits

Update tiny model summary file (#27388) · b8db265b

Yih-Dar authored Nov 23, 2023



* update

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b8db265b

[DPT, Dinov2] Add resources (#27655) · fe1c16e9

NielsRogge authored Nov 23, 2023



* Add resources

* Remove script

* Update docs/source/en/model_doc/dinov2.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

fe1c16e9

Update TVP arxiv link (#27672) · b406c4d2
amyeroberts authored Nov 23, 2023
```
Update arxiv link
```
b406c4d2

Extended semantic segmentation to image segmentation (#27039) · baabd387

Merve Noyan authored Nov 23, 2023



* Extended semantic segmentation

* Update image_segmentation.md

* Changed title

* Update docs/source/en/tasks/semantic_segmentation.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update docs/source/en/tasks/semantic_segmentation.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update docs/source/en/tasks/semantic_segmentation.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update docs/source/en/tasks/semantic_segmentation.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update docs/source/en/tasks/semantic_segmentation.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update semantic_segmentation.md

* Update docs/source/en/tasks/semantic_segmentation.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update docs/source/en/tasks/semantic_segmentation.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Addressed Niels' and Maria's comments

* Added detail on panoptic segmentation

* Added redirection and renamed the file

* Update _toctree.yml

* Update _redirects.yml

* Rename image_segmentation.md to semantic_segmentation.md

---------
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

baabd387

[`FA2`] Add flash attention for opt (#26414) · 3bc50d81

Susnato Dhar authored Nov 23, 2023



* added flash attention for opt

* added to list

* fix use cache (#3)

* style fix

* fix text

* test fix2

* reverted until 689f599

* torch fx tests are working now!

* small fix

* added TODO docstring

* changes

* comments and .md file modification

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

3bc50d81