Commits · 2629c8f36ac57e546ea45e611536351289fe4944 · chenpangpang / transformers

09 Oct, 2023 1 commit
- [DINOv2] Convert more checkpoints (#26177) · 2629c8f3
  NielsRogge authored Oct 09, 2023
```
* Convert checkpoints

* Update doc test

* Address comment
```
  2629c8f3
06 Oct, 2023 11 commits

docs(zh): review and punctuation & space fix (#26627) · 897a826d
Jabasukuriputo Wang authored Oct 06, 2023

897a826d
[docstring] Fix docstring for `AlbertConfig` (#26636) · 360ea8fc
Yih-Dar authored Oct 06, 2023
```
example fix docstring
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
360ea8fc

[`LlamaTokenizerFast`] Adds edge cases for the template processor (#26606) · 9ad815e4

Arthur authored Oct 06, 2023

* make sure eos and bos are properly handled for fast tokenizer

* fix code llama as well

* nits

* fix the conversion script as well

* fix failing test

9ad815e4

remove SharedDDP as it is deprecated (#25702) · 27597fea

statelesshz authored Oct 06, 2023



* remove SharedDDP as it was drepracated

* apply review suggestion

* make style

* Oops,forgot to remove the compute_loss context manager in Seq2SeqTrainer.

* remove the unnecessary conditional statement

* keep the logic of IPEX

* clean code

* mix precision setup & make fixup

---------
Co-authored-by: statelesshz <jihuazhong1@huawei.com>

27597fea

Fix failing `MusicgenTest .test_pipeline_text_to_audio` (#26586) · e840aa67
Yih-Dar authored Oct 06, 2023
```
* fix

* fix

* Fix

* Fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
e840aa67
fix RoPE t range issue for fp16 (#26602) · 87499420
rui-ren authored Oct 06, 2023

87499420
Update chat template docs with more tips on writing a template (#26625) · ea52ed9d
Matt authored Oct 06, 2023

ea52ed9d

Remove unnecessary unsqueeze - squeeze in rotary positional embedding (#26162) · 64845307

fxmarty authored Oct 06, 2023

* remove unnecessary unsqueeze-squeeze in llama

* correct other models

* fix

* revert gpt_neox_japanese

* fix copie

* fix test

64845307

Update tokenization_code_llama_fast.py (#26576) · 65aabafe

Tianqi Liu authored Oct 06, 2023

* Update tokenization_code_llama_fast.py

* Update test_tokenization_code_llama.py

* Update test_tokenization_code_llama.py

65aabafe

Fixed inconsistency in several fast tokenizers (#26561) · af38c837
Towdo authored Oct 06, 2023

af38c837

Remove unnecessary `view`s of `position_ids` (#26059) · 8878eb1b

Ramiro Leal-Cavazos authored Oct 06, 2023

* Remove unnecessary `view` of `position_ids` in `modeling_llama`

When `position_ids` is `None`, its value is generated using
`torch.arange`, which creates a tensor of size `(seq_length +
past_key_values_length) - past_key_values_length = seq_length`. The
tensor is then unsqueezed, resulting in a tensor of shape `(1,
seq_length)`. This means that the last `view` to a tensor of shape
`(-1, seq_length)` is a no-op.

This commit removes the unnecessary view.

* Remove no-op `view` of `position_ids` in rest of transformer models

8878eb1b

05 Oct, 2023 11 commits

Don't install `pytorch-quantization` in Doc Builder docker file (#26622) · 75a33d60
Yih-Dar authored Oct 05, 2023
```
Fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
75a33d60

[docs] Update to scripts building index.md (#26546) · 18fbeec8

Maria Khalusova authored Oct 05, 2023

* build the table in index.md with links to the model_doc

* removed list generation on index.md

* fixed missing models

* make style

18fbeec8

Fix `transformers-pytorch-gpu` docker build (#26615) · 9d206012
Yih-Dar authored Oct 05, 2023
```
Fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
9d206012
Don't close ClearML task if it was created externally (#26614) · 9e78c9ac
eajechiloae authored Oct 05, 2023
```
don't close clearml task if it was created externally
```
9e78c9ac

#26566 swin2 sr allow in out channels (#26568) · 0a3b9d02

Marvin Gabler authored Oct 05, 2023



* feat: close #26566, changed model & config files to accept arbitary in and out channels

* updated docstrings

* fix: linter error

* fix: update Copy docstrings

* fix: linter update

* fix: rename num_channels_in to num_channels to prevent breaking changes

* fix: make num_channels_out None per default

* Update src/transformers/models/swin2sr/configuration_swin2sr.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix: update tests to include num_channels_out

* fix:linter

* fix: remove normalization with precomputed rgb values when #input_channels!=#output_channels

---------
Co-authored-by: marvingabler <marvingabler@outlook.de>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

0a3b9d02

[`core`] fix silent bug `keep_in_fp32` modules (#26589) · e6d250e4
Younes Belkada authored Oct 05, 2023
```
* fix silent bug `keep_in_fp32` modules

* final fix

* added a common test.

* Trigger CI

* revert
```
e6d250e4

Make `ModelOutput` serializable (#26493) · 19f0b7dd

Charles Bensimon authored Oct 05, 2023

* Make `ModelOutput` serializable

Original PR from diffusers : https://github.com/huggingface/diffusers/pull/5234

* Black

19f0b7dd

Fix failing tests on `main` due to torch 2.1 (#26607) · 54e17a15
Yih-Dar authored Oct 05, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
54e17a15
[Falcon] Set `use_cache=False` before creating `presents` which relies on `use_cache` (#26328) · 2ab76c2c
Yun Dai authored Oct 05, 2023
```
* Set `presents=None` when `use_cache` is set to False for activation ckpt

* Update modeling_falcon.py

* fix black
```
2ab76c2c

[`GPTNeoX`] Faster rotary embedding for GPTNeoX (based on llama changes) (#25830) · 253f9a3f

Arthur authored Oct 05, 2023

* Faster rotary embedding for GPTNeoX

* there might be un-necessary moves from device

* fixup

* fix dtype issue

* add copied from statements

* fox copies

* oupsy

* add copied from Llama for scaled ones as well

* fixup

* fix

* fix copies

253f9a3f

[ `NougatProcessor`] Fix the default channel (#26608) · b4e66d7a
Arthur authored Oct 05, 2023
```
fix
```
b4e66d7a

04 Oct, 2023 14 commits

add zh translation for installation (#26084) · 43bfd093
Yeyang authored Oct 05, 2023
```
* translate installation to zh

* fix translation typo
```
43bfd093
[Wav2Vec2] Fix tokenizer set lang (#26349) · 2d8ee981
Sanchit Gandhi authored Oct 04, 2023
```
* fix wav2vec2 doctest

* suggestion

* fix

* final fix

* revert since we need AddedTokens
```
2d8ee981
Update mistral.md to update 404 link (#26590) · f9ab07f9
Galland authored Oct 04, 2023

f9ab07f9
skip flaky hub tests (#26594) · c037b2e3
Arthur authored Oct 04, 2023
```
skip flaky
```
c037b2e3
Fix encoder->decoder typo bug in convert_t5x_checkpoint_to_pytorch.py (#26587) · ca7912d1
Soyoung Yoon authored Oct 05, 2023
```
Fix bug in convert_t5x_checkpoint_to_pytorch.py
```
ca7912d1
Fix embarrassing typo in the doc chat template! (#26596) · 8b03615b
Matt authored Oct 04, 2023

8b03615b
Add # Copied from statements to audio feature extractors that use the floats_list function (#26581) · 9deb18ca
dg845 authored Oct 04, 2023
```
Add # Copied from statements to audio feature extractors that use the floats_list function.
```
9deb18ca
[Mistral] Update config docstring (#26593) · 0a49f909
Sanchit Gandhi authored Oct 04, 2023
```
* fix copies

* fix missing docstring

* make style

* oops
```
0a49f909

refactor: change default block_size (#26229) · 6015f91a

Phuc Van Phan authored Oct 04, 2023

* refactor: change default block_size

* fix: return tf to origin

* fix: change files to origin

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* refactor: add min block_size to files

* reformat: add min block_size for run_clm tf

6015f91a

Add add_generation_prompt argument to apply_chat_template (#26573) · 8b46c5bc

Matt authored Oct 04, 2023

* Add add_generation_prompt argument to apply_chat_template

* Add add_generation_prompt argument to apply_chat_template and update default templates

* Fix typo

* Add generation prompts section to chat templating guide

* Add generation prompts section to chat templating guide

* Minor style fix

8b46c5bc

Docstring check (#26052) · 03af4c42

Sylvain Gugger authored Oct 04, 2023



* Fix number of minimal calls to the Hub with peft integration

* Alternate design

* And this way?

* Revert

* Nits to fix

* Add util

* Print when changes are made

* Add list to ignore

* Add more rules

* Manual fixes

* deal with kwargs

* deal with enum defaults

* avoid many digits for floats

* Manual fixes

* Fix regex

* Fix regex

* Auto fix

* Style

* Apply script

* Add ignored list

* Add check that templates are filled

* Adding to CI checks

* Add back semi-fix

* Ignore more objects

* More auto-fixes

* Ignore missing objects

* Remove temp semi-fix

* Fixes

* Update src/transformers/models/pvt/configuration_pvt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update utils/check_docstrings.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/utils/quantization_config.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Deal with float defaults

* Fix small defaults

* Address review comment

* Treat

* Post-rebase cleanup

* Address review comment

* Update src/transformers/models/deprecated/mctct/configuration_mctct.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Address review comment

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

03af4c42

feat: add trainer label to wandb run upon initialization (#26466) · 122b2657
Bharat Ramanathan authored Oct 04, 2023

122b2657
Extend Trainer to enable Ascend NPU to use the fused Adamw optimizer when training (#26194) · 4fdf47cd
statelesshz authored Oct 04, 2023

4fdf47cd

Bump pillow from 9.3.0 to 10.0.1 in /examples/research_projects/decision_transformer (#26580) · fc296f41

dependabot[bot] authored Oct 04, 2023

Bump pillow in /examples/research_projects/decision_transformer

Bumps [pillow](https://github.com/python-pillow/Pillow) from 9.3.0 to 10.0.1.
- [Release notes](https://github.com/python-pillow/Pillow/releases)
- [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst)
- [Commits](https://github.com/python-pillow/Pillow/compare/9.3.0...10.0.1

)

---
updated-dependencies:
- dependency-name: pillow
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

fc296f41

03 Oct, 2023 3 commits

docs: feat: add clip notebook resources from OSSCA community (#26505) · 2f3ea08a
김준재_T3056 authored Oct 04, 2023

2f3ea08a
[Tokenizers] Skip tests temporarily (#26574) · 5c66378c
Lysandre Debut authored Oct 03, 2023
```
* Skip tests temporarily

* style

* Add additional test
```
5c66378c

🌐

[i18n-KO] Translated `semantic_segmentation.md` to Korean (#26515) · 2c7b26f5

Jungnerd authored Oct 04, 2023



* docs: ko: sementic_segmentation.md

* feat: manual draft

* fix: manual edits

* fix: resolve suggestions
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* fix: resolve suggestions
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix: edit the title

---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

2c7b26f5