Commits · 65aabafe2ff2735f6351b23983f4cec45dbb134c · chenpangpang / transformers

06 Oct, 2023 3 commits

Update tokenization_code_llama_fast.py (#26576) · 65aabafe

Tianqi Liu authored Oct 06, 2023

* Update tokenization_code_llama_fast.py

* Update test_tokenization_code_llama.py

* Update test_tokenization_code_llama.py

65aabafe

Fixed inconsistency in several fast tokenizers (#26561) · af38c837
Towdo authored Oct 06, 2023

af38c837

Remove unnecessary `view`s of `position_ids` (#26059) · 8878eb1b

Ramiro Leal-Cavazos authored Oct 06, 2023

* Remove unnecessary `view` of `position_ids` in `modeling_llama`

When `position_ids` is `None`, its value is generated using
`torch.arange`, which creates a tensor of size `(seq_length +
past_key_values_length) - past_key_values_length = seq_length`. The
tensor is then unsqueezed, resulting in a tensor of shape `(1,
seq_length)`. This means that the last `view` to a tensor of shape
`(-1, seq_length)` is a no-op.

This commit removes the unnecessary view.

* Remove no-op `view` of `position_ids` in rest of transformer models
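
The shape argument in the first bullet above can be checked with a small sketch. This is a plain-Python model of the shape arithmetic only, not the library's code: `arange_length` and `resolve_view_shape` are hypothetical helpers introduced for illustration, and the sizes 5 and 3 are arbitrary example values.

```python
def arange_length(start: int, stop: int) -> int:
    """Number of elements in the integer range [start, stop)."""
    return max(stop - start, 0)


def resolve_view_shape(numel: int, shape: tuple) -> tuple:
    """Resolve at most one -1 in `shape` for a tensor holding `numel` elements."""
    if shape.count(-1) > 1:
        raise ValueError("only one dimension can be inferred")
    known = 1
    for dim in shape:
        if dim != -1:
            known *= dim
    if -1 in shape:
        if known == 0 or numel % known != 0:
            raise ValueError("shape is incompatible with the element count")
        return tuple(numel // known if dim == -1 else dim for dim in shape)
    if known != numel:
        raise ValueError("shape is incompatible with the element count")
    return tuple(shape)


# Arbitrary example sizes (assumptions, not values from the commit).
seq_length = 5
past_key_values_length = 3

# Models arange(past_key_values_length, seq_length + past_key_values_length).
numel = arange_length(past_key_values_length, seq_length + past_key_values_length)

# Shape after unsqueezing a leading dimension.
unsqueezed_shape = (1, numel)

# Shape a view to (-1, seq_length) would produce.
viewed_shape = resolve_view_shape(numel, (-1, seq_length))

print(unsqueezed_shape, viewed_shape)
```

Both shapes come out identical, which is why the trailing `view` changes nothing in the case where `position_ids` is generated internally.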

8878eb1b

05 Oct, 2023 11 commits

Don't install `pytorch-quantization` in Doc Builder docker file (#26622) · 75a33d60
Yih-Dar authored Oct 05, 2023
```
Fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
75a33d60

[docs] Update to scripts building index.md (#26546) · 18fbeec8

Maria Khalusova authored Oct 05, 2023

* build the table in index.md with links to the model_doc

* removed list generation on index.md

* fixed missing models

* make style

18fbeec8

Fix `transformers-pytorch-gpu` docker build (#26615) · 9d206012
Yih-Dar authored Oct 05, 2023
```
Fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
9d206012
Don't close ClearML task if it was created externally (#26614) · 9e78c9ac
eajechiloae authored Oct 05, 2023
```
don't close clearml task if it was created externally
```
9e78c9ac

#26566 swin2 sr allow in out channels (#26568) · 0a3b9d02

Marvin Gabler authored Oct 05, 2023



* feat: close #26566, changed model & config files to accept arbitrary in and out channels

* updated docstrings

* fix: linter error

* fix: update Copy docstrings

* fix: linter update

* fix: rename num_channels_in to num_channels to prevent breaking changes

* fix: make num_channels_out None per default

* Update src/transformers/models/swin2sr/configuration_swin2sr.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix: update tests to include num_channels_out

* fix: linter

* fix: remove normalization with precomputed rgb values when #input_channels!=#output_channels

---------
Co-authored-by: marvingabler <marvingabler@outlook.de>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

0a3b9d02

[`core`] fix silent bug `keep_in_fp32` modules (#26589) · e6d250e4
Younes Belkada authored Oct 05, 2023
```
* fix silent bug `keep_in_fp32` modules

* final fix

* added a common test.

* Trigger CI

* revert
```
e6d250e4

Make `ModelOutput` serializable (#26493) · 19f0b7dd

Charles Bensimon authored Oct 05, 2023

* Make `ModelOutput` serializable

Original PR from diffusers : https://github.com/huggingface/diffusers/pull/5234

* Black

19f0b7dd

Fix failing tests on `main` due to torch 2.1 (#26607) · 54e17a15
Yih-Dar authored Oct 05, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
54e17a15
[Falcon] Set `use_cache=False` before creating `presents` which relies on `use_cache` (#26328) · 2ab76c2c
Yun Dai authored Oct 05, 2023
```
* Set `presents=None` when `use_cache` is set to False for activation ckpt

* Update modeling_falcon.py

* fix black
```
2ab76c2c

[`GPTNeoX`] Faster rotary embedding for GPTNeoX (based on llama changes) (#25830) · 253f9a3f

Arthur authored Oct 05, 2023

* Faster rotary embedding for GPTNeoX

* there might be unnecessary moves from device

* fixup

* fix dtype issue

* add copied from statements

* fix copies

* oupsy

* add copied from Llama for scaled ones as well

* fixup

* fix

* fix copies

253f9a3f

[ `NougatProcessor`] Fix the default channel (#26608) · b4e66d7a
Arthur authored Oct 05, 2023
```
fix
```
b4e66d7a

04 Oct, 2023 14 commits

add zh translation for installation (#26084) · 43bfd093
Yeyang authored Oct 05, 2023
```
* translate installation to zh

* fix translation typo
```
43bfd093
[Wav2Vec2] Fix tokenizer set lang (#26349) · 2d8ee981
Sanchit Gandhi authored Oct 04, 2023
```
* fix wav2vec2 doctest

* suggestion

* fix

* final fix

* revert since we need AddedTokens
```
2d8ee981
Update mistral.md to update 404 link (#26590) · f9ab07f9
Galland authored Oct 04, 2023

f9ab07f9
skip flaky hub tests (#26594) · c037b2e3
Arthur authored Oct 04, 2023
```
skip flaky
```
c037b2e3
Fix encoder->decoder typo bug in convert_t5x_checkpoint_to_pytorch.py (#26587) · ca7912d1
Soyoung Yoon authored Oct 05, 2023
```
Fix bug in convert_t5x_checkpoint_to_pytorch.py
```
ca7912d1
Fix embarrassing typo in the doc chat template! (#26596) · 8b03615b
Matt authored Oct 04, 2023

8b03615b
Add # Copied from statements to audio feature extractors that use the floats_list function (#26581) · 9deb18ca
dg845 authored Oct 04, 2023
```
Add # Copied from statements to audio feature extractors that use the floats_list function.
```
9deb18ca
[Mistral] Update config docstring (#26593) · 0a49f909
Sanchit Gandhi authored Oct 04, 2023
```
* fix copies

* fix missing docstring

* make style

* oops
```
0a49f909

refactor: change default block_size (#26229) · 6015f91a

Phuc Van Phan authored Oct 04, 2023

* refactor: change default block_size

* fix: return tf to origin

* fix: change files to origin

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* refactor: add min block_size to files

* reformat: add min block_size for run_clm tf

6015f91a

Add add_generation_prompt argument to apply_chat_template (#26573) · 8b46c5bc

Matt authored Oct 04, 2023

* Add add_generation_prompt argument to apply_chat_template

* Add add_generation_prompt argument to apply_chat_template and update default templates

* Fix typo

* Add generation prompts section to chat templating guide

* Add generation prompts section to chat templating guide

* Minor style fix

8b46c5bc

Docstring check (#26052) · 03af4c42

Sylvain Gugger authored Oct 04, 2023



* Fix number of minimal calls to the Hub with peft integration

* Alternate design

* And this way?

* Revert

* Nits to fix

* Add util

* Print when changes are made

* Add list to ignore

* Add more rules

* Manual fixes

* deal with kwargs

* deal with enum defaults

* avoid many digits for floats

* Manual fixes

* Fix regex

* Fix regex

* Auto fix

* Style

* Apply script

* Add ignored list

* Add check that templates are filled

* Adding to CI checks

* Add back semi-fix

* Ignore more objects

* More auto-fixes

* Ignore missing objects

* Remove temp semi-fix

* Fixes

* Update src/transformers/models/pvt/configuration_pvt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update utils/check_docstrings.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/utils/quantization_config.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Deal with float defaults

* Fix small defaults

* Address review comment

* Treat

* Post-rebase cleanup

* Address review comment

* Update src/transformers/models/deprecated/mctct/configuration_mctct.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Address review comment

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

03af4c42

feat: add trainer label to wandb run upon initialization (#26466) · 122b2657
Bharat Ramanathan authored Oct 04, 2023

122b2657
Extend Trainer to enable Ascend NPU to use the fused Adamw optimizer when training (#26194) · 4fdf47cd
statelesshz authored Oct 04, 2023

4fdf47cd

Bump pillow from 9.3.0 to 10.0.1 in /examples/research_projects/decision_transformer (#26580) · fc296f41

dependabot[bot] authored Oct 04, 2023

Bump pillow in /examples/research_projects/decision_transformer

Bumps [pillow](https://github.com/python-pillow/Pillow) from 9.3.0 to 10.0.1.
- [Release notes](https://github.com/python-pillow/Pillow/releases)
- [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst)
- [Commits](https://github.com/python-pillow/Pillow/compare/9.3.0...10.0.1)

---
updated-dependencies:
- dependency-name: pillow
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

fc296f41

03 Oct, 2023 12 commits

docs: feat: add clip notebook resources from OSSCA community (#26505) · 2f3ea08a
김준재_T3056 authored Oct 04, 2023

2f3ea08a
[Tokenizers] Skip tests temporarily (#26574) · 5c66378c
Lysandre Debut authored Oct 03, 2023
```
* Skip tests temporarily

* style

* Add additional test
```
5c66378c

🌐 [i18n-KO] Translated `semantic_segmentation.md` to Korean (#26515) · 2c7b26f5

Jungnerd authored Oct 04, 2023



* docs: ko: semantic_segmentation.md

* feat: manual draft

* fix: manual edits

* fix: resolve suggestions
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* fix: resolve suggestions
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix: edit the title

---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

2c7b26f5

[Whisper] Allow basic text normalization (#26149) · 57f44dc4
Sanchit Gandhi authored Oct 03, 2023
```
* [Whisper] Allow basic text normalization

* up

* style copies
```
57f44dc4
v4.35.0.dev0 · bd620591
Lysandre authored Oct 03, 2023

bd620591

[`Nougat`] from transformers import * (#26562) · c26b2a29

Arthur authored Oct 03, 2023



* remove unprotected import to PIL

* cleanup

---------
Co-authored-by: Lysandre <lysandre@huggingface.co>

c26b2a29

[`PEFT`] Final fixes (#26559) · 2aef9a96

Younes Belkada authored Oct 03, 2023

* fix issues with PEFT

* logger warning futurewarning issues

* fixup

* adapt from suggestions

* oops

* rm test

2aef9a96

[`Mistral`] Add Flash Attention-2 support for `mistral` (#26464) · ae9a344c

Younes Belkada authored Oct 03, 2023



* add FA-2 support for mistral

* fixup

* add sliding windows

* fixing few nits

* v1 slicing cache - logits do not match

* add comment

* fix bugs

* more mem efficient

* add warning once

* add warning once

* oops

* fixup

* more comments

* copy

* add safety checker

* fixup

* Update src/transformers/models/mistral/modeling_mistral.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* copied from

* up

* raise when padding side is right

* fixup

* add doc + few minor changes

* fixup

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

ae9a344c

Nit-added-tokens (#26538) · 1a2e966c

Arthur authored Oct 03, 2023

* fix stripping

* nits

* fix another test

* styling

* fix?

* update

* revert bad merge

* found the bug

* YES SIR

* is that change really required?

* make fast even faster

* re order functions

1a2e966c

[Doctest] Add `configuration_encoder_decoder.py` (#26519) · 245da7ed

Srijan Sahay Srivastava authored Oct 03, 2023

* [Doctest] Add configuration_encoder_decoder.py

Added configuration_encoder_decoder.py to utils/documentation_tests.txt for doctest

* Revert "[Doctest] Add configuration_encoder_decoder.py"

This reverts commit bd653535a4356dc3c9f43e65883819079a2053b0.

* [Doctest] Add configuration_encoder_decoder.py

add configuration_encoder_decoder.py to utils/documentation_tests.txt

* [Doctest] Add configuration_encoder_decoder.py

add configuration_encoder_decoder.py to utils/documentation_tests.txt

* [Doctest] Add configuration_encoder_decoder.py

add configuration_encoder_decoder.py to utils/documentation_tests.txt

* changed as per request

* fixed line 46

245da7ed

[AMD] Add initial version for run_tests_multi_gpu (#26346) · 3632fb3c

Funtowicz Morgan authored Oct 03, 2023



* Add initial version for run_tests_multi_gpu

* Trigger change in BERT

* fix typo setup -> setup_gpu

* Add tag mi210

* Enable multi-gpu jobs

* One more

* Use dynamic device allocation

* Attempt to fix syntax for docker create

* fix script path

* fix

* temp machine type

* fix label

* Enable multi-gpu tests

* Rename multi-amd-gpu to multi-gpu

* Let's not be lazy dude

* Update rocm-smi output

* Add gpu_flavour in the matrix

* Fix typos

* merge single/multi dispatch into the matrix

* Format.

* Revert BERT's change

---------
Co-authored-by: Guillaume LEGENDRE <glegendre01@gmail.com>

3632fb3c

[Wav2Vec2 and Co] Update init tests for PT 2.1 (#26494) · 768aa3d9
Sanchit Gandhi authored Oct 03, 2023

768aa3d9