- 04 Oct, 2023 3 commits
-
Sylvain Gugger authored
* Fix number of minimal calls to the Hub with peft integration
* Alternate design
* And this way?
* Revert
* Nits to fix
* Add util
* Print when changes are made
* Add list to ignore
* Add more rules
* Manual fixes
* deal with kwargs
* deal with enum defaults
* avoid many digits for floats
* Manual fixes
* Fix regex
* Fix regex
* Auto fix
* Style
* Apply script
* Add ignored list
* Add check that templates are filled
* Adding to CI checks
* Add back semi-fix
* Ignore more objects
* More auto-fixes
* Ignore missing objects
* Remove temp semi-fix
* Fixes
* Update src/transformers/models/pvt/configuration_pvt.py (Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>)
* Update utils/check_docstrings.py (Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>)
* Update src/transformers/utils/quantization_config.py (Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>)
* Deal with float defaults
* Fix small defaults
* Address review comment
* Treat
* Post-rebase cleanup
* Address review comment
* Update src/transformers/models/deprecated/mctct/configuration_mctct.py (Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>)
* Address review comment
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
-
Bharat Ramanathan authored
-
statelesshz authored
-
- 03 Oct, 2023 9 commits
-
Sanchit Gandhi authored
* [Whisper] Allow basic text normalization * up * style copies
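Presumably this surfaces on the tokenizer's `decode`; a hedged sketch, assuming the flag is named `basic_normalize` and using an illustrative checkpoint:

```python
from transformers import WhisperTokenizer

tokenizer = WhisperTokenizer.from_pretrained("openai/whisper-tiny")
ids = tokenizer("Hello, World!").input_ids
# basic_normalize (assumed name) applies the language-agnostic
# BasicTextNormalizer instead of the English-only normalizer
print(tokenizer.decode(ids, skip_special_tokens=True, basic_normalize=True))
```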
-
Lysandre authored
-
Arthur authored
* remove unprotected import of PIL * cleanup --------- Co-authored-by: Lysandre <lysandre@huggingface.co>
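The usual guard for an optional dependency looks like the sketch below, using the availability helper the library already exposes:

```python
from transformers.utils import is_vision_available

# Import PIL only when the vision extra is installed, so importing the
# module no longer crashes in PIL-free environments
if is_vision_available():
    from PIL import Image
```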
-
Younes Belkada authored
* fix issues with PEFT * logger warning futurewarning issues * fixup * adapt from suggestions * oops * rm test
-
Younes Belkada authored
* add FA-2 support for mistral
* fixup
* add sliding windows
* fixing few nits
* v1 slicing cache - logits do not match
* add comment
* fix bugs
* more mem efficient
* add warning once
* add warning once
* oops
* fixup
* more comments
* copy
* add safety checker
* fixup
* Update src/transformers/models/mistral/modeling_mistral.py (Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>)
* copied from
* up
* raise when padding side is right
* fixup
* add doc + few minor changes
* fixup
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
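A minimal sketch of turning the new path on, assuming a CUDA machine with flash-attn 2 installed (at this point the feature was exposed through the `use_flash_attention_2` flag):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
# Batched generation must use left padding: the FA-2 path raises
# when the padding side is right
tokenizer.padding_side = "left"
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # FA-2 requires half precision
    use_flash_attention_2=True,
)
```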
-
Arthur authored
* fix stripping * nits * fix another test * styling * fix? * update * revert bad merge * found the bug * YES SIR * is that change really required? * make fast even faster * reorder functions
-
Srijan Sahay Srivastava authored
* [Doctest] Add configuration_encoder_decoder.py: added configuration_encoder_decoder.py to utils/documentation_tests.txt for doctest
* Revert "[Doctest] Add configuration_encoder_decoder.py": this reverts commit bd653535a4356dc3c9f43e65883819079a2053b0.
* [Doctest] Add configuration_encoder_decoder.py: add configuration_encoder_decoder.py to utils/documentation_tests.txt
* [Doctest] Add configuration_encoder_decoder.py: add configuration_encoder_decoder.py to utils/documentation_tests.txt
* [Doctest] Add configuration_encoder_decoder.py: add configuration_encoder_decoder.py to utils/documentation_tests.txt
* changed as per request
* fixed line 46
-
Nathan Cahill authored
* add tokenizer kwarg inputs * Adding tokenizer_kwargs to _sanitize_parameters * Add truncation=True example to tests * Update test_pipelines_fill_mask.py * Update test_pipelines_fill_mask.py * make fix-copies and make style * Update fill_mask.py Replace single tick with double * make fix-copies * Style --------- Co-authored-by: Lysandre <lysandre@huggingface.co>
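A usage sketch mirroring the `truncation=True` example added to the tests (model name is illustrative):

```python
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="distilbert-base-uncased")
long_text = "Paris is the [MASK] of France. " + "Filler sentence. " * 400
# tokenizer_kwargs is forwarded to the tokenizer call, here truncating
# inputs that would otherwise exceed the model's maximum length
predictions = fill_mask(long_text, tokenizer_kwargs={"truncation": True})
print(predictions[0]["token_str"])
```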
-
Patrick von Platen authored
[Logging] Change warning to info
-
- 02 Oct, 2023 11 commits
-
Arthur authored
* add build_inputs_with_special_tokens to LlamaFast * fixup * Update src/transformers/models/llama/tokenization_llama_fast.py
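A short sketch of what the added method does on the fast class (tokenizer checkpoint is illustrative):

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("hf-internal-testing/llama-tokenizer")
ids = tok.convert_tokens_to_ids(tok.tokenize("hello world"))
# With the override in place, the fast tokenizer adds special tokens
# (e.g. the BOS id) the same way the slow LlamaTokenizer does
print(tok.build_inputs_with_special_tokens(ids))
```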
-
Arthur authored
* fix encoding when the fill token is None * add tests and edge cases * fixup * Update tests/models/code_llama/test_tokenization_code_llama.py
-
Adithya Hegde Kota authored
* [Doctest] Add configuration_roformer.py * [Doctest] Add configuration_roformer.py * [Doctest] Add configuration_roformer.py * [Doctest] Add configuration_roformer.py * Removed documentation_test.txt * Removed configuration_roformer.py * Update not_doctested.txt
-
Arthur authored
* fix stripping * remove some warnings and update some warnings * revert changes for other PR
-
Younes Belkada authored
Update modeling_utils.py
-
Arthur authored
* fix wav2vec2
* nit
* stash
* one more file to update
* fix byt5
* vocab size is 256, don't change that!
* use other revision
* test persimmon in smaller size
* style
* tests
* nits
* update add tokens from pretrained
* test tokenization
* nits
* potential fnet fix?
* more nits
* nits
* correct test
* assert close
* update
* ouch
* fix it
* some more nits
* FINALLY
* use `adept` checkpoints
* more adept checkpoints
* that was involved!
-
Younes Belkada authored
* try * nit * nits
-
marcmk6 authored
* fix issue where canine `forward` requires `input_ids` regardless: `forward` used `input_ids` to derive other variables in all cases; change it to use whichever of `input_ids` and `inputs_embeds` was given * fix canine forward: derive shapes from the provided input (`input_ids` or `inputs_embeds`) instead of always from `input_ids` * fix format * fix format
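The fix moves CANINE to the shape-derivation pattern used elsewhere in the library; a standalone sketch of that pattern (not the exact diff):

```python
from typing import Optional

import torch

def get_input_shape(
    input_ids: Optional[torch.Tensor], inputs_embeds: Optional[torch.Tensor]
) -> torch.Size:
    # Derive the batch shape from whichever input was actually provided,
    # instead of unconditionally reading input_ids
    if input_ids is not None and inputs_embeds is not None:
        raise ValueError("You cannot specify both input_ids and inputs_embeds")
    if input_ids is not None:
        return input_ids.size()
    if inputs_embeds is not None:
        return inputs_embeds.size()[:-1]
    raise ValueError("You have to specify either input_ids or inputs_embeds")

print(get_input_shape(None, torch.randn(2, 16, 768)))  # torch.Size([2, 16])
```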
-
Jan Philipp Harries authored
fix requests connection error Co-authored-by: Jan Philipp Harries <jphme@users.noreply.github.com>
-
Florian Seiler authored
* Fix num_heads in _upad_input The variable num_key_value_heads had incorrectly been named num_heads, which led to reshaping the query_layer with the wrong attention head count. (It would have been enough to use the correct variable self.num_heads instead of num_heads, but num_heads was renamed to num_key_value_heads for clarity) * fixed copies using make fix-copies and ran make fixup --------- Co-authored-by: fseiler <f.seiler@jerocom.de>
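A toy illustration of why the head count matters: with grouped-query attention the query and key/value tensors carry different head counts, so each must be reshaped with its own (a standalone sketch, not the actual `_upad_input`):

```python
import torch

batch, seq_len, head_dim = 2, 8, 64
num_heads, num_key_value_heads = 32, 8  # grouped-query attention

query = torch.randn(batch, seq_len, num_heads * head_dim)
key = torch.randn(batch, seq_len, num_key_value_heads * head_dim)

# Correct: each tensor is reshaped with its own head count; reshaping the
# query with num_key_value_heads (the fixed bug) would corrupt the layout
query = query.view(batch * seq_len, num_heads, head_dim)
key = key.view(batch * seq_len, num_key_value_heads, head_dim)
print(query.shape, key.shape)
```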
-
Lysandre Debut authored
* Revert "Falcon: fix revision propagation (#26006)" This reverts commit 118c676ef3124423e5d062b665f05cde55bc9a90. * Revert "Put Falcon back (#25960)" This reverts commit 22a69f1d.
-
- 29 Sep, 2023 2 commits
-
Sanchit Gandhi authored
* improve docs/errors
* why whisper
* Update docs/source/en/pipeline_tutorial.md (Co-authored-by: Lysandre Debut <hi@lysand.re>)
* specify pt only
---------
Co-authored-by: Lysandre Debut <hi@lysand.re>
-
Maria Khalusova authored
* navigation improvement between text generation pipelines and text generation docs * make style
-
- 28 Sep, 2023 9 commits
-
Sanchit Gandhi authored
make decoding faster
-
Amelie Schreiber authored
* Fixed in-place operation error in EsmEmbeddings * Fixed in-place operation error in EsmEmbeddings again --------- Co-authored-by: Schreiber-Finance <amelie.schreiber.finance@gmail.com>
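A standalone reproduction of this class of error in plain PyTorch (not the ESM code itself):

```python
import torch

x = torch.randn(3, 5, requires_grad=True)
y = torch.sigmoid(x)  # sigmoid's backward needs its own *output*
# y *= 2              # in-place: backward() would raise a RuntimeError
y = y * 2             # out-of-place keeps the autograd graph intact
y.sum().backward()
print(x.grad.shape)
```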
-
Marc Sun authored
* fix_mbart_tied_weights * add test
-
fleance authored
Do not warn about unexpected decoder weights when loading T5EncoderModel and LongT5EncoderModel (#26211)

Neither T5EncoderModel nor LongT5EncoderModel has any decoder layers, so loading a pretrained checkpoint such as t5-small warns about keys found in the checkpoint that do not exist in the model itself. To prevent this warning, r"decoder" has been added to _keys_to_ignore_on_load_unexpected for both T5EncoderModel and LongT5EncoderModel.
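The attribute can be inspected directly; a quick check of the described behavior:

```python
from transformers import LongT5EncoderModel, T5EncoderModel

# Unexpected checkpoint keys matching these regexes are now dropped quietly
# at load time instead of triggering a warning about unused decoder weights
print(T5EncoderModel._keys_to_ignore_on_load_unexpected)
print(LongT5EncoderModel._keys_to_ignore_on_load_unexpected)
```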
-
Younes Belkada authored
[`PEFT`] introducing `adapter_kwargs` for loading adapters from a different Hub location (`subfolder`, `revision`) than the base model (#26270)
* make use of adapter_revision
* v1 adapter kwargs
* fix CI
* fix CI
* fix CI
* fixup
* add BC
* Update src/transformers/integrations/peft.py (Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>)
* fixup
* change it to error
* Update src/transformers/modeling_utils.py
* Update src/transformers/modeling_utils.py
* fixup
* change
* Update src/transformers/integrations/peft.py
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
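A hedged sketch of the new argument (the adapter repo, revision, and subfolder below are hypothetical):

```python
from transformers import AutoModelForCausalLM

# adapter_kwargs routes Hub-location arguments (revision, subfolder, ...)
# to the adapter download, independently of where the base model lives
model = AutoModelForCausalLM.from_pretrained(
    "some-user/opt-350m-lora",  # hypothetical adapter repo
    adapter_kwargs={"revision": "v2", "subfolder": "checkpoint-500"},
)
```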
-
Fakhir Ali authored
* [VITS] Fix speaker_embed device mismatch - pass device arg to speaker_id tensor * [VITS] put speaker_embed on device when int * [VITS] device=self.device instead of self.embed_speaker.weight.device * [VITS] make tensor directly on device using torch.full()
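The final approach from the last bullet, as a standalone sketch:

```python
import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
speaker_id = 3
# Allocating directly on the target device avoids the CPU-vs-GPU mismatch
# hit when a plain CPU tensor met the on-device speaker embedding
speaker_ids = torch.full((1,), speaker_id, dtype=torch.long, device=device)
```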
-
Tanishq Abraham authored
* change mention of decoder_input_ids to input_ids and same with decoder_input_embeds * Style --------- Co-authored-by: Lysandre <lysandre@huggingface.co>
-
Yih-Dar authored
* fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Norm Inui authored
* optimize layoutv2, v3 for VRAM saving * reformat codes --------- Co-authored-by: NormXU <xunuo@datagrand.com>
-
- 27 Sep, 2023 5 commits
-
Chris Bamford authored
* [Mistral] Mistral-7B-v0.1 support
* fixing names
* slightly longer test
* fixups
* not_doctested
* wrongly formatted references
* make fixuped
---------
Co-authored-by: Timothee Lacroix <t@eugen.ai>
Co-authored-by: timlacroix <t@mistral.ai>
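A minimal generation sketch with the newly supported checkpoint (assumes enough memory for a 7B model):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("My favourite condiment is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```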
-
Younes Belkada authored
* fix PEFT multi adapters support
* refactor a bit
* save pretrained + BC + added tests
* Update src/transformers/integrations/peft.py (Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>)
* add more tests
* add suggestion
* final changes
* adapt a bit
* fixup
* Update src/transformers/integrations/peft.py (Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>)
* adapt from suggestions
---------
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
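A sketch of the multi-adapter flow these fixes target (adapter repo names are hypothetical):

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")
# Several adapters can be attached under distinct names...
model.load_adapter("some-user/opt-lora-a", adapter_name="a")  # hypothetical repo
model.load_adapter("some-user/opt-lora-b", adapter_name="b")  # hypothetical repo
# ...and switched at runtime
model.set_adapter("b")
```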
-
statelesshz authored
Co-authored-by: statelesshz <jihuazhong1@huawei.com>
-
Uri Alon authored
* Fixing tokenizer when tokenizers is not installed * Adding __repr__ function and repr=True in dataclass * Revert "Adding __repr__ function and repr=True in dataclass" This reverts commit 18839505d1cada3170ed623744d3e75008a18bdc.
-
Shauray Singh authored
* fix * fixup * tests * fixup
-
- 26 Sep, 2023 1 commit
-
Nathan Lambert authored
add rmsprop
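Presumably exposed like the other optimizers through `TrainingArguments.optim`; a hedged sketch, assuming the key is "rmsprop":

```python
from transformers import TrainingArguments

# Assumes the new optimizer is registered under the "rmsprop" key
args = TrainingArguments(output_dir="out", optim="rmsprop")
```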
-