Commits · 72256bc72ac2f2e341da47b9aea57e2e37879700 · chenpangpang / transformers

12 Oct, 2023 2 commits
- Fix `PersimmonIntegrationTest` OOM (#26750) · 72256bc7
  Yih-Dar authored Oct 12, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  72256bc7
- Add many missing spaces in adjacent strings (#26751) · 40ea9ab2
  Tom Aarsen authored Oct 12, 2023
```
Add missing spaces in adjacent strings
```
  40ea9ab2
11 Oct, 2023 4 commits

[Assistant Generation] Improve Encoder Decoder (#26701) · da69de17

Patrick von Platen authored Oct 11, 2023

* [Assistant Generation] Improve enc dec

* save more

* Fix logit processor checks

* Clean

* make style

* fix deprecation

* fix generation test

* Apply suggestions from code review

* fix biogpt

* make style

da69de17

`Copied from` for test files (#26713) · 5334796d

Yih-Dar authored Oct 11, 2023



* copied statement for test files

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5334796d

In assisted decoding, pass model_kwargs to model's forward call (fix... · dcc49d8a

Billy Bradley authored Oct 11, 2023

In assisted decoding, pass model_kwargs to model's forward call (fix prepare_input_for_generation in all models) (#25242)

* In assisted decoding, pass model_kwargs to model's forward call

Previously, assisted decoding would ignore any additional kwargs
that it doesn't explicitly handle. This was inconsistent with other
generation methods, which pass the model_kwargs through
prepare_inputs_for_generation and forward the returned dict to the
model's forward call.

The prepare_inputs_for_generation method needs to be amended in all
models, as previously it only kept the last input ID when a past_key_values
was passed.

* Improve variable names in _extend_attention_mask

* Refactor extending token_type_ids into a function

* Replace deepcopy with copy to optimize performance

* Update new persimmon model with llama changes for assisted generation

* Update new mistral model for assisted generation with prepare_inputs_for_generation

* Update position_ids creation in falcon prepare_inputs_for_generation to support assisted generation

dcc49d8a

Make Whisper Encoder's sinusoidal PE non-trainable by default (#26032) · 1e3c9dda

Thien Tran authored Oct 11, 2023



* set encoder's PE as non-trainable

* freeze flax

* init sinusoids

* add test for non-trainable embed positions

* simplify TF encoder embed_pos

* revert tf

* clean up

* add sinusoidal init for jax

* make consistent sinusoidal function

* fix dtype

* add default dtype

* use numpy for sinusoids. fix jax

* add sinusoid init for TF

* fix

* use custom embedding

* use specialized init for each impl

* fix sinusoids init. add test for pytorch

* fix TF dtype

* simplify sinusoid init for flax and tf

* add tests for TF

* change default dtype to float32

* add sinusoid test for flax

* Update src/transformers/models/whisper/modeling_flax_whisper.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* move sinusoidal init to _init_weights

---------
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

1e3c9dda

09 Oct, 2023 1 commit
- Fixed malapropism error (#26660) · 86a4e5a9
  Shreyas S authored Oct 09, 2023
```
Update test_integration.py

Fixed malapropism clone>copy
```
  86a4e5a9
06 Oct, 2023 6 commits

[`LlamaTokenizerFast`] Adds edge cases for the template processor (#26606) · 9ad815e4

Arthur authored Oct 06, 2023

* make sure eos and bos are properly handled for fast tokenizer

* fix code llama as well

* nits

* fix the conversion script as well

* fix failing test

9ad815e4

remove SharedDDP as it is deprecated (#25702) · 27597fea

statelesshz authored Oct 06, 2023



* remove SharedDDP as it was drepracated

* apply review suggestion

* make style

* Oops,forgot to remove the compute_loss context manager in Seq2SeqTrainer.

* remove the unnecessary conditional statement

* keep the logic of IPEX

* clean code

* mix precision setup & make fixup

---------
Co-authored-by: statelesshz <jihuazhong1@huawei.com>

27597fea

Fix failing `MusicgenTest .test_pipeline_text_to_audio` (#26586) · e840aa67
Yih-Dar authored Oct 06, 2023
```
* fix

* fix

* Fix

* Fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
e840aa67

Remove unnecessary unsqueeze - squeeze in rotary positional embedding (#26162) · 64845307

fxmarty authored Oct 06, 2023

* remove unnecessary unsqueeze-squeeze in llama

* correct other models

* fix

* revert gpt_neox_japanese

* fix copie

* fix test

64845307

Update tokenization_code_llama_fast.py (#26576) · 65aabafe

Tianqi Liu authored Oct 06, 2023

* Update tokenization_code_llama_fast.py

* Update test_tokenization_code_llama.py

* Update test_tokenization_code_llama.py

65aabafe

Fixed inconsistency in several fast tokenizers (#26561) · af38c837
Towdo authored Oct 06, 2023

af38c837

05 Oct, 2023 3 commits

#26566 swin2 sr allow in out channels (#26568) · 0a3b9d02

Marvin Gabler authored Oct 05, 2023



* feat: close #26566, changed model & config files to accept arbitary in and out channels

* updated docstrings

* fix: linter error

* fix: update Copy docstrings

* fix: linter update

* fix: rename num_channels_in to num_channels to prevent breaking changes

* fix: make num_channels_out None per default

* Update src/transformers/models/swin2sr/configuration_swin2sr.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix: update tests to include num_channels_out

* fix:linter

* fix: remove normalization with precomputed rgb values when #input_channels!=#output_channels

---------
Co-authored-by: marvingabler <marvingabler@outlook.de>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

0a3b9d02

[`core`] fix silent bug `keep_in_fp32` modules (#26589) · e6d250e4
Younes Belkada authored Oct 05, 2023
```
* fix silent bug `keep_in_fp32` modules

* final fix

* added a common test.

* Trigger CI

* revert
```
e6d250e4
Fix failing tests on `main` due to torch 2.1 (#26607) · 54e17a15
Yih-Dar authored Oct 05, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
54e17a15

04 Oct, 2023 3 commits

skip flaky hub tests (#26594) · c037b2e3
Arthur authored Oct 04, 2023
```
skip flaky
```
c037b2e3
Add # Copied from statements to audio feature extractors that use the floats_list function (#26581) · 9deb18ca
dg845 authored Oct 04, 2023
```
Add # Copied from statements to audio feature extractors that use the floats_list function.
```
9deb18ca

Docstring check (#26052) · 03af4c42

Sylvain Gugger authored Oct 04, 2023



* Fix number of minimal calls to the Hub with peft integration

* Alternate design

* And this way?

* Revert

* Nits to fix

* Add util

* Print when changes are made

* Add list to ignore

* Add more rules

* Manual fixes

* deal with kwargs

* deal with enum defaults

* avoid many digits for floats

* Manual fixes

* Fix regex

* Fix regex

* Auto fix

* Style

* Apply script

* Add ignored list

* Add check that templates are filled

* Adding to CI checks

* Add back semi-fix

* Ignore more objects

* More auto-fixes

* Ignore missing objects

* Remove temp semi-fix

* Fixes

* Update src/transformers/models/pvt/configuration_pvt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update utils/check_docstrings.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/utils/quantization_config.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Deal with float defaults

* Fix small defaults

* Address review comment

* Treat

* Post-rebase cleanup

* Address review comment

* Update src/transformers/models/deprecated/mctct/configuration_mctct.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Address review comment

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

03af4c42

03 Oct, 2023 6 commits

[Tokenizers] Skip tests temporarily (#26574) · 5c66378c
Lysandre Debut authored Oct 03, 2023
```
* Skip tests temporarily

* style

* Add additional test
```
5c66378c
[Whisper] Allow basic text normalization (#26149) · 57f44dc4
Sanchit Gandhi authored Oct 03, 2023
```
* [Whisper] Allow basic text normalization

* up

* style copies
```
57f44dc4

[`PEFT`] Final fixes (#26559) · 2aef9a96

Younes Belkada authored Oct 03, 2023

* fix issues with PEFT

* logger warning futurewarning issues

* fixup

* adapt from suggestions

* oops

* rm test

2aef9a96

[`Mistral`] Add Flash Attention-2 support for `mistral` (#26464) · ae9a344c

Younes Belkada authored Oct 03, 2023



* add FA-2 support for mistral

* fixup

* add sliding windows

* fixing few nits

* v1 slicing cache - logits do not match

* add comment

* fix bugs

* more mem efficient

* add warning once

* add warning once

* oops

* fixup

* more comments

* copy

* add safety checker

* fixup

* Update src/transformers/models/mistral/modeling_mistral.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* copied from

* up

* raise when padding side is right

* fixup

* add doc + few minor changes

* fixup

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

ae9a344c

[Wav2Vec2 and Co] Update init tests for PT 2.1 (#26494) · 768aa3d9
Sanchit Gandhi authored Oct 03, 2023

768aa3d9

Add tokenizer kwargs to fill mask pipeline. (#26234) · b5ca8fcd

Nathan Cahill authored Oct 03, 2023



* add tokenizer kwarg inputs

* Adding tokenizer_kwargs to _sanitize_parameters

* Add truncation=True example to tests

* Update test_pipelines_fill_mask.py

* Update test_pipelines_fill_mask.py

* make fix-copies and make style

* Update fill_mask.py

Replace single tick with double

* make fix-copies

* Style

---------
Co-authored-by: Lysandre <lysandre@huggingface.co>

b5ca8fcd

02 Oct, 2023 4 commits

Code-llama-nit (#26300) · bab33319

Arthur authored Oct 02, 2023

* fix encoding when the fill token is None

* add tests and edge cases

* fiuxp

* Update tests/models/code_llama/test_tokenization_code_llama.py

bab33319

Fix model integration ci (#26322) · 63864e05

Arthur authored Oct 02, 2023

* fix wav2vec2

* nit

* stash

* one more file to update

* fix byt5

* vocab size is 256, don't change that!

* use other revision

* test persimon in smaller size

* style

* tests

* nits

* update add tokens from pretrained

* test tokenization

* nits

* potential fnet fix?

* more nits

* nits

* correct test

* assert close

* udpate

* ouch

* fix it

* some more nits

* FINALLU

* use `adept` checkpoints

* more adept checkpoints

* that was invlved!

63864e05

[`core`/ `auto` ] Fix bnb test with code revision + bug with code revision (#26431) · 6824461f

Younes Belkada authored Oct 02, 2023

* fix bnb test with code revision

* fix test

* Apply suggestions from code review

* Update src/transformers/models/auto/auto_factory.py

* Update src/transformers/models/auto/auto_factory.py

* Update src/transformers/models/auto/auto_factory.py

6824461f

Revert falcon exception (#26472) · 67239f73

Lysandre Debut authored Oct 02, 2023

* Revert "Falcon: fix revision propagation (#26006)"

This reverts commit 118c676ef3124423e5d062b665f05cde55bc9a90.

* Revert "Put Falcon back (#25960)"

This reverts commit 22a69f1d.

67239f73

29 Sep, 2023 2 commits
- Avoid all-zeor attnetion mask used in testing (#26469) · 39117744
  Yih-Dar authored Sep 29, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  39117744
- Skip 2 failing persimmon pipeline tests for now (#26485) · 9b23d0de
  Yih-Dar authored Sep 29, 2023
```
skip
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  9b23d0de
28 Sep, 2023 2 commits

fix_mbart_tied_weights (#26422) · 5e11d72d
Marc Sun authored Sep 28, 2023
```
* fix_mbart_tied_weights

* add test
```
5e11d72d

[`PEFT`] introducing `adapter_kwargs` for loading adapters from different Hub... · 38e96324

Younes Belkada authored Sep 28, 2023


[`PEFT`] introducing `adapter_kwargs` for loading adapters from different Hub location (`subfolder`, `revision`) than the base model (#26270)

* make use of adapter_revision

* v1 adapter kwargs

* fix CI

* fix CI

* fix CI

* fixup

* add BC

* Update src/transformers/integrations/peft.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup

* change it to error

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py

* fixup

* change

* Update src/transformers/integrations/peft.py

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

38e96324

27 Sep, 2023 4 commits

[Mistral] Mistral-7B-v0.1 support (#26447) · 72958fcd

Chris Bamford authored Sep 27, 2023



* [Mistral] Mistral-7B-v0.1 support

* fixing names

* slightly longer test

* fixups

* not_doctested

* wrongly formatted references

* make fixuped

---------
Co-authored-by: Timothee Lacroix <t@eugen.ai>
Co-authored-by: timlacroix <t@mistral.ai>

72958fcd

[`PEFT`] Fix PEFT multi adapters support (#26407) · 3ca18d6d

Younes Belkada authored Sep 27, 2023



* fix PEFT multi adapters support

* refactor a bit

* save pretrained + BC + added tests

* Update src/transformers/integrations/peft.py
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

* add more tests

* add suggestion

* final changes

* adapt a bit

* fixup

* Update src/transformers/integrations/peft.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* adapt from suggestions

---------
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

3ca18d6d

[`FA` / `tests`] Add use_cache tests for FA models (#26415) · 153755ee
Younes Belkada authored Sep 27, 2023
```
* add use_cache tests for FA

* fixup
```
153755ee
Fix padding for IDEFICS (#26396) · abd25310
Shauray Singh authored Sep 27, 2023
```
* fix

* fixup

* tests

* fixup
```
abd25310

26 Sep, 2023 2 commits

added support for gradient checkpointing in ESM models (#26386) · 6ce6a5ad
sanjeevk-os authored Sep 26, 2023

6ce6a5ad

Add Nougat (#25942) · ace74d16

NielsRogge authored Sep 26, 2023



* Add conversion script

* Add NougatImageProcessor

* Add crop margin

* More improvements

* Add docs, READMEs

* Remove print statements

* Include model_max_length

* Add NougatTokenizerFast

* Fix imports

* Improve postprocessing

* Improve image processor

* Fix image processor

* Improve normalize method

* More improvements

* More improvements

* Add processor, improve docs

* Simplify fast tokenizer

* Remove test file

* Fix docstrings

* Use NougatProcessor in conversion script

* Add is_levensthein_available

* Add tokenizer tests

* More improvements

* Use numpy instead of opencv

* Add is_cv2_available

* Fix cv2_available

* Add is_nltk_available

* Add image processor tests, improve crop_margin

* Add integration tests

* Improve integration test

* Use do_rescale instead of hacks, thanks Amy

* Remove random_padding

* Address comments

* Address more comments

* Add import

* Address more comments

* Address more comments

* Address comment

* Address comment

* Set max_model_input_sizes

* Add tests

* Add requires_backends

* Add Nougat to exotic tests

* Use to_pil_image

* Address comment regarding nltk

* Add NLTK

* Improve variable names, integration test

* Add test

* refactor, document, and test regexes

* remove named capture groups, add comments

* format

* add non-markdown fixed tokenization

* format

* correct flakyness of args parse

* add regex comments

* test functionalities for crop_image, align long axis and expected output

* add regex tests

* remove cv2 dependency

* test crop_margin equality between cv2 and python

* refactor table regexes to markdown

add newline

* change print to log, improve doc

* fix high count tables correction

* address PR comments: naming, linting, asserts

* Address comments

* Add copied from

* Update conversion script

* Update conversion script to convert both small and base versions

* Add inference example

* Add more info

* Fix style

* Add require annotators to test

* Define all keyword arguments explicitly

* Move cv2 annotator

* Add tokenizer init method

* Transfer checkpoints

* Add reference to Donut

* Address comments

* Skip test

* Remove cv2 method

* Add copied from statements

* Use cached_property

* Fix docstring

* Add file to not doctested

---------
Co-authored-by: Pablo Montalvo <pablo.montalvo.leroux@gmail.com>

ace74d16

25 Sep, 2023 1 commit

Update tiny model information and pipeline tests (#26285) · d9e4bc28

Yih-Dar authored Sep 25, 2023



* Update tiny model summary file

* add to pipeline tests

* revert

* fix import

* fix import

* fix

* fix

* update

* update

* update

* fix

* remove BarkModelTest

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d9e4bc28