Commits · b8f1cde931392551f74a9abef5d2724c3cbc2208 · chenpangpang / transformers

16 Oct, 2023 3 commits

Fix Mistral OOM again (#26847) · b8f1cde9
Yih-Dar authored Oct 16, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
b8f1cde9

[`Quantization`] Store the original dtype in the config as a private attribute

(#26761) · fd6a0ade

Younes Belkada authored Oct 16, 2023



* First step

* fix

* add adjustements for gptq

* change to `_pre_quantization_dtype`

* Update src/transformers/modeling_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix serialization

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

fd6a0ade

Conversation pipeline fixes (#26795) · 14b04b4b

Matt authored Oct 16, 2023

* Adjust length limits and allow naked conversation list inputs

* Adjust length limits and allow naked conversation list inputs

* Maybe use a slightly more reasonable limit than 1024

* Skip tests for old models that never supported this anyway

* Cleanup input docstrings

* More docstring cleanup + skip failing TF test

* Make fixup

14b04b4b

13 Oct, 2023 4 commits

Add OWLv2, bis (#26668) · 762af3e3

NielsRogge authored Oct 13, 2023

* First draft

* Update conversion script

* Update copied from statements

* Fix style

* Add copied from to config

* Add copied from to processor

* Run make fixup

* Add docstring

* Update docstrings

* Add method

* Improve docstrings

* Fix docstrings

* Improve docstrings

* Remove onnx

* Add flag

* Address comments

* Add copied from to model tests

* Add flag to conversion script

* Add code snippet

* Address more comments

* Address comment

* Improve conversion script

* More improvements

* Add expected objectness logits

* Skip test

* Improve conversion script

* Extend conversion script

* Convert large checkpoint

* Fix doc tests

* Convert all checkpoints, update integration tests

* Add checkpoint_path arg

* Fix repo_id

762af3e3

Fix Falcon generation test (#26770) · bdb391e9
Matt authored Oct 13, 2023

bdb391e9
Disable default system prompt for LLaMA (#26765) · c9785d95
Matt authored Oct 13, 2023
```
* Disable default system prompt for LLaMA

* Update test to not expect default prompt
```
c9785d95
Update expect outputs of `IdeficsProcessorTest.test_tokenizer_padding` (#26779) · 21da3b24
Yih-Dar authored Oct 13, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
21da3b24

12 Oct, 2023 6 commits
- Skip `TrainerIntegrationFSDP::test_basic_run_with_cpu_offload` if `torch < 2.1` (#26764) · 3e93dd29
  Yih-Dar authored Oct 12, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  3e93dd29
- chore: fix typos (#26756) · 883ed4b3
  Heinz-Alexander Fuetterer authored Oct 12, 2023
  
  883ed4b3
- Fix `PerceiverModelIntegrationTest::test_inference_masked_lm` (#26760) · a243cdca
  Yih-Dar authored Oct 12, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  a243cdca
- Fix `MistralIntegrationTest` OOM (#26754) · db5e0c32
  Yih-Dar authored Oct 12, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  db5e0c32
- Fix `PersimmonIntegrationTest` OOM (#26750) · 72256bc7
  Yih-Dar authored Oct 12, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  72256bc7
- Add many missing spaces in adjacent strings (#26751) · 40ea9ab2
  Tom Aarsen authored Oct 12, 2023
```
Add missing spaces in adjacent strings
```
  40ea9ab2
11 Oct, 2023 4 commits

[Assistant Generation] Improve Encoder Decoder (#26701) · da69de17

Patrick von Platen authored Oct 11, 2023

* [Assistant Generation] Improve enc dec

* save more

* Fix logit processor checks

* Clean

* make style

* fix deprecation

* fix generation test

* Apply suggestions from code review

* fix biogpt

* make style

da69de17

`Copied from` for test files (#26713) · 5334796d

Yih-Dar authored Oct 11, 2023



* copied statement for test files

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5334796d

In assisted decoding, pass model_kwargs to model's forward call (fix... · dcc49d8a

Billy Bradley authored Oct 11, 2023

In assisted decoding, pass model_kwargs to model's forward call (fix prepare_input_for_generation in all models) (#25242)

* In assisted decoding, pass model_kwargs to model's forward call

Previously, assisted decoding would ignore any additional kwargs
that it doesn't explicitly handle. This was inconsistent with other
generation methods, which pass the model_kwargs through
prepare_inputs_for_generation and forward the returned dict to the
model's forward call.

The prepare_inputs_for_generation method needs to be amended in all
models, as previously it only kept the last input ID when a past_key_values
was passed.

* Improve variable names in _extend_attention_mask

* Refactor extending token_type_ids into a function

* Replace deepcopy with copy to optimize performance

* Update new persimmon model with llama changes for assisted generation

* Update new mistral model for assisted generation with prepare_inputs_for_generation

* Update position_ids creation in falcon prepare_inputs_for_generation to support assisted generation

dcc49d8a

Make Whisper Encoder's sinusoidal PE non-trainable by default (#26032) · 1e3c9dda

Thien Tran authored Oct 11, 2023



* set encoder's PE as non-trainable

* freeze flax

* init sinusoids

* add test for non-trainable embed positions

* simplify TF encoder embed_pos

* revert tf

* clean up

* add sinusoidal init for jax

* make consistent sinusoidal function

* fix dtype

* add default dtype

* use numpy for sinusoids. fix jax

* add sinusoid init for TF

* fix

* use custom embedding

* use specialized init for each impl

* fix sinusoids init. add test for pytorch

* fix TF dtype

* simplify sinusoid init for flax and tf

* add tests for TF

* change default dtype to float32

* add sinusoid test for flax

* Update src/transformers/models/whisper/modeling_flax_whisper.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* move sinusoidal init to _init_weights

---------
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

1e3c9dda

09 Oct, 2023 1 commit
- Fixed malapropism error (#26660) · 86a4e5a9
  Shreyas S authored Oct 09, 2023
```
Update test_integration.py

Fixed malapropism clone>copy
```
  86a4e5a9
06 Oct, 2023 6 commits

[`LlamaTokenizerFast`] Adds edge cases for the template processor (#26606) · 9ad815e4

Arthur authored Oct 06, 2023

* make sure eos and bos are properly handled for fast tokenizer

* fix code llama as well

* nits

* fix the conversion script as well

* fix failing test

9ad815e4

remove SharedDDP as it is deprecated (#25702) · 27597fea

statelesshz authored Oct 06, 2023



* remove SharedDDP as it was drepracated

* apply review suggestion

* make style

* Oops,forgot to remove the compute_loss context manager in Seq2SeqTrainer.

* remove the unnecessary conditional statement

* keep the logic of IPEX

* clean code

* mix precision setup & make fixup

---------
Co-authored-by: statelesshz <jihuazhong1@huawei.com>

27597fea

Fix failing `MusicgenTest .test_pipeline_text_to_audio` (#26586) · e840aa67
Yih-Dar authored Oct 06, 2023
```
* fix

* fix

* Fix

* Fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
e840aa67

Remove unnecessary unsqueeze - squeeze in rotary positional embedding (#26162) · 64845307

fxmarty authored Oct 06, 2023

* remove unnecessary unsqueeze-squeeze in llama

* correct other models

* fix

* revert gpt_neox_japanese

* fix copie

* fix test

64845307

Update tokenization_code_llama_fast.py (#26576) · 65aabafe

Tianqi Liu authored Oct 06, 2023

* Update tokenization_code_llama_fast.py

* Update test_tokenization_code_llama.py

* Update test_tokenization_code_llama.py

65aabafe

Fixed inconsistency in several fast tokenizers (#26561) · af38c837
Towdo authored Oct 06, 2023

af38c837

05 Oct, 2023 3 commits

#26566 swin2 sr allow in out channels (#26568) · 0a3b9d02

Marvin Gabler authored Oct 05, 2023



* feat: close #26566, changed model & config files to accept arbitary in and out channels

* updated docstrings

* fix: linter error

* fix: update Copy docstrings

* fix: linter update

* fix: rename num_channels_in to num_channels to prevent breaking changes

* fix: make num_channels_out None per default

* Update src/transformers/models/swin2sr/configuration_swin2sr.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix: update tests to include num_channels_out

* fix:linter

* fix: remove normalization with precomputed rgb values when #input_channels!=#output_channels

---------
Co-authored-by: marvingabler <marvingabler@outlook.de>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

0a3b9d02

[`core`] fix silent bug `keep_in_fp32` modules (#26589) · e6d250e4
Younes Belkada authored Oct 05, 2023
```
* fix silent bug `keep_in_fp32` modules

* final fix

* added a common test.

* Trigger CI

* revert
```
e6d250e4
Fix failing tests on `main` due to torch 2.1 (#26607) · 54e17a15
Yih-Dar authored Oct 05, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
54e17a15

04 Oct, 2023 3 commits

skip flaky hub tests (#26594) · c037b2e3
Arthur authored Oct 04, 2023
```
skip flaky
```
c037b2e3
Add # Copied from statements to audio feature extractors that use the floats_list function (#26581) · 9deb18ca
dg845 authored Oct 04, 2023
```
Add # Copied from statements to audio feature extractors that use the floats_list function.
```
9deb18ca

Docstring check (#26052) · 03af4c42

Sylvain Gugger authored Oct 04, 2023



* Fix number of minimal calls to the Hub with peft integration

* Alternate design

* And this way?

* Revert

* Nits to fix

* Add util

* Print when changes are made

* Add list to ignore

* Add more rules

* Manual fixes

* deal with kwargs

* deal with enum defaults

* avoid many digits for floats

* Manual fixes

* Fix regex

* Fix regex

* Auto fix

* Style

* Apply script

* Add ignored list

* Add check that templates are filled

* Adding to CI checks

* Add back semi-fix

* Ignore more objects

* More auto-fixes

* Ignore missing objects

* Remove temp semi-fix

* Fixes

* Update src/transformers/models/pvt/configuration_pvt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update utils/check_docstrings.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/utils/quantization_config.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Deal with float defaults

* Fix small defaults

* Address review comment

* Treat

* Post-rebase cleanup

* Address review comment

* Update src/transformers/models/deprecated/mctct/configuration_mctct.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Address review comment

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

03af4c42

03 Oct, 2023 6 commits

[Tokenizers] Skip tests temporarily (#26574) · 5c66378c
Lysandre Debut authored Oct 03, 2023
```
* Skip tests temporarily

* style

* Add additional test
```
5c66378c
[Whisper] Allow basic text normalization (#26149) · 57f44dc4
Sanchit Gandhi authored Oct 03, 2023
```
* [Whisper] Allow basic text normalization

* up

* style copies
```
57f44dc4

[`PEFT`] Final fixes (#26559) · 2aef9a96

Younes Belkada authored Oct 03, 2023

* fix issues with PEFT

* logger warning futurewarning issues

* fixup

* adapt from suggestions

* oops

* rm test

2aef9a96

[`Mistral`] Add Flash Attention-2 support for `mistral` (#26464) · ae9a344c

Younes Belkada authored Oct 03, 2023



* add FA-2 support for mistral

* fixup

* add sliding windows

* fixing few nits

* v1 slicing cache - logits do not match

* add comment

* fix bugs

* more mem efficient

* add warning once

* add warning once

* oops

* fixup

* more comments

* copy

* add safety checker

* fixup

* Update src/transformers/models/mistral/modeling_mistral.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* copied from

* up

* raise when padding side is right

* fixup

* add doc + few minor changes

* fixup

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

ae9a344c

[Wav2Vec2 and Co] Update init tests for PT 2.1 (#26494) · 768aa3d9
Sanchit Gandhi authored Oct 03, 2023

768aa3d9

Add tokenizer kwargs to fill mask pipeline. (#26234) · b5ca8fcd

Nathan Cahill authored Oct 03, 2023



* add tokenizer kwarg inputs

* Adding tokenizer_kwargs to _sanitize_parameters

* Add truncation=True example to tests

* Update test_pipelines_fill_mask.py

* Update test_pipelines_fill_mask.py

* make fix-copies and make style

* Update fill_mask.py

Replace single tick with double

* make fix-copies

* Style

---------
Co-authored-by: Lysandre <lysandre@huggingface.co>

b5ca8fcd

02 Oct, 2023 4 commits

Code-llama-nit (#26300) · bab33319

Arthur authored Oct 02, 2023

* fix encoding when the fill token is None

* add tests and edge cases

* fiuxp

* Update tests/models/code_llama/test_tokenization_code_llama.py

bab33319

Fix model integration ci (#26322) · 63864e05

Arthur authored Oct 02, 2023

* fix wav2vec2

* nit

* stash

* one more file to update

* fix byt5

* vocab size is 256, don't change that!

* use other revision

* test persimon in smaller size

* style

* tests

* nits

* update add tokens from pretrained

* test tokenization

* nits

* potential fnet fix?

* more nits

* nits

* correct test

* assert close

* udpate

* ouch

* fix it

* some more nits

* FINALLU

* use `adept` checkpoints

* more adept checkpoints

* that was invlved!

63864e05

[`core`/ `auto` ] Fix bnb test with code revision + bug with code revision (#26431) · 6824461f

Younes Belkada authored Oct 02, 2023

* fix bnb test with code revision

* fix test

* Apply suggestions from code review

* Update src/transformers/models/auto/auto_factory.py

* Update src/transformers/models/auto/auto_factory.py

* Update src/transformers/models/auto/auto_factory.py

6824461f

Revert falcon exception (#26472) · 67239f73

Lysandre Debut authored Oct 02, 2023

* Revert "Falcon: fix revision propagation (#26006)"

This reverts commit 118c676ef3124423e5d062b665f05cde55bc9a90.

* Revert "Put Falcon back (#25960)"

This reverts commit 22a69f1d.

67239f73