- 24 May, 2023 10 commits
-
-
Zachary Mueller authored
* Check for use_sagemaker_dp * Add a check for is_sagemaker_mp when setting _n_gpu again. Should be last broken thing * Try explicit check? * Quality
-
Daniel King authored
Fix the regex in `get_imports` to support multiline try blocks and excepts with specific exception types (#23725)
* fix and test get_imports for multiline try blocks, and excepts with specific errors
* fixup
* add some more tests
* add license
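A hedged sketch of the pattern change this fix describes (the real helper in `dynamic_module_utils.py` differs in detail): strip `try`/`except` blocks, including ones spanning several lines and excepts naming a specific exception type, before collecting imports.

```python
import re

def get_imports_sketch(content: str) -> list[str]:
    # Drop try/except blocks so optional, guarded imports are ignored.
    # re.DOTALL lets the pattern span multiple lines, and `[^:]*` allows the
    # except clause to name a specific type (e.g. `except ImportError:`).
    content = re.sub(r"\s*try\s*:.*?except[^:]*:", "", content, flags=re.DOTALL)
    imports = re.findall(r"^\s*import\s+(\S+)", content, flags=re.MULTILINE)
    imports += re.findall(r"^\s*from\s+(\S+)\s+import", content, flags=re.MULTILINE)
    return sorted(set(imports))
```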
-
Matt authored
* Let's try autodetecting serving sigs
* Don't clobber existing sigs
* Change shapes for multiplechoice models
* Make default dummy inputs smarter too
* Fix missing f-string
* Let's YOLO a serving output too
* Read __class__.__name__ properly
* Don't just pass naked lists in there and expect it to be okay
* Code cleanup
* Update default serving sig
* Clearer error messages
* Further updates to the default serving output
* make fixup
* Update the serving output a bit more
* Cleanups and renames, raise errors appropriately when we can't infer inputs
* More renames
* we're building in a functional context again, yolo
* import DUMMY_INPUTS from the right place
* import DUMMY_INPUTS from the right place
* Support cross-attention in the dummies
* Support cross-attention in the dummies
* Complete removal of dummy/serving overrides in BERT
* Complete removal of dummy/serving overrides in RoBERTa
* Obliterate lots and lots of serving sig and dummy overrides
* merge type hint changes
* Fix for token_type_ids with vocab_size 1
* Add missing property decorator
* Fix T5 and hopefully some models that take conv inputs
* More signature pruning
* Fix T5's signature
* Fix Wav2Vec2 signature
* Fix LongformerForMultipleChoice input signature
* Fix BLIP and LED
* Better default serving output error handling
* Fix BART dummies
* Fix dummies for cross-attention, esp encoder-decoder models
* Fix visionencoderdecoder signature
* Fix BLIP serving output
* Small tweak to BART dummies
* Cleanup the ugly parameter inspection line that I used in a few places
* committed a breakpoint again
* Move the text_dims check
* Remove blip_text serving_output
* Add decoder_input_ids to the default input sig
* Remove all the manual overrides for encoder-decoder model signatures
* Tweak longformer/led input sigs
* Tweak default serving output
* output.keys() -> output
* make fixup
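A hedged sketch of the autodetection idea (the helper name and candidate list are illustrative, not the actual `TFPreTrainedModel` code): derive a default serving signature from the standard text-model inputs the model's `call()` actually accepts, so per-model overrides can be deleted.

```python
import inspect

import tensorflow as tf

def default_input_signature(model) -> dict:
    # Offer every standard text-model input that the model's call() accepts;
    # models with unusual inputs can still override this.
    candidates = ["input_ids", "attention_mask", "token_type_ids", "decoder_input_ids"]
    call_params = inspect.signature(model.call).parameters
    return {
        name: tf.TensorSpec([None, None], tf.int32, name=name)
        for name in candidates
        if name in call_params
    }
```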
-
Connor Henderson authored
move text_prompt_ids trimming to top
-
Matt authored
* Extremely small change to TF SAM dummies to reduce memory usage on build
* remove debug breakpoint
* Debug print statement to track array sizes
* More debug shape printing
* More debug shape printing
* Now remove the debug shape printing
* make fixup
* make fixup
-
Matt authored
* Rework TF type hints to use | None instead of Optional[] for tf.Tensor
* Rework TF type hints to use | None instead of Optional[] for tf.Tensor
* Don't forget the imports
* Add the imports to tests too
* make fixup
* Refactor tests that depended on get_type_hints
* Better test refactor
* Fix an old hidden bug in the test_keras_fit input creation code
* Fix for the Deit tests
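The `X | None` style relies on `from __future__ import annotations` to parse on Python versions before 3.10, which is presumably what the "Don't forget the imports" bullets refer to. A minimal illustration with a hypothetical function:

```python
from __future__ import annotations  # makes `tf.Tensor | None` parse on Python < 3.10

import tensorflow as tf

# Before: attention_mask: Optional[tf.Tensor] = None
# After:  attention_mask: tf.Tensor | None = None
def apply_mask(hidden_states: tf.Tensor, attention_mask: tf.Tensor | None = None) -> tf.Tensor:
    if attention_mask is None:
        return hidden_states
    # Broadcast the (batch, seq_len) mask over the hidden dimension.
    return hidden_states * tf.cast(attention_mask, hidden_states.dtype)[..., None]
```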
-
Wang, Yi authored
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
-
uchuhimo authored
fix: use bool instead of uint8/byte in Deberta/DebertaV2/SEW-D to make it compatible with TensorRT (#23683)
* Use bool instead of uint8/byte in DebertaV2 to make it compatible with TensorRT. TensorRT cannot accept an ONNX graph with uint8/byte intermediate tensors, so this PR uses bool tensors instead of uint8/byte tensors so that the exported ONNX file works with TensorRT.
* fix: use bool instead of uint8/byte in Deberta and SEW-D
---------
Co-authored-by: Yuxian Qiu <yuxianq@nvidia.com>
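A hedged before/after sketch of the tensor-type change (the mask construction here is simplified relative to the actual DeBERTa code):

```python
import torch

scores = torch.randn(2, 4, 4)
mask = torch.randint(0, 2, (2, 4, 4))

# Before (uint8 intermediate, rejected by TensorRT):
#   rmask = (1 - mask).byte()
# After (bool intermediate):
rmask = ~mask.bool()
scores = scores.masked_fill(rmask, torch.finfo(scores.dtype).min)
```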
-
Tim Dettmers authored
* Added lion and paged optimizers and made original tests pass.
* Added tests for paged and lion optimizers.
* Added and fixed optimizer tests.
* Style and quality checks.
---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
-
Tim Dettmers authored
* Added lion and paged optimizers and made original tests pass.
* Added tests for paged and lion optimizers.
* Added and fixed optimizer tests.
* Style and quality checks.
* Initial draft. Some tests fail.
* Fixed dtype bug.
* Fixed bug caused by torch_dtype='auto'.
* All test green for 8-bit and 4-bit layers.
* Added fix for fp32 layer norms and bf16 compute in LLaMA.
* Initial draft. Some tests fail.
* Fixed dtype bug.
* Fixed bug caused by torch_dtype='auto'.
* All test green for 8-bit and 4-bit layers.
* Added lion and paged optimizers and made original tests pass.
* Added tests for paged and lion optimizers.
* Added and fixed optimizer tests.
* Style and quality checks.
* Fixing issues for PR #23479.
* Added fix for fp32 layer norms and bf16 compute in LLaMA.
* Reverted variable name change.
* Initial draft. Some tests fail.
* Fixed dtype bug.
* Fixed bug caused by torch_dtype='auto'.
* All test green for 8-bit and 4-bit layers.
* Added lion and paged optimizers and made original tests pass.
* Added tests for paged and lion optimizers.
* Added and fixed optimizer tests.
* Style and quality checks.
* Added missing tests.
* Fixup changes.
* Added fixup changes.
* Missed some variables to rename.
* revert trainer tests
* revert test trainer
* another revert
* fix tests and safety checkers
* protect import
* simplify a bit
* Update src/transformers/trainer.py
* few fixes
* add warning
* replace with `load_in_kbit = load_in_4bit or load_in_8bit`
* fix test
* fix tests
* this time fix tests
* safety checker
* add docs
* revert torch_dtype
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* multiple fixes
* update docs
* version checks and multiple fixes
* replace `is_loaded_in_kbit`
* replace `load_in_kbit`
* change methods names
* better checks
* oops
* oops
* address final comments
---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
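A hedged usage sketch of the 4-bit loading path this PR (#23479) wires up (checkpoint name illustrative; requires `bitsandbytes` and a CUDA device):

```python
from transformers import AutoModelForCausalLM

# `load_in_kbit` in the log above refers to the internal
# `load_in_4bit or load_in_8bit` check.
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m",
    load_in_4bit=True,
    device_map="auto",
)
```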
-
- 23 May, 2023 9 commits
-
-
zspo authored
* Fix some docs describing what layerdrop does
* Update src/transformers/models/data2vec/configuration_data2vec_audio.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Fix more docs
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
小桐桐 authored
Ref: https://github.com/huggingface/peft/issues/394 Loading a quantized checkpoint into a non-quantized Linear8bitLt is not supported; call module.cuda() before module.load_state_dict().
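A hedged sketch of the ordering constraint (module sizes and the checkpoint path are placeholders):

```python
import torch
import bitsandbytes as bnb

# Quantization of Linear8bitLt happens when the module is moved to CUDA,
# so quantize first, then load the already-quantized weights.
module = bnb.nn.Linear8bitLt(16, 16, has_fp16_weights=False)
module.cuda()
module.load_state_dict(torch.load("quantized_linear.pt"))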
-
LWprogramming authored
* Fix is_batched code to allow 2-D numpy arrays for audio * Tests * Fix typo * Incorporate comments from PR #23223
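A hedged sketch of the batching check these bullets describe (simplified relative to the feature extractor's actual code): a 2-D numpy array now counts as a batch of mono waveforms.

```python
import numpy as np

def is_batched(raw_speech) -> bool:
    if isinstance(raw_speech, np.ndarray):
        # A 2-D array is a batch of mono waveforms.
        return raw_speech.ndim > 1
    return (
        isinstance(raw_speech, (list, tuple))
        and len(raw_speech) > 0
        and isinstance(raw_speech[0], (np.ndarray, tuple, list))
    )
```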
-
Younes Belkada authored
fix blip doctest
-
Matt authored
* New TF version compatibility fixes * Remove dummy print statement, move expand_1d * Make a proper framework inference function * Make a proper framework inference function * ValueError -> TypeError
-
Younes Belkada authored
* add a dummy pipeline test * change test name
-
Alex authored
* Update modeling_open_llama.py: fix typo in `use_memorry_efficient_attention` parameter name
* Update configuration_open_llama.py: fix typo in `use_memorry_efficient_attention` parameter name
* Update configuration_open_llama.py: take care of backwards compatibility, ensuring that the previous parameter name is taken into account if used
* Update configuration_open_llama.py: adjust formatting to respect the line length
* Update configuration_open_llama.py: proper code formatting using `make fixup`
* Update configuration_open_llama.py: pop the argument so it is not set again later down the line
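A hedged sketch of the backwards-compatibility handling the third bullet describes (class name and structure are illustrative):

```python
# Hypothetical, trimmed-down config showing the compatibility shim.
class OpenLlamaConfigSketch:
    def __init__(self, use_memory_efficient_attention=True, **kwargs):
        # Honor the old (deliberately misspelled) kwarg if callers still pass
        # it, popping it so it is not applied again later down the line.
        self.use_memory_efficient_attention = kwargs.pop(
            "use_memorry_efficient_attention", use_memory_efficient_attention
        )
```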
-
NielsRogge authored
* Add PerSAM args * Make attn_sim optional * Rename to attention_similarity * Add docstrings * Improve docstrings
-
Nicolas Patry authored
-
- 22 May, 2023 6 commits
-
-
NielsRogge authored
* First draft
* Remove print statements
* Add conditional generation
* Add more tests
* Remove scripts
* Remove BLIP specific lines
* Add support for pix2struct
* Add fast test
* Address comment
* Fix style
-
LWprogramming authored
* Fix wav2vec2 is_batched check to include 2-D numpy arrays
* address comment
* Add tests
* oops
* oops
* Switch to np array
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* Switch to np array
* condition merge
* Specify mono channel only in comment
* oops, add other comment too
* make style
* Switch list check from falsiness to empty
---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
-
Tim Dettmers authored
* Fixed bug where LLaMA layer norm would change input type. * make fix-copies --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com>
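A hedged sketch of the fix in functional form (the real change lives in `LlamaRMSNorm.forward`): do the normalization math in fp32, then cast back to the input dtype.

```python
import torch

def rms_norm(hidden_states: torch.Tensor, weight: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    input_dtype = hidden_states.dtype
    # Compute in fp32 for numerical stability...
    hidden_states = hidden_states.to(torch.float32)
    variance = hidden_states.pow(2).mean(-1, keepdim=True)
    hidden_states = hidden_states * torch.rsqrt(variance + eps)
    # ...but return the input dtype so the norm no longer changes it.
    return weight * hidden_states.to(input_dtype)
```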
-
Zachary Mueller authored
* Fix deepspeed recursion * Better fix
-
zspo authored
* Fix tensor device while attention_mask is not None * Fix tensor device while attention_mask is not None
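A hedged sketch of the general device-placement pattern behind fixes like this one (the function is illustrative, not the actual patched code): tensors created inside the forward pass should live on the device of the incoming tensors.

```python
import torch

def make_position_ids(inputs_embeds: torch.Tensor, past_length: int = 0) -> torch.Tensor:
    seq_length = inputs_embeds.shape[1]
    # Allocate on inputs_embeds.device rather than the default device, so the
    # forward pass works for both CPU and GPU inputs.
    position_ids = torch.arange(
        past_length, seq_length + past_length, dtype=torch.long, device=inputs_embeds.device
    )
    return position_ids.unsqueeze(0)
```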
-
Tyler authored
* Debug example code for MegaForCausalLM: set ignore_mismatched_sizes=True in model loading code * Fix up
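A hedged sketch of the documented loading call (the checkpoint name is illustrative):

```python
from transformers import MegaForCausalLM

# ignore_mismatched_sizes=True lets loading proceed when a head's shape
# differs from the checkpoint's instead of raising an error.
model = MegaForCausalLM.from_pretrained(
    "mnaylor/mega-base-wikitext", ignore_mismatched_sizes=True
)
```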
-
- 19 May, 2023 9 commits
-
-
Younes Belkada authored
* remove redundant shit right * fix failing tests * this time fix tests
-
Dennis Loevlie authored
* Fix: Change tensors to integers in torch.split() for torch.dynamo and torch.compile compatibility * Applied the suggested fix to the utils/check_copies.py test * Applied the suggested fix by changing the original function that gets copied
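A hedged sketch of the kind of change described (the tensors involved in the real fix differ): convert tensor-valued split sizes to plain Python integers before calling torch.split.

```python
import torch

x = torch.randn(2, 10)
sizes = torch.tensor([4, 6])
# Tensor-valued split sizes can trip up torch.dynamo/torch.compile tracing;
# plain Python ints keep the graph static.
parts = torch.split(x, [int(s) for s in sizes], dim=1)
```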
-
joaoareis authored
-
Zachary Mueller authored
* Fix sagemaker/distributed state * Fix correctly * Bring back -1 * Bring back local rank for distributed check * better version * Cleanest option
-
Sylvain Gugger authored
Use config to set name and description if not present
-
Younes Belkada authored
* rwkv fix for 8bit inference * add comment
-
Matt authored
* First commit
* Add auto-translation with GPT-4
* make fixup
* Add a functional layernorm for TF
* Add all the auxiliary imports etc.
* Add the extra processor and tests
* rebase to main
* Add all the needed fixes to the GPT code
* make fixup
* Make convolutions channels-last so they run on CPU
* make fixup
* Fix final issues
* Fix other models affected by test change
* Clarify comment on the sparse_prompt_embeddings check
* Refactor functional_layernorm, use shape_list in place of .shape in some places
* Remove deprecated torch-alike code
* Update tests/models/sam/test_modeling_tf_sam.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update tests/models/sam/test_modeling_tf_sam.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Refactor processor with common methods and separated private methods
* make fixup
* Quietly delete the file that didn't do anything (sorry Sylvain)
* Refactor the processor tests into one file
* make fixup
* Clean up some unnecessary indirection
* Fix TF mask postprocessing
* Add more processor equivalence tests
* Refactor generate_crop_boxes to use framework-neutral np code
* Make the serving output correctly conditional
* Fix error message line length
* Use dict keys rather than indices internally in both TF and PT SAM call/forward
* Return dicts internally in the call/forward methods
* Revert changes to common tests and just override check_pt_tf_outputs
* Revert changes to other model tests
* Clarify comments for functional layernorm
* Add missing transpose from PT code
* Removed unused copied from in PT code
* Remove overrides for tests that don't exist in TF
* Fix transpose and update tests for PT and TF to check pred_masks
* Add training flag
* Update tests to use TF checkpoints
* Update index.mdx
* Add missing cross-test decorator
* Remove optional extra asterisks
* Revert return_dict changes in PT code
* Update src/transformers/models/sam/modeling_tf_sam.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Remove None return annotations on init methods
* Update tests/models/sam/test_processor_sam.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Fix input_boxes shapes
* make fixup
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
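A hedged sketch of the "functional layernorm for TF" the bullets mention (signature and internals are illustrative, not the exact port):

```python
import tensorflow as tf

def functional_layernorm(inputs, weight, bias, epsilon: float = 1e-6, axis: int = -1):
    # Normalize over `axis` with externally supplied scale and shift, mirroring
    # torch.nn.functional.layer_norm rather than a stateful Keras layer.
    mean, variance = tf.nn.moments(inputs, axes=[axis], keepdims=True)
    normalized = (inputs - mean) * tf.math.rsqrt(variance + epsilon)
    return normalized * weight + bias
```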
-
Jiewen Tan authored
Patched the optimizers
-
Connor Henderson authored
* initial working additions
* clean and rename, add cond stripping initial prompt to decode
* cleanup, edit create_initial_prompt_ids, add tests
* repo consistency, flip order of conditional
* fix error, move the processor fn to the tokenizer
* repo consistency, update test ids to corresponding tokenizer
* use convert_tokens_to_ids not get_vocab...
* use actual conditional in generate
* make style
* initial address comments
* initial working add new params to pipeline
* first draft of sequential generation for condition_on_previous_text
* add/update tests, make compatible with timestamps
* make compatible with diff. input kwargs and max length
* add None check
* add temperature check
* flip temp check operand
* refocusing to prev pr scope
* remove the params too
* make style
* edits, move max length incorporating prompt to whisper
* address comments
* remove asr pipeline prompt decoding, fix indexing
* address comments (more tests, validate prompt)
* un-comment out tests (from debug)
* remove old comment
* address comments
* fix typo
* remove timestamp token from test
* make style
* cleanup
* copy method to fast tokenizer, set max_new_tokens for test
* prompt_ids type just pt
* address Amy's comments
* make style
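A hedged usage sketch of the Whisper prompting feature this PR adds (checkpoint and prompt text are illustrative; `input_features` would come from the processor):

```python
from transformers import WhisperForConditionalGeneration, WhisperProcessor

processor = WhisperProcessor.from_pretrained("openai/whisper-tiny")
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-tiny")

# Bias the transcription toward domain vocabulary by prepending prompt ids.
prompt_ids = processor.get_prompt_ids("Caulerpa lentillifera", return_tensors="pt")
# output = model.generate(input_features, prompt_ids=prompt_ids)
```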
-
- 18 May, 2023 6 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Properly guard PyTorch stuff * [all-test] * [all-test] Fix model imports as well * Making sure StoppingCriteria is always defined * [all-test]
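A hedged sketch of the guarding pattern described (the placeholder fallback is illustrative; the library itself routes missing names through dummy objects):

```python
from transformers.utils import is_torch_available

if is_torch_available():
    from transformers import StoppingCriteria
else:
    StoppingCriteria = object  # placeholder so the name is always defined
```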
-
Yih-Dar authored
* fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Add local agent * Document LocalAgent
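A hedged usage sketch of the new class (checkpoint name illustrative):

```python
from transformers import LocalAgent

# Run the agents API against a local checkpoint instead of a remote endpoint.
agent = LocalAgent.from_pretrained("bigcode/starcoder")
# agent.run("Translate this text to French", text="Hello world")
```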
-
Joao Gante authored
-