- 07 Nov, 2023 5 commits
-
Susnato Dhar authored
* added flash attention for gpt-neo
* small change
* readme updated
* .
* changes
* removed padding_mask
* Update src/transformers/models/gpt_neo/modeling_gpt_neo.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
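For context, opting into the new Flash Attention path for GPT-Neo looks roughly like the sketch below (the checkpoint is an assumption; Flash Attention 2 also needs a CUDA GPU, fp16/bf16 weights, and the `flash-attn` package installed):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "EleutherAI/gpt-neo-1.3B"  # assumed checkpoint; any GPT-Neo size works
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    attn_implementation="flash_attention_2",  # older releases use use_flash_attention_2=True
).to("cuda")

inputs = tokenizer("Flash attention speeds up long prompts because", return_tensors="pt").to("cuda")
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0], skip_special_tokens=True))
```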
-
Xabier de Zuazo authored
* Fix error in convert_openai_to_hf.py: "_download() missing 1 required positional argument: root"
* Fix error in convert_openai_to_hf.py: "TypeError: byte indices must be integers or slices, not str"
* Fix decoder_attention_heads value in convert_openai_to_hf.py. Correct the assignment for `decoder_attention_heads` in the conversion script for the Whisper model.
* Black reformat convert_openai_to_hf.py file.
* Fix Whisper model configuration defaults (for Tiny).
  - Correct encoder/decoder layers and attention heads count.
  - Update model width (`d_model`) to 384.
* Add docstring to the convert_openai_to_hf.py script with a doctest
* Add shebang and +x permission to the convert_openai_to_hf.py
* convert_openai_to_hf.py: reuse the read model_bytes in the _download() function
* Move convert_openai_to_hf.py doctest example to whisper.md
* whisper.md: Add an inference example to the Conversion section.
* whisper.md: remove `model.config.forced_decoder_ids` from examples (deprecated)
* whisper.md: Remove "## Format Conversion" section; not used by users
* whisper.md: Use librispeech_asr_dummy dataset and load_dataset()
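The inference example referenced for whisper.md is along these lines (a minimal sketch; the exact checkpoint shown here is an assumption):

```python
from datasets import load_dataset
from transformers import WhisperForConditionalGeneration, WhisperProcessor

processor = WhisperProcessor.from_pretrained("openai/whisper-tiny")
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-tiny")

# Small dummy split used throughout the docs for quick ASR examples.
ds = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation")
sample = ds[0]["audio"]

inputs = processor(sample["array"], sampling_rate=sample["sampling_rate"], return_tensors="pt")
# No forced_decoder_ids needed anymore; language/task can be passed to generate() directly.
generated_ids = model.generate(inputs.input_features)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0])
```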
-
Joao Gante authored
-
Younes Belkada authored
* Update Dockerfile
* Update docker/transformers-all-latest-gpu/Dockerfile
-
Sanchit Gandhi authored
* [Whisper] Block language/task args for English-only
* Update src/transformers/models/whisper/modeling_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
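A sketch of the behaviour this change enforces, as far as the commit title indicates (the checkpoint, dummy input, and exact error type are assumptions):

```python
import torch
from transformers import WhisperForConditionalGeneration

# ".en" checkpoints are English-only, so multilingual-only generate() args should now be rejected.
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-tiny.en")
dummy_features = torch.zeros(1, 80, 3000)  # placeholder log-mel input features

try:
    model.generate(dummy_features, language="fr", task="translate")
except Exception as err:  # the exact exception type may vary
    print(f"Rejected as expected: {err}")
```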
-
- 06 Nov, 2023 10 commits
-
Maria Khalusova authored
* fixed links with 404
* make style
-
Yih-Dar authored
* fix
* fix
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Iker García-Ferrero authored
* Fix dtype error
* Fix mean and std dtype
* make style
-
Akshay Chintalapati authored
I'm adding accelerate as one of the libraries to install because otherwise, when running the Trainer, the model errors out with:
ImportError: Using the `Trainer` with `PyTorch` requires `accelerate>=0.20.1`: Please run `pip install transformers[torch]` or `pip install accelerate -U`
Further context:
1. I've tried this across different environments, so I believe the environment is not the issue.
2. I had the latest transformers library version running.
3. Typically, even after installing accelerate and importing it, the issue wouldn't be resolved until I restarted the notebook and tried again.
-
Arthur authored
-
Hz, Ji authored
-
Pingzhi Li authored
Remove unexpected argument for FlaxResNetBasicLayerCollection
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Mayank Mishra authored
* fix tokenizer
* fix tokenizer
-
- 03 Nov, 2023 12 commits
-
jiaqiw09 authored
* translate run_scripts.md to chinese
* translate run_scripts.md to chinese
* translate run_scripts.md to chinese
-
jiaqiw09 authored
* translate autoclass_tutorial.md to chinese
* translate update
-
Susnato Dhar authored
* flash attention added for DistilBert
* fixes
* removed padding_masks
* Update modeling_distilbert.py
* Update test_modeling_distilbert.py
* style fix
-
Maria Khalusova authored
* first batch of structure improvements for model_docs
* second batch of structure improvements for model_docs
* more structure improvements for model_docs
* more structure improvements for model_docs
* structure improvements for cv model_docs
* more structural refactoring
* addressed feedback about image processors
-
Younes Belkada authored
Update sam.md
-
Shiyu Li authored
* Fix mixed precision error for switch transformer
* Fixup
-
Matt authored
* Update the ConversationalPipeline docstring now that we're using chat templates
* Direct access to conversation.messages
* Explain the string init
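For reference, the documented usage now looks roughly like this (a sketch; the checkpoint is an assumption):

```python
from transformers import Conversation, pipeline

chatbot = pipeline("conversational", model="facebook/blenderbot-400M-distill")

# Initializing with a plain string creates a single user message under the hood.
conversation = Conversation("What is the best way to learn to play guitar?")
conversation = chatbot(conversation)

# With chat templates, the history lives in .messages as a list of {"role", "content"} dicts.
print(conversation.messages)
```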
-
Maria Khalusova authored
doc update
-
Yih-Dar authored
* fix
* update
* update
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Younes Belkada authored
fix peft integration issues
-
Tom Aarsen authored
* Use Llama RoPE implementation for Falcon + Add copy functionalities
* Use standard cache format for Falcon
* Simplify apply_rotary_pos_emb, copy from Llama
* Remove unnecessary cache conversion test
  We don't need to convert any caches anymore!
* Resolve copy complaint
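For reference, the Llama-style rotary embedding helper being copied over boils down to the following (a simplified sketch that omits the cos/sin cache and position-index gathering):

```python
import torch

def rotate_half(x: torch.Tensor) -> torch.Tensor:
    # Split the last dimension in half and rotate: (x1, x2) -> (-x2, x1).
    x1, x2 = x.chunk(2, dim=-1)
    return torch.cat((-x2, x1), dim=-1)

def apply_rotary_pos_emb(q: torch.Tensor, k: torch.Tensor, cos: torch.Tensor, sin: torch.Tensor):
    # Rotate query/key vectors by position-dependent angles (RoPE), Llama-style.
    return (q * cos) + (rotate_half(q) * sin), (k * cos) + (rotate_half(k) * sin)
```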
-
Lysandre Debut authored
-
- 02 Nov, 2023 13 commits
-
Komal Kumar authored
* Fixed base model class name extraction from PeftModels
* Changes to first unwrap the model then extract the base model name
* Changed base_model to base_model.model to stay consistent with peft model abstractions
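The unwrapping described above follows the peft abstraction of PeftModel -> tuner wrapper (`base_model`) -> underlying transformers model (`base_model.model`); a rough sketch, not the exact code from the PR:

```python
from peft import PeftModel

def get_base_model_class_name(model) -> str:
    # Unwrap the PeftModel first, then read the class name of the wrapped transformers model.
    if isinstance(model, PeftModel):
        return model.base_model.model.__class__.__name__
    return model.__class__.__name__
```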
-
Chi authored
* Removed the redundant SiLUActivation class and now use nn.functional.silu directly.
* I apologize for adding torch.functional.silu. I have replaced it with nn.SiLU.
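For reference, `nn.SiLU` computes the same x * sigmoid(x) activation as the removed helper; a quick sanity check (not code from the PR):

```python
import torch
import torch.nn as nn

x = torch.randn(4)
# SiLU (a.k.a. swish) is x * sigmoid(x); the module and functional forms are equivalent.
assert torch.allclose(nn.SiLU()(x), x * torch.sigmoid(x))
assert torch.allclose(nn.functional.silu(x), x * torch.sigmoid(x))
```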
-
jiaqiw09 authored
* translate peft.md to chinese
* translate peft.md to chinese
* fix missing link
-
Lysandre authored
-
Yoach Lacombe authored
* enrich TTS pipeline docstring for clearer forward_params use
* change token lengths
* update Pipeline parameters
* correct docstring and make style
* fix tests
* make style
* change music prompt
* Apply suggestions from code review
* raise errors if generate_kwargs with forward-only models
* make style
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
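A sketch of the `forward_params` usage the docstring now clarifies (the checkpoint and parameter values are assumptions):

```python
from transformers import pipeline

synthesizer = pipeline("text-to-speech", model="suno/bark-small")

# forward_params are forwarded to the model's generate/forward call;
# passing generate_kwargs to a forward-only (non-generative) model now raises an error.
speech = synthesizer("Hello, world!", forward_params={"do_sample": True})
print(speech["sampling_rate"], speech["audio"].shape)
```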
-
Pietro Lesci authored
* remove redundant code
* update
* add typecasting
* make `attention_mask` float again
-
Joao Gante authored
-
Marc Sun authored
fix-exllama
-
Nicolas Patry authored
* Fixing m4t.
* Trying to remove comparison ? Odd test failure.
* Adding shared. But why on earth does it hang ????
* Putting back the model weights checks the test is silently failing on cuda.
* Fix style + unremoved comment.
-
Lysandre Debut authored
* Fix Kosmos2
* Fix ProphetNet
* Fix MarianMT
* Fix M4T
* XLM ProphetNet
* ProphetNet fix
* XLM ProphetNet
* Final M4T fixes
* Tied weights keys
* Revert M4T changes
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Michael Benayoun authored
Wrap _prepare_4d_causal_attention_mask as a leaf function
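A sketch of the general technique (not necessarily the exact code in the commit): registering the mask helper as a torch.fx leaf, so symbolic tracing records a single call node instead of tracing through its data-dependent control flow.

```python
import torch.fx

def _prepare_4d_causal_attention_mask(attention_mask, input_shape, inputs_embeds, past_key_values_length):
    ...  # builds the 4D causal mask (body omitted here)

# Wrapping at module level marks the function as a leaf for torch.fx symbolic tracing.
_prepare_4d_causal_attention_mask = torch.fx.wrap(_prepare_4d_causal_attention_mask)
```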
-
Pablo Montalvo authored
* Fix Fuyu image scaling bug
  It could produce negative padding and hence inference errors for certain image sizes.
* initial rework commit
* add batching capabilities, refactor image processing
* add functional batching for a list of images and texts
* make args explicit
* Fuyu processing update (#27133)
* Add file headers
* Add file headers
* First pass - preprocess method with standard args
* First pass image processor rework
* Small tweaks
* More args and docstrings
* Tidying iterating over batch
* Tidying up
* Modify to have quick tests (for now)
* Fix up
* BatchFeature
* Passing tests
* Add tests for processor
* Sense check when patchifying
* Add some tests
* FuyuBatchFeature
* Post-process box coordinates
* Update to `size` in processor
* Remove unused and duplicate constants
* Store unpadded dims after resize
* Fix up
* Return FuyuBatchFeature
* Get unpadded sizes after resize
* Update exception
* Fix return
* Convert input `<box>` coordinates to model format.
* Post-process point coords, support multiple boxes/points in a single sequence
* Replace constants
* Update src/transformers/models/fuyu/image_processing_fuyu.py
* Preprocess List[List[image]]
* Update src/transformers/models/fuyu/image_processing_fuyu.py
* Update to Amy's latest state.
* post-processing returns a list of tensors
* Fix error when target_sizes is None
* Update src/transformers/models/fuyu/image_processing_fuyu.py
* Update src/transformers/models/fuyu/image_processing_fuyu.py
* Update src/transformers/models/fuyu/image_processing_fuyu.py
* Update src/transformers/models/fuyu/image_processing_fuyu.py
* Update src/transformers/models/fuyu/image_processing_fuyu.py
* Review comments
* Update src/transformers/models/fuyu/image_processing_fuyu.py
* Fix up
* Fix up
* Fix conflicts in fuyu_follow_up_image_processing (#27228)
  fixing conflicts and updating on main
* Revert "Fix conflicts in fuyu_follow_up_image_processing" (#27232)
  Reverts "Fix conflicts in fuyu_follow_up_image_processing (#27228)"; this reverts commit acce10b6c653dc7041fb9d18cfed55775afd6207.
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-72-126.ec2.internal>
Co-authored-by: Pablo Montalvo <pablo.montalvo.leroux@gmail.com>
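For context, the reworked processor is used roughly as follows (a sketch; the checkpoint, image URL, and prompt are assumptions, not taken from the PR):

```python
import requests
from PIL import Image
from transformers import FuyuForCausalLM, FuyuProcessor

processor = FuyuProcessor.from_pretrained("adept/fuyu-8b")
model = FuyuForCausalLM.from_pretrained("adept/fuyu-8b", device_map="auto")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# The processor batches text + images and returns a BatchFeature with input_ids and image patches.
inputs = processor(text="Generate a coco-style caption.\n", images=image, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=16)
print(processor.batch_decode(outputs[:, inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```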
-
Younes Belkada authored
* fix for 8bit serialization
* added regression tests.
* fixup
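For context, 8-bit serialization means a bitsandbytes-quantized model can be saved and reloaded directly; a rough sketch (the checkpoint is an assumption, and bitsandbytes plus a CUDA GPU are required):

```python
from transformers import AutoModelForCausalLM

# Load with 8-bit weights (bitsandbytes) and save the quantized checkpoint to disk.
model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m", load_in_8bit=True)
model.save_pretrained("opt-350m-8bit")

# Reload the serialized 8-bit weights.
reloaded = AutoModelForCausalLM.from_pretrained("opt-350m-8bit")
```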
-