- 23 Jul, 2024 13 commits
-
-
Merve Noyan authored
Co-authored-by: Merve Noyan <mervenoyan@Merve-MacBook-Pro.local>
-
Cyril Vallez authored
Add the lru_cache for speed
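The `lru_cache` speed-up pattern in general, as a minimal stdlib sketch (the function and counter here are illustrative, not the call site this commit touched):

```python
from functools import lru_cache

CALLS = {"count": 0}

@lru_cache(maxsize=128)
def normalize(token: str) -> str:
    """Stand-in for an expensive computation; cached per distinct input."""
    CALLS["count"] += 1
    return token.strip().lower()

# Repeated inputs hit the cache instead of recomputing.
for tok in ["Hello", "Hello", "World", "Hello"]:
    normalize(tok)
```

Only the two distinct inputs are actually computed; `normalize.cache_info()` reports the hits.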
-
Ita Zaporozhets authored
* gguf conversion forces add_prefix_space=False for llama3; this is not required and forces from_slow, which fails. Change it to None + test
* typo
* clean test
-
Joao Gante authored
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
bayllama authored
* Change resize_token_embeddings to make it return the same class that is passed to it
* Add explanatory comment as requested in review
* Add explanatory comments for the resizing function in lxmert
* Add comment for padding_idx and move _resize_bias in lxmert to LxmertForPreTraining

Co-authored-by: Prashanth Sateesh <prasatee@Prashanths-MBP.attlocal.net>
Co-authored-by: Prashanth Sateesh <prasatee@Prashanths-MacBook-Pro.local>
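The class-preservation idea behind this change can be sketched framework-free: construct the resized layer via `type(old)` rather than a hard-coded base class, so subclasses survive the resize. All names below are illustrative stand-ins, not the actual transformers code:

```python
class Embedding:
    """Minimal stand-in for an embedding layer (illustrative only)."""
    def __init__(self, num_embeddings: int, dim: int):
        self.weight = [[0.0] * dim for _ in range(num_embeddings)]

class ScaledEmbedding(Embedding):
    """A hypothetical subclass a model might use instead of the base class."""
    scale = 0.5

def resize_embeddings(old: Embedding, new_num: int) -> Embedding:
    dim = len(old.weight[0])
    # Instantiate type(old), not Embedding, so subclasses keep their class.
    new = type(old)(new_num, dim)
    n = min(new_num, len(old.weight))
    new.weight[:n] = [row[:] for row in old.weight[:n]]
    return new

resized = resize_embeddings(ScaledEmbedding(4, 8), 6)
```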
-
Daniel Lok authored
add attribute to model

Signed-off-by: Daniel Lok <daniel.lok@databricks.com>
-
mig-mfreitas authored
* Add YaRN and Dynamic-YaRN RoPE Scaling Methods

  YaRN (Yet another RoPE extension method) combines the NTK-By-Parts Interpolation and Attention Scaling methods, improving upon existing RoPE interpolation methods for longer context window sizes. Fine-tuned models maintain their original performance across benchmarks while enabling efficient extrapolation and transfer learning for quicker convergence, especially in compute-limited environments.

  We implement YaRN and Dynamic-YaRN for the following models: LLaMA, Falcon, GPT-NeoX, Olmo, Persimmon, Phi, StableLM, OpenLLaMA.

  New unit tests are added to assert YaRN's correct behavior on both short and long sequence inputs. For more details, please refer to https://arxiv.org/abs/2309.00071.

* Refactor YaRN implementation for LLaMA

  Iterate on the YaRN implementation for LLaMA and remove the diff from the remaining models for increased PR modularity. This commit includes the following changes:
  - Merge the 'yarn_rope_scaling' and 'rope_scaling' dictionaries
  - Remove unnecessary attributes ('extrapolation_factor' and 'finetuned') from the YaRN classes
  - Inherit the 'forward' method in the YaRN classes from the superclass
  - Rename the 'yarn' method to 'compute_yarn_scaling'
  - Extend the YaRN tests with further assertions
  - Fix style inconsistencies

* Refactor Tensor Building Logic for YaRN
  - Comply with the tensor building logic introduced in #30743
  - Add a reference to the optimized Attention Factor equation
  - Remove Dynamic YaRN for a more agile deployment

* remove unwanted file

Co-authored-by: Miguel Almeida <miguel.pessanha.almeida@tecnico.ulisboa.pt>
Co-authored-by: Miguel Monte e Freitas <miguelmontefreitas@tecnico.ulisboa.pt>
Co-authored-by: mig-mfreitas <mig-mfreitas@users.noreply.github.com>
Co-authored-by: Joao Gante <joao@huggingface.co>
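A rough, framework-free sketch of the NTK-by-parts idea behind YaRN as described in the linked paper: interpolate only the low-frequency RoPE dimensions, leave high-frequency ones untouched, ramp between them, and apply a logarithmic attention temperature. The constants and helper names below are illustrative, not the merged implementation:

```python
import math

def yarn_inv_freq(dim, base=10000.0, scale=4.0, orig_ctx=4096,
                  beta_fast=32.0, beta_slow=1.0):
    """Per-dimension blend of original and position-interpolated RoPE frequencies."""
    inv_freq = [base ** (-2 * i / dim) for i in range(dim // 2)]

    def dim_for_rotations(num_rot):
        # Dimension index whose wavelength completes `num_rot` rotations in orig_ctx.
        return dim * math.log(orig_ctx / (num_rot * 2 * math.pi)) / (2 * math.log(base))

    low = math.floor(dim_for_rotations(beta_fast))   # below: keep original freq
    high = math.ceil(dim_for_rotations(beta_slow))   # above: fully interpolate
    out = []
    for i, f in enumerate(inv_freq):
        ramp = min(max((i - low) / max(high - low, 1), 0.0), 1.0)
        out.append(f * (1 - ramp) + (f / scale) * ramp)  # blend the two regimes
    return out

def attention_scale(scale):
    """YaRN's attention temperature: 0.1 * ln(s) + 1 from the paper."""
    return 0.1 * math.log(scale) + 1.0
```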
-
KonradSzafer authored
encapsulate chat template logic
-
Anton Vlasjuk authored
* fix mask creation of gpt2 and gpt_neox caused by me
* forgot the reshape of masks when shape > 2
* add tests for gpt neox and gpt2
* nit on a comment
-
Sanchit Gandhi authored
* [whisper] remove unnecessary transpose for fa2 attention
* propagate
-
Sanchit Gandhi authored
* [whisper integration] use parquet dataset for testing
* propagate to others
* more propagation
* last one
-
Raushan Turganbay authored
* pad on right if training
* docs
* add tests
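The padding-side rule this commit encodes can be sketched in plain Python (the helper names are illustrative): during training, labels align left-to-right so padding goes on the right; during generation, new tokens append at the end, so padding goes on the left.

```python
def pad_batch(sequences, pad_id=0, side="right"):
    """Pad variable-length token sequences to a rectangle on one side."""
    width = max(len(s) for s in sequences)
    if side == "right":
        return [s + [pad_id] * (width - len(s)) for s in sequences]
    return [[pad_id] * (width - len(s)) + s for s in sequences]

def pick_side(training: bool) -> str:
    # Training: pad right so label positions line up.
    # Generation: pad left so the last real token is adjacent to new tokens.
    return "right" if training else "left"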
-
James Thewlis authored
* Add llama3-llava-next-8b to llava_next conversion script

  Adds support for the lmms-lab/llama3-llava-next-8b model to the convert_llava_next_weights_to_hf.py script, along with an example prompt generated from the llava_llama_3 conv_template in the LLaVA-NeXT repo.

* Exclude <|begin_of_text|> from prompt example

  This token gets added automatically, so it should not be included in the prompt example.

* Add llava-next-72b and llava-next-110b

  Adds the Qwen-based LLaVA-Next models to the conversion script, along with changes to load the models on multiple GPUs for inference.

* Add llama3 and qwen prompt formats to docs
* Chat prompt and padding side left for llama3 batched
* update
* Update src/transformers/models/llava_next/convert_llava_next_weights_to_hf.py
* Update src/transformers/models/llava_next/convert_llava_next_weights_to_hf.py
* remove code
* better naming

Co-authored-by: raushan <raushan@huggingface.co>
Co-authored-by: Raushan Turganbay <raushan.turganbay@alumni.nu.edu.kz>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 22 Jul, 2024 14 commits
-
-
Marc Sun authored
* Add new quant method
* update
* fix multi-device
* add test
* add offload
* style
* style
* add simple example
* initial doc
* docstring
* style again
* works?
* better docs
* switch to non-persistent
* remove print
* fix init
* code review
-
Arthur authored
fixes #7002
-
amyeroberts authored
* Don't default to other weights file when use_safetensors=True
* Add tests
* Update tests/utils/test_modeling_utils.py
* Add clarifying comments to tests
* Update tests/utils/test_modeling_utils.py
* Update tests/utils/test_modeling_utils.py
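The fixed behavior can be sketched as a hypothetical file resolver (names and filenames below are illustrative, not the actual transformers loading code): when the caller explicitly asks for safetensors, raise rather than silently falling back to the legacy weights file.

```python
def resolve_weights(available, use_safetensors=None):
    """Pick a weights file; with use_safetensors=True, never fall back."""
    safetensors = "model.safetensors"
    legacy = "pytorch_model.bin"
    if use_safetensors:
        if safetensors not in available:
            raise FileNotFoundError("use_safetensors=True but no safetensors file found")
        return safetensors
    # Default behavior: prefer safetensors, fall back to the legacy file.
    if safetensors in available:
        return safetensors
    return legacy
```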
-
Yoni Gottesman authored
return assistant generated tokens mask in apply_chat_template
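A toy, framework-free illustration of what an assistant-tokens mask is: a 0/1 vector over the rendered token sequence marking which tokens the assistant generated (useful for loss masking). This only illustrates the mask's shape; the real feature lives in transformers' `apply_chat_template`, and the whitespace "tokenizer" here is a deliberate simplification:

```python
def render_with_mask(messages, tokenize=str.split):
    """Concatenate message tokens; mask is 1 on assistant-generated tokens."""
    tokens, mask = [], []
    for msg in messages:
        toks = tokenize(msg["content"])
        tokens += toks
        mask += [1 if msg["role"] == "assistant" else 0] * len(toks)
    return tokens, mask

chat = [
    {"role": "user", "content": "hello there"},
    {"role": "assistant", "content": "hi !"},
]
tokens, mask = render_with_mask(chat)
```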
-
Bertrand Thia authored
* minor edits and clarifications
* address comment

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Sai-Suraj-27 authored
* Raised TypeError instead of ValueError for invalid types.
* Updated formatting using ruff.
* Retrieved few changes.
* Retrieved few changes.
* Updated tests accordingly.
-
Woojun Jung authored
update `ko/_toctree.yml` and remove `custom_tools.md`
-
Matt authored
* Fix failing test with race condition
* make fixup
* monotonic_ns instead of randint
* uuid4 instead of monotonic_ns
* Add a finally cleanup step
-
Sanchit Gandhi authored
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
-
Lucain authored
-
Sai-Suraj-27 authored
Replaced deprecated mktemp function.
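For context on why `mktemp` is deprecated: it only returns a name, leaving a window in which another process can create that path before you open it. The stdlib replacement, `tempfile.mkstemp`, atomically creates and opens the file:

```python
import os
import tempfile

# tempfile.mktemp() only returns a name; another process could create that
# path before you open it. mkstemp() atomically creates AND opens the file.
fd, path = tempfile.mkstemp(suffix=".txt")
try:
    with os.fdopen(fd, "w") as handle:
        handle.write("safe")
    with open(path) as handle:
        content = handle.read()
finally:
    os.remove(path)
```

`tempfile.NamedTemporaryFile` is the other common replacement when the file should clean itself up.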
-
Joao Gante authored
* rename stuff
* english; this one shouldn't be changed
* add a _ to the new var names
* musicgen
* derp
-
Brian authored
-
Aymeric Roucher authored
* Allow planning for agents
-
- 19 Jul, 2024 12 commits
-
-
Lucain authored
* adapt tests
* style
* comment
-
Raushan Turganbay authored
fixes
-
Zach Mueller authored
Disable via deepspeed
-
Kamil Akesbi authored
* remove is_shortform
* adapt _retrieve_max_frames_and_seek for short_form
* return bos token in short and long form
* add decoder_input_ids to short form audios
* add eos token for short form
* handle short form token_timestamps
* no need to return scores
* add is_shortform conditions
* handle when max_new_tokens is None - short form
* handle assistant decoding
* fix
* handle return_dict_in_generate
* handle split_by_batch for encoder_attentions attribute
* handle num_beams>1
* handle num_return_sequences>1 in generate_with_fallback
* handle num_return_sequences>1 with return_dict_in_generate=True
* raise error if max_new_tokens + decoder_inputs_ids > max_target_pos
* fix
* apply review suggestions
* fix
* Update src/transformers/models/whisper/generation_whisper.py
* Update src/transformers/models/whisper/generation_whisper.py
* Update src/transformers/models/whisper/generation_whisper.py
* fix
* logits for both short form and long form
* handle if logits_processor is None
* test
* apply review changes to num_return_sequences
* add _expand_variables_for_generation
* remove short form commented section
* update comments
* uncomment num_beams line in generate_with_fallback
* update assistant decoding
* handle return_segment with short form generation
* up
* fix output format is_shortform
* overwrite beam_sample test
* update _set_return_timestamps
* apply review suggestions
* apply review suggestions
* remove seek_outputs_short_form
* fix _stack_split_outputs
* fix stack dim in _stack_split_outputs
* update tests
* fix past_key_values + beam tests
* fix
* clean _expand_variables_for_generation
* make style
* fix slow tests
* make style
* max_length condition
* make style
* add slow tests for shortform fallback
* Update src/transformers/models/whisper/generation_whisper.py
* Update src/transformers/models/whisper/generation_whisper.py
* apply review changes
* Update src/transformers/models/whisper/generation_whisper.py
* up
* fix slow tests
* apply review suggestions
* update test
* make style
* small fix
* fix
* fix test_new_cache_format
* fix past_key_values
* fix
* make style
* fix slow tests
* fix

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
-
Merve Noyan authored
* Add image-text-to-text task page
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Address comments
* Fix heading
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Update docs/source/en/tasks/image_text_to_text.md
* Address comments
* Update image_text_to_text.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Merve Noyan authored
* Fixes
* Let's not use auto
-
Keith Stevens authored
* Replace ProgressCallback's deepcopy with a shallow copy
* Use items instead of entries
* code cleanup for copy in trainer callback
* Style fix for ProgressCallback
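The deepcopy-to-shallow-copy tradeoff can be illustrated with the stdlib `copy` module (the class below is a toy stand-in, not the actual trainer callback): a shallow copy gives an independent top-level object while still sharing nested references, which is cheaper and often all that is needed.

```python
import copy

class ProgressState:
    """Toy stand-in for a callback holding a counter and a shared logger ref."""
    def __init__(self, logger):
        self.step = 0
        self.logger = logger

shared_logger = {"sink": "stdout"}
original = ProgressState(shared_logger)

shallow = copy.copy(original)    # new object, nested attributes still shared
deep = copy.deepcopy(original)   # everything duplicated, including the logger

shallow.step = 10                # top-level attribute: independent either way
```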
-
Raushan Turganbay authored
fix chat format
-
Joshua Lochner authored
* [mistral] Fix FA2 attention reshape
* [run-slow] mistral
-
Kamil Akesbi authored
* fix long form timestamps in decode_batch
* Update src/transformers/models/whisper/tokenization_whisper.py
* Update src/transformers/models/whisper/tokenization_whisper.py
* add test
* make style
* fix copies
* Update src/transformers/models/whisper/tokenization_whisper_fast.py
* Update src/transformers/models/whisper/tokenization_whisper.py
* Update src/transformers/models/whisper/processing_whisper.py
* Update src/transformers/models/whisper/tokenization_whisper.py
* apply review suggestions
* fix
* fix copies
* fix
* Update src/transformers/models/whisper/tokenization_whisper_fast.py
* fix-copies

Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
NielsRogge authored
* Improve docs
* Fix docs
* Fix code snippet
-
Raushan Turganbay authored
* add default chat templates
* Update src/transformers/models/llava/processing_llava.py
* Update src/transformers/models/llava_next/processing_llava_next.py
* more clear docstring and docs
* Update docs/source/en/model_doc/llava.md
* Update docs/source/en/model_doc/llava_next.md
* Update docs/source/en/model_doc/vipllava.md
* add tests
* remove default templates (see #31733)
* load chat template from another file
* Update docs/source/en/model_doc/llava_next.md
* revert some changes in docs
* forgot vipllava
* chat template file is not temporary hack
* warn if loading from processor
* not that file
* similarly modify `save_pretrained`
* Update tests/models/llava_next/test_processor_llava_next.py
* Update tests/models/vipllava/test_processor_vipllava.py
* Update docs/source/en/model_doc/vipllava.md
* Update src/transformers/processing_utils.py
* Update src/transformers/processing_utils.py
* Update docs/source/en/model_doc/vipllava.md
* Update docs/source/en/model_doc/llava.md
* Update docs/source/en/model_doc/llava.md
* Update docs/source/en/model_doc/llava_next.md
* Update docs/source/en/model_doc/llava_next.md
* Update src/transformers/processing_utils.py
* Update docs/source/en/model_doc/llava_next.md
* fix

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
- 18 Jul, 2024 1 commit
-
-
Sai-Suraj-27 authored
* Fixed 2 links in the docs along with some minor fixes.
* Updated Contributing.md
-