Commits · 16271080333ad52be5349fb31d789fb232b68760 · chenpangpang / transformers

30 Jul, 2024 8 commits

fix: Added missing raise keyword for few exceptions (#32333) · 16271080
Sai-Suraj-27 authored Jul 30, 2024
```
Fixed raising of few exceptions.
```
16271080

Alternative agent plan (#32295) · bd54ed2e

plaggy authored Jul 30, 2024

* new agent plan

* plan type assertion

* style corrections

* better prompt naming

* make fixup

bd54ed2e

Docs: formatting nits (#32247) · e68ec18c

Joao Gante authored Jul 30, 2024



* doc formatting nits

* ignore non-autodocs

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/esm/modeling_esm.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/esm/modeling_esm.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* make fixup

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

e68ec18c

Fix M4T for ASR pipeline (#32296) · 2fbbcf50
Yoach Lacombe authored Jul 30, 2024
```
* tentative fix

* do the same for M4T
```
2fbbcf50
feat(ci): set `fetch-depth: 0` in trufflehog checkout step (#31663) · 084b5094
Luc Georges authored Jul 30, 2024

084b5094

Cast epochs_trained to int when resuming training (#32286) · 20528f06

Teddy Ferdinan authored Jul 30, 2024



* fix epochs_trained as int when resuming training

* refactor

---------
Co-authored-by: teddyferdinan <teddy.ferdinan@pwr.edu.pl>

20528f06

Fix GGUF dequantize for `gguf==0.9.1` (#32298) · 934fe150
Isotr0py authored Jul 30, 2024
```
* fix gguf dequantize for gguf==0.9.1

* fix old version

* make style
```
934fe150

Docs: fix GaLore optimizer code example (#32249) · 3e8106d2

Gilad Turok authored Jul 30, 2024

Docs: fix GaLore optimizer example

Fix incorrect usage of GaLore optimizer in Transformers trainer code example.

The GaLore optimizer uses low-rank gradient updates to reduce memory usage. GaLore is quite popular and is implemented by the authors in [https://github.com/jiaweizzhao/GaLore](https://github.com/jiaweizzhao/GaLore). A few months ago GaLore was added to the HuggingFace Transformers library in https://github.com/huggingface/transformers/pull/29588.

Documentation of the Trainer module includes a few code examples of how to use GaLore. However, the `optim_targe_modules` argument to the `TrainingArguments` function is incorrect, as discussed in https://github.com/huggingface/transformers/pull/29588#issuecomment-2006289512. This pull request fixes this issue.

3e8106d2

29 Jul, 2024 13 commits

use torch 2.4 in 2 CI jobs (#32302) · f0bc49e7
Yih-Dar authored Jul 29, 2024
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f0bc49e7
Add stream messages from agent run for gradio chatbot (#32142) · a24a9a66
Aymeric Roucher authored Jul 29, 2024
```
* Add stream_to_gradio method for running agent in gradio demo
```
a24a9a66
Make static cache compatible with torch.export (#32168) · 811a9caa
Guang Yang authored Jul 29, 2024

811a9caa

[pipeline] fix padding for 1-d tensors (#31776) · 7f5d644e

Sanchit Gandhi authored Jul 29, 2024



* [pipeline] fix padding for 1-d tensors

* add test

* make style

* Update tests/pipelines/test_pipelines_automatic_speech_recognition.py
Co-authored-by: Kamil Akesbi <45195979+kamilakesbi@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_automatic_speech_recognition.py

---------
Co-authored-by: Kamil Akesbi <45195979+kamilakesbi@users.noreply.github.com>

7f5d644e

Whisper tokenizer word level timestamps (#32197) · 3fbaaaa6

Kamil Akesbi authored Jul 29, 2024

* fix _fix_key in PreTrainedModel

* fix _find_longest_common_sequence

* add test

* remove result.json

* nit

* update test

3fbaaaa6

Generate: end-to-end compilation (#30788) · 7ffe25f2

Joao Gante authored Jul 29, 2024

* mvp

* added test (a few models need fixes)

* fix a few test cases

* test nits

* harder test 😈

* revert changes in stablelm

* test with improved condition

* add todo

* tmp commit

* merged with main

* nits

* add todo

* final corrections

* add docs for generation compilation

* docs nits

* add  tip

* PR suggestions

* add more details to the compilation docs

* fix cache positions

* cache is now init in generate; update docs

* tag test as flaky

* docs

* post rebase make fixup and other nits

* remove unintended changes

* whisper (encoder-decoder) not supported

* move token default updates to ; add tests for token defaults

* push changes

* manual rebase

* chameleon doesn't support this

* fix test_static_cache_mha_mqa_gqa (broken in another PR)

* docs: dynamic is better with end-to-end compilation

7ffe25f2

fix(docs): Fixed a link in docs (#32274) · 49928892
Sai-Suraj-27 authored Jul 29, 2024
```
Fixed a link in docs.
```
49928892
make `p_mask` a numpy array before passing to `select_starts_ends` (#32076) · 6494479f
Fanli Lin authored Jul 29, 2024
```
* fix

* bug fix

* refine

* fix
```
6494479f
Repo: remove exceptions in `check_docstrings` (#32259) · 535fe78b
Joao Gante authored Jul 29, 2024
```
remove exceptions
```
535fe78b
fix: Fixed wrong argument passed to `convert_blip_checkpoint` function call (#32262) · a2ad9d5a
Sai-Suraj-27 authored Jul 29, 2024
```
Removed one wrong argument passed to convert_blip_checkpoint function call.
```
a2ad9d5a
Optimize t5 tokenize logic to avoid redundant calls (#32270) · 5019aabf
leejet authored Jul 29, 2024
```
* Optimize t5 tokenize logic to avoid redundant calls

* fix and overwrite copies
```
5019aabf
Upload new model failure report to Hub (#32264) · f2122cc6
Yih-Dar authored Jul 29, 2024
```
upload
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f2122cc6

🚨

Bloom support for cache class (#31445) · f7396876

Raushan Turganbay authored Jul 29, 2024



* bloom dynamic cache

* bloom follows standard cache format

* no skips for bloom anymore

* use cache position when possible

* clean up

* codestyle

* Update src/transformers/models/bloom/modeling_bloom.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/bloom/modeling_bloom.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/bloom/modeling_bloom.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* pr comments

* isinstance fix

* address comments

* make musicgen test happy

* [run-slow] bloom

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

f7396876

27 Jul, 2024 1 commit
- Llama 3.1: replace for loop by tensor ops at inv_freq initialization (#32244) · 44f6fdd7
  Joao Gante authored Jul 27, 2024
```
* replace for loop by tensor ops

* rm assert; readability
```
  44f6fdd7
26 Jul, 2024 10 commits

More flexible trigger condition (#32251) · 8da90687
Yih-Dar authored Jul 26, 2024
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8da90687
Flash-Attn: fix generation when no attention mask or no pading (#32241) · 81233c06
Raushan Turganbay authored Jul 26, 2024
```
* fix

* fix prev test (half of failures)

* [run-slow] llama, gemma2

* [run-slow] llama, gemma2
```
81233c06

[tests] fix `static` cache implementation is not compatible with... · 27c7f971

Fanli Lin authored Jul 26, 2024

[tests] fix `static` cache implementation is not compatible with `attn_implementation==flash_attention_2` (#32039)

* add flash attention check

* fix

* fix

27c7f971

Add check for `target_sizes is None` in `post_process_image_guided_detection` for owlv2 (#31934) · 5f841c74

Connor Anderson authored Jul 26, 2024

* Add check for target_sizes is None in post_process_image_guided_detection

* Make sure Owlvit and Owlv2 in sync

* Fix incorrect indentation; add check for correct size of target_sizes

5f841c74

Adds: extra_repr for RMSNorm layers in most models (#32204) · f9756d9e

Rohit Dwivedula authored Jul 26, 2024

* adds: extra_repr() to RMSNorm layers in multiple models

* adds: extra_repr for deprecated models as well

* formatting as per style guide

f9756d9e

Refactor: Removed un-necessary `object` base class (#32230) · b8e5cd53
Sai-Suraj-27 authored Jul 26, 2024
```
* Refactored to remove un-necessary object base class.

* small fix.
```
b8e5cd53

don't log base model architecture in wandb if log model is false (#32143) · 1c7ebf1d

João Nadkarni authored Jul 26, 2024



* don't log base model architecture in wandb is log model is false

* Update src/transformers/integrations/integration_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* convert log model setting into an enum

* fix formatting

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

1c7ebf1d

Resize embeds with DeepSpeed (#32214) · c46edfb8
Raushan Turganbay authored Jul 26, 2024
```
* fix resize when deepspeed

* deepsped uses new embeds

* we needed this
```
c46edfb8
Llava: generate without images (#32183) · fad15fba
Raushan Turganbay authored Jul 26, 2024
```
* llava w/o images

* tests
```
fad15fba

Generation: stop at `eos` for assisted decoding (#31301) · 4ab33c2d

Raushan Turganbay authored Jul 26, 2024



* fix

* move changes to prompt lookup

* add test

* set eos in assistant model

* style

* fix flakiness

* changes for new `main`

* Update tests/generation/test_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/generation/test_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add comment to explain

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

4ab33c2d

25 Jul, 2024 8 commits
- Fix code snippet for Grounding DINO (#32229) · 9d6c0641
  Pavel Iakubovskii authored Jul 25, 2024
```
Fix code snippet for grounding-dino
```
  9d6c0641
- Allow a specific microphone to be used by the ffmpeg audio pipeline utility... · 3a83ec48
  jrhe authored Jul 25, 2024
```
Allow a specific microphone to be used by the ffmpeg audio pipeline utility functions. Default to using the currently active microphone on Mac (#31846)

* use currently active microphone on mac for ffmpeg_microphone

* Allow ffmpeg_microphone device to be specified
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
```
  3a83ec48
- translate philosophy.md to chinese (#32177) · 6ed0bf1e
  Huazhong Ji authored Jul 26, 2024
```
* translate philosophy.md to chinese

* add the missing link
```
  6ed0bf1e
- Follow up for #31973 (#32025) · df6eee92
  Yih-Dar authored Jul 25, 2024
```
* fix

* [test_all] trigger full CI

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  df6eee92
- [warnings] fix E721 warnings (#32223) · de231889
  Kashif Rasul authored Jul 25, 2024
```
fix E721 warnings
```
  de231889
- [BigBird Pegasus] set _supports_param_buffer_assignment to False (#32222) · 9b9a54e6
  Kashif Rasul authored Jul 25, 2024
```
set _supports_param_buffer_assignment to False
```
  9b9a54e6
- Update question_answering.py (#32208) · 1ecedf1d
  Austin authored Jul 25, 2024
  
  1ecedf1d
- remove unnecessary guard code related with pytorch versions 1.4.2 ~ 1.7.0 (#32210) · f53a5dec
  Huazhong Ji authored Jul 25, 2024
```
remove unnecessary guard code related with pytorch versions 1.4.2 ~
1.7.0
```
  f53a5dec