- 02 Oct, 2023 7 commits
-
-
Younes Belkada authored
* fix bnb test with code revision * fix test * Apply suggestions from code review * Update src/transformers/models/auto/auto_factory.py * Update src/transformers/models/auto/auto_factory.py * Update src/transformers/models/auto/auto_factory.py
-
Younes Belkada authored
* try * nit * nits
-
HelgeS authored
-
marcmk6 authored
* fix issue of CANINE forward requiring input_ids in all cases The `forward` previously required `input_ids` to derive other variables even when `inputs_embeds` was given. Change this to use whichever of `input_ids` and `inputs_embeds` was actually provided. * fix canine forward The current `forward` requires (the shape of) `input_ids` for deriving other variables whenever `input_ids` or `inputs_embeds` is provided. Change this to derive them from the given input instead of from `input_ids` all the time. * fix format * fix format
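A minimal sketch of the pattern behind this fix, with a hypothetical helper name rather than the actual CANINE code: derive the sequence shape from whichever input was actually provided.

```python
import torch

def resolve_input_shape(input_ids=None, inputs_embeds=None):
    # Mirror of the usual transformers guard: exactly one input source.
    if input_ids is not None and inputs_embeds is not None:
        raise ValueError("You cannot specify both input_ids and inputs_embeds")
    if input_ids is not None:
        return input_ids.size()            # (batch_size, seq_len)
    if inputs_embeds is not None:
        return inputs_embeds.size()[:-1]   # drop the hidden dimension
    raise ValueError("You have to specify either input_ids or inputs_embeds")

print(resolve_input_shape(inputs_embeds=torch.zeros(2, 8, 16)))  # torch.Size([2, 8])
```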
-
Jan Philipp Harries authored
fix requests connection error Co-authored-by: Jan Philipp Harries <jphme@users.noreply.github.com>
-
Florian Seiler authored
* Fix num_heads in _upad_input The variable num_key_value_heads had incorrectly been named num_heads, which led to reshaping the query_layer using the wrong attention head count. (It would have been enough to use the correct variable self.num_heads instead of num_heads, but num_heads was renamed to num_key_value_heads for clarity.) * fixed copies using make fix-copies and ran make fixup --------- Co-authored-by: fseiler <f.seiler@jerocom.de>
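A simplified illustration of why the two head counts must not be conflated under grouped-query attention; the shapes are made up and this is not the library's `_upad_input`:

```python
import torch

batch, seq_len, head_dim = 2, 16, 64
num_heads, num_key_value_heads = 32, 8   # GQA: fewer key/value heads than query heads

query = torch.randn(batch * seq_len, num_heads * head_dim)
key = torch.randn(batch * seq_len, num_key_value_heads * head_dim)

# Each tensor must be reshaped with its own head count; reshaping the queries
# with the key/value head count (the old bug) miscounts the query heads.
query = query.view(batch * seq_len, num_heads, head_dim)
key = key.view(batch * seq_len, num_key_value_heads, head_dim)
```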
-
Lysandre Debut authored
* Revert "Falcon: fix revision propagation (#26006)" This reverts commit 118c676ef3124423e5d062b665f05cde55bc9a90. * Revert "Put Falcon back (#25960)" This reverts commit 22a69f1d.
-
- 29 Sep, 2023 6 commits
-
-
Sanchit Gandhi authored
* improve docs/errors * why whisper * Update docs/source/en/pipeline_tutorial.md Co-authored-by: Lysandre Debut <hi@lysand.re> * specify pt only --------- Co-authored-by: Lysandre Debut <hi@lysand.re>
-
Sanchit Gandhi authored
* from seq2seq speech * [Flax] Example script for speech seq2seq * tests and fixes * make style * fix: label padding tokens * fix: label padding tokens over list * update ln names for Whisper * try datasets iter loader * create readme and append results * style * make style * adjust lr * use pt dataloader * make fast * pin gen max len * finish * add pt to requirements for test * fix pt -> torch * add accelerate
-
Yih-Dar authored
fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
skip Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Maria Khalusova authored
* navigation improvement between text generation pipelines and text generation docs * make style
-
Steven Liu authored
update
-
- 28 Sep, 2023 10 commits
-
-
Sanchit Gandhi authored
make decoding faster
-
Amelie Schreiber authored
* Fixed in-place operation error in EsmEmbeddings * Fixed in-place operation error in EsmEmbeddings again --------- Co-authored-by: Schreiber-Finance <amelie.schreiber.finance@gmail.com>
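A generic reproduction of this class of bug, not the ESM code itself: an in-place update on a tensor that autograd saved for the backward pass makes `backward()` fail.

```python
import torch

x = torch.ones(3, requires_grad=True)
y = x.exp()        # exp() saves its output for the backward pass
# y += 1           # in-place: RuntimeError when backward() runs
y = y + 1          # out-of-place rewrite keeps the saved tensor intact
y.sum().backward()
print(x.grad)      # tensor([2.7183, 2.7183, 2.7183])
```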
-
Marc Sun authored
* fix_mbart_tied_weights * add test
-
fleance authored
Do not warn about unexpected decoder weights when loading T5EncoderModel and LongT5EncoderModel (#26211) Ignore decoder weights when using T5EncoderModel and LongT5EncoderModel Both T5EncoderModel and LongT5EncoderModel do not have any decoder layers, so loading a pretrained model checkpoint such as t5-small will give warnings about keys found in the model checkpoint that are not in the model itself. To prevent this log warning, r"decoder" has been added to _keys_to_ignore_on_load_unexpected for both T5EncoderModel and LongT5EncoderModel
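A quick way to observe the intended behavior (this downloads the checkpoint):

```python
from transformers import T5EncoderModel

# With r"decoder" in _keys_to_ignore_on_load_unexpected, loading the full
# encoder-decoder checkpoint into the encoder-only class no longer warns
# about the checkpoint's decoder.* weights.
model = T5EncoderModel.from_pretrained("t5-small")
```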
-
Younes Belkada authored
[`PEFT`] introducing `adapter_kwargs` for loading adapters from a different Hub location (`subfolder`, `revision`) than the base model (#26270) * make use of adapter_revision * v1 adapter kwargs * fix CI * fix CI * fix CI * fixup * add BC * Update src/transformers/integrations/peft.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup * change it to error * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py * fixup * change * Update src/transformers/integrations/peft.py --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
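A hedged usage sketch: the repo id and layout below are placeholders, while `adapter_kwargs` is the new argument this change introduces.

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "some-org/model-with-adapter",                            # placeholder repo id
    adapter_kwargs={"subfolder": "adapter", "revision": "v2"},  # adapter lives elsewhere
)
```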
-
Fakhir Ali authored
* [VITS] Fix speaker_embed device mismatch - pass device arg to speaker_id tensor * [VITS] put speaker_embed on device when int * [VITS] device=self.device instead of self.embed_speaker.weight.device * [VITS] make tensor directly on device using torch.full()
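The shape of the final fix as a standalone snippet (the surrounding VITS code is omitted): build the tensor on the right device from the start with `torch.full()`.

```python
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
speaker_id = 3

# Allocating directly on the target device avoids handing a CPU tensor to an
# embedding layer whose weights live on the GPU (the reported mismatch).
speaker_ids = torch.full((1,), speaker_id, dtype=torch.long, device=device)
```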
-
Tanishq Abraham authored
* change mentions of decoder_input_ids to input_ids, and likewise for decoder_input_embeds * Style --------- Co-authored-by: Lysandre <lysandre@huggingface.co>
-
Phuc Van Phan authored
* docs: change assert to raise and some small doc fixes * docs: add the rule and some documentation * fix: fix bug * fix: fix bug * chore: revert logging * chore: revert
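The rule in miniature, with a hypothetical function: user-facing checks should raise typed exceptions, since `python -O` strips `assert` statements entirely.

```python
def set_temperature(value: float) -> float:
    # assert value > 0, "temperature must be positive"   # discouraged
    if value <= 0:
        raise ValueError(f"`temperature` must be positive, got {value}")
    return value
```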
-
Yih-Dar authored
* fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Norm Inui authored
* optimize LayoutLMv2 and v3 to save VRAM * reformat code --------- Co-authored-by: NormXU <xunuo@datagrand.com>
-
- 27 Sep, 2023 12 commits
-
-
Wonhyeong Seo authored
* docs: ko: perf_train_gpu_many.mdx * feat: chatgpt draft * fix: manual edits * fix: resolve suggestions (change the description, follow the glossary, fix discrepancies) Co-Authored-By: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-Authored-By: 이서정 <97655267+sjlee-wise@users.noreply.github.com> Co-Authored-By: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Hyunho <105839613+hyunhp@users.noreply.github.com> Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> Co-authored-by: 이서정 <97655267+sjlee-wise@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Wonhyeong Seo authored
* docs: ko: Debugging.md * feat: chatgpt draft * fix: resolve suggestions Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-Authored-By: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Jang KyuJin <106062329+kj021@users.noreply.github.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Florian Zimmermeister authored
* initial * toctree * add tf model * run scripts * peft * llm and agents * Update docs/source/de/peft.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/peft.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/peft.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/run_scripts.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/run_scripts.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/transformers_agents.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/de/transformers_agents.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Yih-Dar authored
* update * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Lysandre Debut authored
* Fix doctest * Adding modeling also for now
-
Chris Bamford authored
* [Mistral] Mistral-7B-v0.1 support * fixing names * slightly longer test * fixups * not_doctested * wrongly formatted references * make fixup --------- Co-authored-by: Timothee Lacroix <t@eugen.ai> Co-authored-by: timlacroix <t@mistral.ai>
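A minimal load of the newly supported checkpoint; illustrative only, since the weights are large:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")
model = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")
```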
-
Younes Belkada authored
* fix PEFT multi-adapter support * refactor a bit * save pretrained + BC + added tests * Update src/transformers/integrations/peft.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * add more tests * add suggestion * final changes * adapt a bit * fixup * Update src/transformers/integrations/peft.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * adapt from suggestions --------- Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
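A sketch of the multi-adapter flow this fixes; the repo ids are placeholders, while `load_adapter`/`set_adapter` are the PEFT-integration entry points on the model:

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("some-org/base-model")   # placeholder
model.load_adapter("some-org/adapter-a", adapter_name="adapter_a")    # placeholder
model.load_adapter("some-org/adapter-b", adapter_name="adapter_b")    # placeholder
model.set_adapter("adapter_a")  # route forward passes through one adapter at a time
```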
-
statelesshz authored
Co-authored-by: statelesshz <jihuazhong1@huawei.com>
-
Younes Belkada authored
* add use_cache tests for FA * fixup
-
Uri Alon authored
* Fixing tokenizer when the `tokenizers` library is not installed * Adding __repr__ function and repr=True in dataclass * Revert "Adding __repr__ function and repr=True in dataclass" This reverts commit 18839505d1cada3170ed623744d3e75008a18bdc.
-
Nour Eddine ZEKAOUI authored
-
Shauray Singh authored
* fix * fixup * tests * fixup
-
- 26 Sep, 2023 5 commits
-
-
Nathan Lambert authored
add rmsprop
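Assuming this wires RMSprop into the Trainer's optimizer choices, selecting it would look like the following; the `optim="rmsprop"` value is an inference from the commit title, not confirmed by it:

```python
from transformers import TrainingArguments

# "rmsprop" as an `optim` value is an assumption based on this commit.
args = TrainingArguments(output_dir="out", optim="rmsprop")
```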
-
Matt authored
* Add config.bias to LLaMA to allow InternLM models to be ported as LLaMA checkpoints * Rename bias -> attention_bias and add docstring
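Usage of the new flag when porting an InternLM-style checkpoint; everything besides `attention_bias` is left at its default:

```python
from transformers import LlamaConfig

# InternLM uses bias terms in its attention projections; stock LLaMA does not.
config = LlamaConfig(attention_bias=True)
```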
-
Hugo Laurençon authored
Fix deepspeed issue with Idefics
-
sanjeevk-os authored
-
titi authored
-