Commits · 946bac798caefada3f5f1c9fecdcfd587ed24ac7 · chenpangpang / transformers

27 Sep, 2023 5 commits
- add bf16 mixed precision support for NPU (#26163) · 946bac79
  statelesshz authored Sep 27, 2023
```
Co-authored-by: statelesshz <jihuazhong1@huawei.com>
```
- [`FA` / `tests`] Add use_cache tests for FA models (#26415) · 153755ee
  Younes Belkada authored Sep 27, 2023
```
* add use_cache tests for FA

* fixup
```
- Fixing tokenizer when `transformers` is installed without `tokenizers` (#26236) · a0be960d
  Uri Alon authored Sep 27, 2023
```
* Fixing tokenizer when tokenizers is not installed

* Adding __repr__ function and repr=True in dataclass

* Revert "Adding __repr__ function and repr=True in dataclass"

This reverts commit 18839505d1cada3170ed623744d3e75008a18bdc.
```
- Update semantic_segmentation.md (#26419) · 777f2243
  Nour Eddine ZEKAOUI authored Sep 27, 2023
  
- Fix padding for IDEFICS (#26396) · abd25310
  Shauray Singh authored Sep 27, 2023
```
* fix

* fixup

* tests

* fixup
```
26 Sep, 2023 7 commits

Add torch `RMSProp` optimizer (#26425) · 408b2b3c
Nathan Lambert authored Sep 26, 2023
```
add rmsprop
```

[InternLM] Add support for InternLM (#26302) · 6ba63ac3

Matt authored Sep 26, 2023

* Add config.bias to LLaMA to allow InternLM models to be ported as LLaMA checkpoints

* Rename bias -> attention_bias and add docstring


Fix DeepSpeed issue with Idefics (#26393) · 0ac38750
Hugo Laurençon authored Sep 26, 2023
```
Fix deepspeed issue with Idefics
```
added support for gradient checkpointing in ESM models (#26386) · 6ce6a5ad
sanjeevk-os authored Sep 26, 2023

Deleted duplicate sentence (#26394) · a8531f3b
titi authored Sep 26, 2023

[ViTMatte] Add resources (#26317) · a09130fe
NielsRogge authored Sep 26, 2023
```
Add resource
```

Add Nougat (#25942) · ace74d16

NielsRogge authored Sep 26, 2023



* Add conversion script

* Add NougatImageProcessor

* Add crop margin

* More improvements

* Add docs, READMEs

* Remove print statements

* Include model_max_length

* Add NougatTokenizerFast

* Fix imports

* Improve postprocessing

* Improve image processor

* Fix image processor

* Improve normalize method

* More improvements

* More improvements

* Add processor, improve docs

* Simplify fast tokenizer

* Remove test file

* Fix docstrings

* Use NougatProcessor in conversion script

* Add is_levensthein_available

* Add tokenizer tests

* More improvements

* Use numpy instead of opencv

* Add is_cv2_available

* Fix cv2_available

* Add is_nltk_available

* Add image processor tests, improve crop_margin

* Add integration tests

* Improve integration test

* Use do_rescale instead of hacks, thanks Amy

* Remove random_padding

* Address comments

* Address more comments

* Add import

* Address more comments

* Address more comments

* Address comment

* Address comment

* Set max_model_input_sizes

* Add tests

* Add requires_backends

* Add Nougat to exotic tests

* Use to_pil_image

* Address comment regarding nltk

* Add NLTK

* Improve variable names, integration test

* Add test

* refactor, document, and test regexes

* remove named capture groups, add comments

* format

* add non-markdown fixed tokenization

* format

* correct flakyness of args parse

* add regex comments

* test functionalities for crop_image, align long axis and expected output

* add regex tests

* remove cv2 dependency

* test crop_margin equality between cv2 and python

* refactor table regexes to markdown

add newline

* change print to log, improve doc

* fix high count tables correction

* address PR comments: naming, linting, asserts

* Address comments

* Add copied from

* Update conversion script

* Update conversion script to convert both small and base versions

* Add inference example

* Add more info

* Fix style

* Add require annotators to test

* Define all keyword arguments explicitly

* Move cv2 annotator

* Add tokenizer init method

* Transfer checkpoints

* Add reference to Donut

* Address comments

* Skip test

* Remove cv2 method

* Add copied from statements

* Use cached_property

* Fix docstring

* Add file to not doctested

---------
Co-authored-by: Pablo Montalvo <pablo.montalvo.leroux@gmail.com>


25 Sep, 2023 6 commits

🌐 [i18n-KO] Translated `audio_classification.mdx` to Korean (#26200) · 5e09af2a

Gabriel Yang authored Sep 26, 2023

* 🌐 [i18n-KO] Translated `audio_classification.mdx` to Korean

* update translation

* fix some sentence editing and fixing punctuation

* Update docs/source/ko/_toctree.yml
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* Apply suggestions from code review
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>


Add Russian localization for README (#26208) · 033ec57c

qweme32 authored Sep 25, 2023



* Add Russian localization

* typo

* mistake in link

* Update README_ru.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update README_ru.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>


Update tiny model information and pipeline tests (#26285) · d9e4bc28

Yih-Dar authored Sep 25, 2023



* Update tiny model summary file

* add to pipeline tests

* revert

* fix import

* fix import

* fix

* fix

* update

* update

* update

* fix

* remove BarkModelTest

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


[docs] removed MaskFormerSwin and TimmBackbone from the table on index.md (#26347) · 546e7679
Maria Khalusova authored Sep 25, 2023
```
removed MaskFormerSwin and TimmBackbone from the table
```
Fix MusicGen logging error (#26370) · 0ee45906
Omar Sanseviero authored Sep 25, 2023
```
* Fix logging error

* Update modeling_musicgen.py

* Update modeling_musicgen.py
```
Update add_new_model.md (#26365) · 6accd5ef
Nino Risteski authored Sep 25, 2023
```
fixed typos
```

22 Sep, 2023 9 commits

Fixed unclosed p tags (#26240) · 5936c8c5
HanSeokhyeon authored Sep 23, 2023


feat: adding num_proc to load_dataset (#26326) · 910faa3e

Phuc Van Phan authored Sep 23, 2023

* feat: adding num_proc to load_dataset

* feat: add add_num_proc for run_mlm_flax

* feat: add num_proc for bart and t5

* chorse: remove


Add image to image pipeline (#25393) · 576cd45a

LeviVasconcelos authored Sep 22, 2023



* Add image to image pipeline

Add image to image pipeline

* remove swin2sr from tf auto

* make ImageToImage importable

* make style

make style

make style

make style

* remove tf support

* remove nonused imports

* fix postprocessing

* add important comments; add unit tests

* add documentation

* remove support for TF

* make fixup

* fix typehint Image.Image

* fix documentation code

* address review request; fix unittest type checking

* address review request; fix unittest type checking

* make fixup

* address reviews

* Update src/transformers/pipelines/image_to_image.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* enhance docs

* make style

* make style

* improve docetest time

* improve docetest time

* Update tests/pipelines/test_pipelines_image_to_image.py
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* Update tests/pipelines/test_pipelines_image_to_image.py
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* make fixup

* undo faulty merge

* undo faulty merge

* add image-to-image to test pipeline mixin

* Update src/transformers/pipelines/image_to_image.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_image_to_image.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* improve docs

---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>


[TTA Pipeline] Fix MusicGen test (#26348) · 914771cb
Sanchit Gandhi authored Sep 22, 2023
```
* fix musicgen pipeline test

* fix wav2vec2 doctest

* revert wav2vec2
```

[`core` ] Integrate Flash attention 2 in most used models (#25598) · 368a58e6

Younes Belkada authored Sep 22, 2023



* v1

* oops

* working v1

* fixup

* add some TODOs

* fixup

* padding support + try with module replacement

* nit

* alternative design

* oops

* add `use_cache` support for llama

* v1 falcon

* nit

* a bit of refactor

* nit

* nits nits

* add v1 padding support falcon (even though it seemed to work before)

* nit

* falcon works

* fixup

* v1 tests

* nit

* fix generation llama flash

* update tests

* fix tests + nits

* fix copies

* fix nit

* test- padding mask

* stype

* add more mem efficient support

* Update src/transformers/modeling_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fixup

* nit

* fixup

* remove it from config when saving

* fixup

* revert docstring

* add more checks

* use values

* oops

* new version

* fixup

* add same trick for falcon

* nit

* add another test

* change tests

* fix issues with GC and also falcon

* fixup

* oops

* Update src/transformers/models/falcon/modeling_falcon.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* add init_rope

* updates

* fix copies

* fixup

* fixup

* more clarification

* fixup

* right padding tests

* add docs

* add FA in docker image

* more clarifications

* add some figures

* add todo

* rectify comment

* Change to FA2

* Update docs/source/en/perf_infer_gpu_one.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* split in two lines

* change test name

* add more tests

* some clean up

* remove `rearrange` deps

* add more docs

* revert changes on dockerfile

* Revert "revert changes on dockerfile"

This reverts commit 8d72a66b4b9b771abc3f15a9b9506b4246d62d8e.

* revert changes on dockerfile

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <hi@lysand.re>

* address some comments

* docs

* use inheritance

* Update src/transformers/testing_utils.py
Co-authored-by: Lysandre Debut <hi@lysand.re>

* fixup

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/modeling_utils.py

* final comments

* clean up

* style

* add cast + warning for PEFT models

* fixup

---------
Co-authored-by: Felix Marty <9808326+fxmarty@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre Debut <hi@lysand.re>


[doc] fixed indices in obj detection example (#26343) · dcbfd93d
Maria Khalusova authored Sep 22, 2023
```
fixed indexes in obj detection example
```

Fix doctest CI (#26324) · c3ecf2d9

Yih-Dar authored Sep 22, 2023



fix doc CI
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


Use CircleCI `store_test_results` (#26223) · 06ee91ae
Yih-Dar authored Sep 22, 2023
```
store_test_results
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```

[QUICK FIX LINK] Update trainer.py (#26293) · 587b7b16

Gema Parreño authored Sep 22, 2023



* Update trainer.py

Fix link

* Update src/transformers/trainer.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update trainer.py

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>


21 Sep, 2023 5 commits

More error message fixup, plus some linebreaks! (#26296) · 000e52ae

Matt authored Sep 21, 2023



* More error message fixup, plus some linebreaks!

* Update src/transformers/dynamic_module_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/dynamic_module_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/dynamic_module_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>


Porting the torchaudio kaldi fbank implementation to audio_utils (#26182) · 9a307534

Yoach Lacombe authored Sep 21, 2023



* add kaldi fbank

* make style

* add herz_to_mel_kaldi tests

* add mel to hertz kaldi test

* integration tests

* correct test and remove comment

* make style

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* change parameter name

* Apply suggestions from Arthur review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update remove_dc_offset description

* fix bug  + make style

* fix error in using np.exp instead of np.power

* make style

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>


update hf hub dependency to be compatible with the new tokenizers (#26301) · b132c170
Arthur authored Sep 21, 2023

Fix FSMT weight sharing (#26292) · 26ba56cc
Lysandre Debut authored Sep 21, 2023


Keep relevant weights in fp32 when `model._keep_in_fp32_modules` is set even when `accelerate` is not installed (#26225) · da971b22

fxmarty authored Sep 21, 2023

* fix bug where weight would not be kept in fp32

* nit

* address review comments

* fix test


20 Sep, 2023 8 commits

add custom RMSNorm to `ALL_LAYERNORM_LAYERS` (#26227) · e3a4bd2b

Shijie Wu authored Sep 20, 2023

* add LlamaRMSNorm to ALL_LAYERNORM_LAYERS

* fixup

* add IdeficsRMSNorm to ALL_LAYERNORM_LAYERS and fixup


[`Trainer`] Refactor trainer + bnb logic (#26248) · 0b5024ce
Younes Belkada authored Sep 20, 2023
```
* refactor trainer + bnb logic

* remove logger.info

* oops
```
include changes from llama (#26260) · f94c9b3d
Arthur authored Sep 20, 2023
```
* include changes from llama

* add a test
```
add bbox input validation (#26294) · 00247ea0
Jinho Park authored Sep 20, 2023

fix deepspeed available detection (#26252) · 24553206
fxmarty authored Sep 20, 2023

Rewrite for custom code warning messages (#26291) · f29fe745
Matt authored Sep 20, 2023
```
Quick britpicking for some warning messages!
```

Integrate AMD GPU in CI/CD environment (#26007) · 2d71307d

Funtowicz Morgan authored Sep 20, 2023

* Add a Dockerfile for PyTorch + ROCm based on official AMD released artifact

* Add a new artifact single-amdgpu testing on main

* Attempt to test the workflow without merging.

* Changed BERT to check if things are triggered

* Meet the dependencies graph on workflow

* Revert BERT changes

* Add check_runners_amdgpu to correctly mount and check availability

* Rename setup to setup_gpu for CUDA and add setup_amdgpu for AMD

* Fix all the needs.setup -> needs.setup_[gpu|amdgpu] dependencies

* Fix setup dependency graph to use check_runner_amdgpu

* Let's do the runner status check only on AMDGPU target

* Update the Dockerfile.amd to put ourselves in / rather than /var/lib

* Restore the whole setup for CUDA too.

* Let's redisable them

* Change BERT to trigger tests

* Restore BERT

* Add torchaudio with rocm 5.6 to AMD Dockerfile (#26050)

fix dockerfile
Co-authored-by: Felix Marty <felix@hf.co>

* Place AMD GPU tests in a separate workflow (correct branch) (#26105)

AMDGPU CI lives in an other workflow

* Fix invalid job name is dependencies.

* Remove tests multi-amdgpu for now.

* Use single-amdgpu

* Use --net=host for now.

* Remote host networking.

* Removed duplicated check_runners_amdgpu step

* Let's tag machine-types with mi210 for now.

* Machine type should be only mi210

* Remove unnecessary push.branches item

* Apply review suggestions moving from `x-amdgpu` to `x-gpu` introducing `amd-gpu` and `miXXX` labels.

* Remove amdgpu from step names.

* finalize

* delete

---------
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>
Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


Update bros checkpoint (#26277) · 37c205eb
Jinho Park authored Sep 20, 2023
```
* fix bros integration test

* update bros checkpoint
```