Commits · 627f44799a9f4948a6a1b8fe9e536eee0e29ea68 · chenpangpang / transformers

09 May, 2023 10 commits

[Doctests] Refactor doctests + add CI (#22987) · 627f4479

Arthur authored May 10, 2023



* intiial commit

* new styling

* update

* just run doctest in CI

* remove more test for fast dev

* update

* update refs

* update path and fetch upstream

* update documentatyion trests

* typo

* parse pwd

* don't check for files that are in hidden folders

* just give paths relative to transformers

* update

* update

* update

* major refactoring

* make sure options is ok

* lest test that mdx is tested

* doctest glob

* nits

* update doctest nightly

* some cleaning

* run correct test on diff

* debug

* run on a single worker

* skip_cuda_test tampkate

* updates

* add rA and continue on failure

* test options

* parse `py` codeblock?

* we don't need to replace ignore results, don't remember whyu I put it

* cleanup

* more cleaning

* fix arg

* more cleaning

* clean an todo

* more pre-processing

* doctest-module has none so extra `- ` is needed

* remove logs

* nits

* doctest-modules ....

* oups

* let's use sugar

* make dataset go quiet

* add proper timeout

* nites

* spleling timeout

* update

* properly skip tests that have CUDSA

* proper skipping

* cleaning main and get tests to run

* remove make report?

* remove tee

* some updates

* tee was removed but is the full output still available?

* [all-test]

* only our tests

* don't  touch tee in this PR

* no atee-sys

* proper sub

* monkey

* only replace call

* fix sub

* nits

* nits

* fix invalid syntax

* add skip cuda doctest env variable

* make sure all packages are installed

* move file

* update check repo

* revert changes

* nit

* finish cleanup

* fix re

* findall

* update don't test init files

* ignore pycache

* `-ignore-pycache` when running pytests

* try to fix the import missmatch error

* install dec

* pytest is required as doctest_utils imports things from it

* the only log issues were dataset, ignore results should work

* more cleaning

* Update .circleci/create_circleci_config.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* [ydshieh] empty string if cuda is found

* [ydshieh] fix condition

* style

* [ydshieh] fix

* Add comment

* style

* style

* show failure

* trigger CI

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

627f4479

Support ratios for `logging_steps`, `eval_steps`, and `save_steps` (#23235) · 650a71e1

Konstantin Dobler authored May 09, 2023



* Ratio option for `logging_steps`, `eval_steps`, `save_steps`

* Add guards if arguments are not set

* Add more detailed comments + formatting

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Convert args values to `int` if bigger than 1

* `black`

* `make fixup`

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

650a71e1

Proposed fix for TF example now running on safetensors. (#23208) · c34a525d

Nicolas Patry authored May 09, 2023



* Proposed fix for TF example now running on safetensors.

* Adding more warnings and returning keys.

* Trigger CI

* Trigger CI

---------
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

c34a525d

Add RWKV-4 (#22797) · b4d4d6fe

Sylvain Gugger authored May 09, 2023



* First draft of RWKV-4

* Add support for generate

* Style post-rebase

* Properly use state

* Write doc

* Fix doc

* More math

* Add model to README, dummies and clean config

* Fix init

* multiple fixes:

- fix common tests
- fix configuraion default values
- add CI test for checking state computation
- fix some CI tests

* correct tokenizer

* some tweaks

- fix config docstring
- fix failing tests

* fix CI tests

- add output_attention / output_hidden_states
- override test_initialization
- fix failing CIs

* fix conversion script

- fix sharded case
- add new arguments

* add slow tests + more fixes on conversion script

* add another test

* final fixes

* change single name variable

* add mock attention mask for pipeline to work

* correct eos token id

* fix nits

* add checkpoints

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add `tie_word_embeddings` in docstring

* change tensor name

* fix final nits

* Trigger CI

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

b4d4d6fe

Add Japanese translation to accelerate.mdx (#23232) · 9a50cb61
Rustin Welter authored May 09, 2023
```
Co-authored-by: rustinwelter <rustinwelter.alwp9@slmails.com>
```
9a50cb61
fix: Update run_qa.py to work with deepset/germanquad (#23225) · 1a8f6111
Sebastian authored May 09, 2023
```
Call str on id to make sure any ints are converted into the expected format for squad datasets
```
1a8f6111
Fix typo ; Update output.mdx (#23227) · 51ae5665
Furkan Akkurt authored May 09, 2023

51ae5665

make opt checkpoint dir name correct (#21660) · e02a8065

dumpmemory authored May 09, 2023

make opt checkpoint dir name corrent following https://github.com/huggingface/Megatron-LM/blob/100b522bb8044d98413398f9e71563af15b83325/megatron/checkpointing.py#L117

e02a8065

audio_utils improvements (#21998) · 7f919509

Matthijs Hollemans authored May 09, 2023

* silly change to allow making a PR

* clean up doc comments

* simplify hertz_to_mel and mel_to_hertz

* fixup

* clean up power_to_db

* also add amplitude_to_db

* move functions

* clean up mel_filter_bank

* fixup

* credit librosa & torchaudio authors

* add unit tests

* tests for power_to_db and amplitude_to_db

* add mel_filter_bank tests

* rewrite STFT

* add convenience spectrogram function

* missing transpose

* fewer transposes

* add integration test to M-CTC-T

* frame length can be either window or FFT length

* rewrite stft API

* add preemphasis coefficient

* move argument

* add log option to spectrogram

* replace M-CTC-T feature extractor

* fix api thing

* replace whisper STFT

* replace whisper mel filters

* replace tvlt's stft

* allow alternate window names

* replace speecht5 stft

* fixup

* fix integration tests

* fix doc comments

* remove manual FFT length calculation

* fix docs

* go away, deprecation warnings

* combine everything into spectrogram function

* add deprecated functions back

* fixup

7f919509

[SAM] Add resources (#23224) · 431b04d8
NielsRogge authored May 09, 2023
```
Add resources
```
431b04d8

08 May, 2023 6 commits
- Pin tensorflow-probability (#23220) · 006da469
  Sylvain Gugger authored May 08, 2023
```
* Pin tensorflow-probability

* [all-test]

* [all-test] Fix syntax for bash
```
  006da469
- docs: Fix broken link in 'How to add a model...' (#23216) · 188a8bfc
  Connor Henderson authored May 08, 2023
```
fix link
```
  188a8bfc
- New version of Accelerate for the Trainer (#23204) · 94056b57
  Sylvain Gugger authored May 08, 2023
  
  94056b57
- Skip failing test · fd6970bc
  Sylvain Gugger authored May 08, 2023
  
  fd6970bc
- Fixing class embedding selection in owl-vit (#23157) · 843fdf2e
  Orr Zohar authored May 08, 2023
```
fixing class embedding selection in owl-vit
```
  843fdf2e
- Generate: starcoder 🤜 🤛 assisted generation (#23182) · bbfb9fc2
  Joao Gante authored May 08, 2023
```
* starcoder has joined the chat

* indexing that works for all
```
  bbfb9fc2
07 May, 2023 3 commits

Fix hf_argparser.parse_json_file to open file with utf-8 encoding, close file... · dbc12269

Robert Baruch authored May 07, 2023

Fix hf_argparser.parse_json_file to open file with utf-8 encoding, close file when finished (#23194)

* Open json args in utf-8 encoding, close file when finished

* black formatted

dbc12269

fix random attention for pytorch's bigbird/pegasus_bigbird (#23056) · 6f8a0284

Bartosz Szmelczynski authored May 08, 2023

* fix random attention usage for bigbird and pegasus_bigbird

* remove staticmethod, update tests target valus

* revert style changes

6f8a0284

Update LLaMA docs with arxiv link (#23191) · ef0c380c
Ashwin Mathur authored May 08, 2023
```
* Update docs with arxiv link

* Update llama model docs
```
ef0c380c

06 May, 2023 1 commit
- search buffers for dtype (#23159) · ef42c2c4
  cyy authored May 06, 2023
  
  ef42c2c4
05 May, 2023 6 commits

Add FlaxWhisperForAudioClassification model (#23173) · 312b104f

raghavanone authored May 05, 2023

* Add FlaxWhisperForAudioClassification model

* Add models to init

* Add models to init

* Fix copies

* Fix automapping

* Fix failing test

312b104f

Add `no_trainer` scripts to pre-train Vision Transformers (#23156) · fc6c8b0e

Ashwin Mathur authored May 05, 2023



* Add run_mim_no_trainer.py draft from #20412

Add parse_args method and copy over other dependencies

Add Method call for sending telemetry

Initialize Accelerator

Make one log on every process

Set seed and Handle repository creation

Initialize dataset and Set validation split

Create Config

Adapt Config

Update Config

Create Feature Extractor

Create model

Set column names

Create transforms

Create mask generator

Create method to preprocess images

Shuffle datasets if needed and set transforms

Create Dataloaders

Add optimizer

Add learning rate scheduler

Prepare everything with our accelerator

Tie weights for TPU training

Recalculate training steps and training epochs

Set accelerator checkpointing steps

Initialize trackers and store configuration

Set total batch size

Fix typo: mlm -> mim

Log info at the start of training

Load in the weights and states from previous save

update the progress_bar if load from checkpoint

Define train loop

Add evaluation loop to training

Add to parse_args method

Push repo to hub

Save accelerator state

End training and save model and feature extractor

Remove unused imports

Fix trailing whitespace

* Update code based on comments, Rename feature_extractor to image_processor

* Fix linting

* Add argument for learning rate

* Add argument for setting number of training epochs

* Remove incorrect logger argument

* Convert max_train_steps to int for tqdm

---------
Co-authored-by: Saad Mahmud <shuvro.mahmud79@gmail.com>

fc6c8b0e

fix: Passing language as acronym to Whisper generate (#23141) · 17083b9b
Connor Henderson authored May 05, 2023
```
* add fix

* address comments

* remove error formatting
```
17083b9b
🌐 [i18n-KO] docs: ko: Translate `multiple_choice.mdx` (#23064) · 40082d59
Gabriel Yang authored May 06, 2023
```
* update doctree

* doc: ko: translate multiple choice

* Update reviews
```
40082d59
fixed whisper positional encoding (#23167) · 77412343
Andrei Filatov authored May 05, 2023

77412343
Add TrOCR resources (#23142) · 1b9c352e
Perry Huang authored May 05, 2023
```
* Add TrOCR resources

* Made fixes suggested by stevhliu
```
1b9c352e

04 May, 2023 12 commits

Revert "Add FlaxWhisperForAudioClassification model" (#23154) · 01734dba
Sylvain Gugger authored May 04, 2023
```
Revert "Add FlaxWhisperForAudioClassification model (#22883)"

This reverts commit c8f2c5c5.
```
01734dba
Generate: text generation pipeline no longer emits `max_length` warning when it is not set (#23139) · b369e507
Joao Gante authored May 04, 2023

b369e507

[docs] Text to speech task guide (#23107) · 516dc630

Maria Khalusova authored May 04, 2023



* First draft

* Some polishing

* Text polishing

* added TOC entry for TTS

* make style

* added links to images

* fixed links to images

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* feedback addressed

* feedback from Matthijs addresed

* Update docs/source/en/tasks/text-to-speech.mdx
Co-authored-by: Matthijs Hollemans <mail@hollance.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Matthijs Hollemans <mail@hollance.com>

516dc630

Add FlaxWhisperForAudioClassification model (#22883) · c8f2c5c5

raghavanone authored May 04, 2023

* Add FlaxWhisperForAudioClassification model

* Add models to init

* Add models to init

* Fix copies

* Fix automapping

c8f2c5c5

Pin urllib3 · 3341bb41
Sylvain Gugger authored May 04, 2023

3341bb41
[`GPT-J`] Fix causal mask dtype (#23147) · 57ffd8ab
Younes Belkada authored May 04, 2023
```
* fix #23136

* better fix

* same fix for `masked_bias`
```
57ffd8ab

GPTNeoXForQuestionAnswering (#23059) · 83b38fbe

peter-sk authored May 04, 2023



* first draft - gives index error in question_answering.py

* maturing

* no labels

* pipeline should know about QA

* fixing checks

* formatting

* fixed docstring

* initial commit

* formatting

* adding the class to many places

* towards less unhappy checks

* nearly there

* and gpt neox for qa

* use right model

* forgot this one

* base_model_prefix is "gpt_neox" for GPTNeoX* models

* unnecessary stuff

* Update src/transformers/models/gpt_neox/modeling_gpt_neox.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* format

* Update src/transformers/models/gpt_neox/modeling_gpt_neox.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* removed gpt2 stuff

---------
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

83b38fbe

gpt2 multi-gpu fix (#23149) · 510ad0a8
peter-sk authored May 04, 2023
```
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
```
510ad0a8
fix resume fsdp (#23111) · adb0760b
Qingyang Wu authored May 04, 2023
```
* fix resume fsdp

* fix rank 0 loading

* fix style and quality
```
adb0760b
Remove typo in perf_train_gpu_many.mdx (#23144) · 3b74889e
Victor Geislinger authored May 04, 2023
```
- Excess `w` in  the word `bottom`
```
3b74889e
fix spelling error (#23143) · 5eeb5564
digger-yu authored May 04, 2023
```
change referrred to referred
```
5eeb5564

Add methods to update and verify out_features out_indices (#23031) · 90e8263d

amyeroberts authored May 04, 2023

* Add methods to update and verify out_features out_indices

* Safe update for config attributes

* Fix function names

* Save config correctly

* PR comments - use property setters

* PR comment - directly set attributes

* Update test

* Add updates to recently merged focalnet backbone

90e8263d

03 May, 2023 2 commits

GPTNeoForQuestionAnswering (#23057) · 78b7debf

peter-sk authored May 03, 2023



* first draft - gives index error in question_answering.py

* maturing

* no labels

* pipeline should know about QA

* fixing checks

* formatting

* fixed docstring

* initial commit

* formatting

* adding the class to many places

* towards less unhappy checks

* nearly there

* Update src/transformers/models/gpt_neo/modeling_gpt_neo.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* avoid error

* moving to device of star/end_logits

---------
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

78b7debf

Tidy Pytorch GLUE benchmark example (#23134) · b6933d76
Robert Stone authored May 03, 2023
```
Migration to Evaluate for metric is not quite complete
```
b6933d76