Commits · ae9230af40ecc8ccb940765830b2f8727049a845 · chenpangpang / transformers

28 Feb, 2023 7 commits
- [`T5`] Fix torchquant issue (#21843) · ae9230af
  Younes Belkada authored Feb 28, 2023
```
* fix torchquant issue

* add tests
```
  ae9230af
- Fix tf random token masking probability in data collator (#21834) · 2d506ea4
  anruijian authored Feb 28, 2023
```
* fix tf random mask tokens probability

* fix tf random mask tokens probability in collator for langauge modelling
```
  2d506ea4
- Fix gradient checkpointing imagegpt (#21816) · 4fe744f5
  Karim Foda authored Feb 28, 2023
```
* Fix gradient checkpointing bug in gptneox

* Fix gradient checkpointing bug in modeling_imagegpt.py

* Revert gpt neox changes

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  4fe744f5
- Fix gradient checkpointing bug in git (#21818) · e07a3d95
  Karim Foda authored Feb 28, 2023
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  e07a3d95
- check for None forced tokens (#21793) · 50db7414
  Andy Ehrenberg authored Feb 28, 2023
  
  50db7414
- Fix gradient checkpointing bug BioGpt (#21844) · 50644cf6
  saswatmeher authored Feb 28, 2023
```
Co-authored-by: saswatmeher <saswatmeher@cse.iitb.ac.in>
```
  50644cf6
- Rename `MobileViTModelTest` to `TFMobileViTModelTest` (#21825) · a9dd1243
  Yih-Dar authored Feb 28, 2023
```
Let's give TF a bit more love ❤️ 🙏

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  a9dd1243
27 Feb, 2023 13 commits

introduce `logger.warning_once` and use it for grad checkpointing code (#21804) · c7f3abc2
Stas Bekman authored Feb 27, 2023
```
* logger.warning_once

* style
```
c7f3abc2

Fix quality with `ruff==0.0.253` (#21828) · f95f60c8

Yih-Dar authored Feb 27, 2023



fix quality with ruff 0.0.253
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f95f60c8

Inheritance-based framework detection (#21784) · 92dfceb1
Joao Gante authored Feb 27, 2023

92dfceb1
Fix gradient checkpointing bug in gptneox (#21815) · 7811bf7e
Karim Foda authored Feb 27, 2023
```
* Fix gradient checkpointing bug in gptneox

* Remove use_cache block
```
7811bf7e
Fix nn.init.trunc_normal_ call on torch.float16 data (#21789) · 0c7f93f5
fxmarty authored Feb 27, 2023
```
fix nn.init.trunc_normal_ call on half data
```
0c7f93f5
Fix PyTorch Perceiver `PerceiverFourierPositionEncoding` with fp16 (#21787) · ebf84f07
fxmarty authored Feb 27, 2023
```
* fix perceiver fp16

* hopefully fix tests
```
ebf84f07
[`tests`] add `accelerate` marker (#21743) · 831f3144
Younes Belkada authored Feb 27, 2023
```
* add `accelerate` marker

* add to docs

* Update docs/source/en/testing.mdx
```
831f3144

[torch] remove deprecated uint8 in favor of bool (#21384) · c51dc4f9

Arthur authored Feb 27, 2023



* uint8 -> bool

* fix copies

* style

* update test modeling commen when checking attention buffers

* style

* use logical not on random mask instead of subtraction with 1

* remove torch uint8

* quality

* remove modified modeling utils

* Update based on review
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

---------
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

c51dc4f9

[Pipeline] Add zero shot audio classificatoin pipeline (#21600) · cc44e72d

Arthur authored Feb 27, 2023



* add pipeline

* update init

* add zero shot to init

* update inits and correct checkpoints

* update base to support input features

* add tests

* Update src/transformers/pipelines/zero_shot_audio_classification.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/pipelines/zero_shot_audio_classification.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* update pieline code

* use tiny checkpoint

* nits and expected value with tiny model

* style

* last nit on tests values

* fix styling

* fix collate fn that was casting t float

* update

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

cc44e72d

[FX tracer] Make `concrete_args` from outside available (#21775) · 2ea1ef90
Tianqi Zhang (张天启) authored Feb 27, 2023
```
make concrete_args from outside available
```
2ea1ef90
Fix en documentation typos (#21799) · ba2a5f13
Thomas Paviot authored Feb 27, 2023
```
* fix wrong url

* typos in english documentation
```
ba2a5f13
Fix type in gpt2 config docstring (#21782) · a3698365
Julian Weber authored Feb 27, 2023
```
Fix docstring gpt2 config
```
a3698365

[examples/summarization] deal with `max_length` and `num_beams` (#21740) · 3c0ce608

bofeng huang authored Feb 27, 2023

* Override the decoding parameters of Seq2SeqTrainer

* Fix quality

* Fix max_length parameter

* Fix quality

* Remove redundant parameter max_length

* Separate the preprocess of train and validation to use different max_target_length

3c0ce608

25 Feb, 2023 1 commit

Fix resume_from_checkpoint for deepspeed (#21735) · 9ddf4f4f

Moshe Berchansky authored Feb 25, 2023



* Fix resume_from_checkpoint for deepspeed

Fix resume_from_checkpoint for deepspeed, by ensuring that the deepspeed engine is the one to load the checkpoint.

* Empty commit to trigger CI

* Removed deepspeed skipping 

Removed deepspeed skipping inside the _load_from_checkpoint function, as it is obsolete

* another adjustment

* Trigger CI

* trigger circleci

* style

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

9ddf4f4f

24 Feb, 2023 12 commits

[SpeechT5] Fix HiFiGAN tests (#21788) · 3dae0d7b
Sanchit Gandhi authored Feb 24, 2023

3dae0d7b

[GPT2, ProphetNet] Fix gradient checkpointing bug (#21772) · 59c1d5b9

Yi Heng Lim authored Feb 24, 2023

* fix gradient checkpointing bug

* fix gradient checkpointing bug

* ran make fix-copies

* fixed bug

* fixed bug

59c1d5b9

[time series] updated expected values for integration test. (#21762) · ba0e370d

Kashif Rasul authored Feb 24, 2023

* updated expected

* prediction_length fix

* prediction_length default value

* default prediction_length 24

* revert back prediction_length default

* move prediction_length test

ba0e370d

Generate - update cookie cutters to not initialize cache with training and... · 440f3975
Joao Gante authored Feb 24, 2023
```
Generate - update cookie cutters to not initialize cache with training and gradient checkpointing (#21759)
```
440f3975

Fix-ci-whisper (#21767) · 087436c9

Arthur authored Feb 24, 2023

* fix history

* input_features instead of input ids for TFWhisport doctest

* use translate intead of transcribe

087436c9

[Whisper] Add SpecAugment (#21298) · c8545d2a

bofeng huang authored Feb 24, 2023



* Return and rescale attention_mask

* Add SpecAugment to Whisper modeling

* Fix test

* Update docstring

* Add SpecAug related parameters to model config

* Add the _mask_input_features function to doc

* Fix quality

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Remove dev comments

* Add test

* Resolve conflict

* feat: mask {feature, time} prob fast tests

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c8545d2a

[Flax] Fix erroneous kwargs being passed to generate config (#21765) · 75bd49ff
Sanchit Gandhi authored Feb 24, 2023

75bd49ff
Different behavior in DistilBERT when using "inputs_embeds" (#21752) · 14f33205
Arthur authored Feb 24, 2023
```
* Different behavior in DistilBERT when using "inputs_embeds"
Fixes #21089

* fix failing test
```
14f33205
[Examples] Generalise run audio classification for log-mel models (#21756) · 13489248
Sanchit Gandhi authored Feb 24, 2023
```
* [Examples] Generalise run audio classification for log-mel models

* batch feature extractor

* make style
```
13489248

[Flax] adding support for batch norm layers (#21581) · f7ca656f

Shubhamai authored Feb 24, 2023

* [flax] adding support for batch norm layers

* fixing bugs related to pt+flax integration

* cleanup, batchnorm support in sharded pt to flax

* support for batchnorm tests in pt+flax integration

* simplifying checking batch norm layer

f7ca656f

fix: Change is_last chunk calc and add conditional break in chunk_iter (#21612) · 279008ad

Connor Henderson authored Feb 24, 2023

* fix: Change is_last chunk calc and add conditional break

* format fix

* account for 0 and full stride_rights, add comment

* add new test

* make style

* update slow whisper asr test timestamps

* use nested_simplify on output and round timestamp to hundreths place

279008ad

Graphormer fix (#21699) · 4446b6b0

Clémentine Fourrier authored Feb 24, 2023

* Removed useless check for backend

* fix style check for graphormer

* Reverted change and corrected requires_backend for cython

* code qual

4446b6b0

23 Feb, 2023 7 commits

[deepspeed tests] fix issues introduced by #21700 (#21769) · 63306263
Stas Bekman authored Feb 23, 2023
```
* [deepspeed tests] fix issues introduced by #21700

* fix

* fix
```
63306263

Auto api Value Error addition to Troubleshoot (#21708) · 04d90ac4

Maria Khalusova authored Feb 23, 2023



* troubleshooting guide: added an error description for missing auto-mapping

* minor polishing

* changed the example

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/troubleshooting.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

04d90ac4

Added Type Hints for modeling_tf_encoder_decoder.py (#21673) · 0ffa22f9

Batese2001 authored Feb 23, 2023



* Ran Black formatting

* Added imports and reformatted

* Update src/transformers/models/encoder_decoder/modeling_tf_encoder_decoder.py

---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

0ffa22f9

Skip test_log_level for now · aa3787c8
ydshieh authored Feb 23, 2023

aa3787c8
Generate: Fix GIT batched captioning (#21738) · 1d4b7978
Joao Gante authored Feb 23, 2023

1d4b7978

[`GPTNeo`] Fix gradient checkpointing bug (#21733) · 78a93d17

Younes Belkada authored Feb 23, 2023



* fix bug

* forward contrib credits from discussions

* change logic

---------
Co-authored-by: edbeeching <edbeeching@users.noreply.github.com>

78a93d17

Fix 2 quicktour file doctest (#21742) · 36a6a1ad

Yih-Dar authored Feb 23, 2023



* Update expect output values - as Hub repo. files are updated

* Update expect output values - as librosa is from 0.9.2 to 0.10.0 on CI docker

* fix

* update one more

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

36a6a1ad