Commits · 4e1dee0e8e06c1146d023c43812b88bfe2763329 · chenpangpang / transformers

17 Aug, 2023 19 commits

Revert "change version (#25387)" (#25573) · 4e1dee0e
Marc Sun authored Aug 17, 2023
```
This reverts commit 3a05e010.
```
4e1dee0e
[`Tests`] Fix failing 8bit test (#25564) · d4c0aa14
Younes Belkada authored Aug 17, 2023
```
* fix failing 8bit test

* trigger CI
```
d4c0aa14
[`NllbMoe`] Update code to properly support loss computation (#25429) · 181d778f
Arthur authored Aug 17, 2023
```
* update nllb_moe

* fix

* doc nits

* nits

* add a small test

* ficup

* remove adapted from
```
181d778f

Inconsistency in PreTrainedModel.resize_token_embeddings When ZeRO3 Is Enabled (#25394) · 9264fc91

Sina authored Aug 17, 2023

* Inconsistency in PreTrainedModel.resize_token_embeddings

This PR addresses https://github.com/huggingface/transformers/issues/25241

.

In previous implementation when ZeRO stage 3 was enbaled, resize_token_embeddings would create independent PyTorch weights on each device. Here we ensure that new embeddings are created with DeepSpeed init, and are properly partitioned accros devices.

* formatting with black

* adding the removed comments back in

---------
Co-authored-by: Sina Moeini <smoeini@amazon.com>

9264fc91

🚨

[`SPM`] Finish fix spm models

🚨

(#25224) · b4d55488

Arthur authored Aug 17, 2023

* fix EVERYTHING

* more fixes

* ⚗️⚗️ Tokenizer magic ⚗️⚗

️

* wrong value but test passes for the TODO

* update

* updat

* safe protobuf import?

* style

* non gated repo

* update

* fixup

* Update src/transformers/models/llama/tokenization_llama.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/llama/tokenization_llama.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/t5/test_tokenization_t5.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* nits

* fix t5 too

* use assert equal

* fix llama decoding

* nits on t5

* fixup

* only remove the prefix space, not other spaces

* more deconding tests and more todos

* fix CI as well

* fixup

* skip failing test on CI (its tf its ok)

* skip test_subword_regularization_tokenizer that is also crashing on the CI for TF

* update llama

* revert good fixes

* fixup

* empty

* explain why we need to encode with an additional token

* better warning?

* nits

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

b4d55488

[`SwitchTransformers`] Remove unused module (#25427) · 5347d000
Arthur authored Aug 17, 2023
```
* remove unused module

* remove old feed_forward_proj

* fixup
```
5347d000

[`resize_embedding`] Introduce `pad_to_multiple_of` and guidance (#25088) · d6bf08f7

Arthur authored Aug 17, 2023

* fix

* revert cahnges and update resizing of embedding layer

* use wraning

* fixup

* more styling nits

* fix all tests that overload the embedding tests

* 👀👀 remove breakpoint

* remove useless overload + overload correctly where needed

* resize lm head with new vocab size

* reverse not necessary changes

* style

* fix CIs!

* fix last CI tests, adapt bark and Marian

* fixup

d6bf08f7

Skip `test_beam_search_xla_generate_simple` for `T5` (#25566) · d2871b29
Yih-Dar authored Aug 17, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
d2871b29

Adds `TRANSFORMERS_TEST_DEVICE` (#25506) · 1791ef8d

Alex McKinney authored Aug 17, 2023

* Adds `TRANSFORMERS_TEST_DEVICE`
Mirrors the same API in the diffusers library. Useful in transformers
too.

* replace backend checking with trying `torch.device`

* Adds better error message for unknown test devices

* `make style`

* adds documentation showing `TRANSFORMERS_TEST_DEVICE` usage.

1791ef8d

[`Docs`] Fix un-rendered images (#25561) · e7e9261a
Younes Belkada authored Aug 17, 2023
```
fix un-rendered images
```
e7e9261a
Skip `test_onnx_runtime_optimize` for now (#25560) · 8992589d
Yih-Dar authored Aug 17, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8992589d
YOLOS - reset default return_pixel_mask value (#25559) · e50c9253
amyeroberts authored Aug 17, 2023
```
Remove added back copied from statement
```
e50c9253
🚨🚨🚨 Vivit update default rescale_factor value (#25547) · c8346cb2
amyeroberts authored Aug 17, 2023
```
* Update default rescale_factor value

* Formatting
```
c8346cb2

Fix `torch.fx` tests on nightly CI (#25549) · 8fd65619

Yih-Dar authored Aug 17, 2023



* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

8fd65619

Fix MPT CI (#25548) · ec25306b

Yih-Dar authored Aug 17, 2023



fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ec25306b

Add documentation to dynamic module utils (#25534) · 297a6a7a
Sylvain Gugger authored Aug 17, 2023
```
* Add documentation to dynamic module utils

* Address review comments
```
297a6a7a
Update trainer.py (#25553) · d1832dd8
Yun Dai authored Aug 16, 2023

d1832dd8

[i18n-KO] Translated docs: ko: pr_checks.md to Korean (#24987) · db816c6e

Juntae authored Aug 17, 2023



* docs: ko: pr_checks.mdx

* feat: chatgpt draft

* fix: manual edits

* fix: resolve suggestions
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>

* feat: chatgpt draft

* fix: manual edits

---------
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>

db816c6e

More utils doc (#25457) · 2defb6b0

Sylvain Gugger authored Aug 17, 2023

* Document and clean more utils.

* More documentation and fixes

* Switch to Lysandre's token

* Address review comments

* Actually put else

2defb6b0

16 Aug, 2023 10 commits
- [ASR Pipeline] Fix init with timestamps (#25438) · 36f183eb
  Sanchit Gandhi authored Aug 16, 2023
```
* [ASR Pipeline] Fix init

* refactor test

* change default kwarg setting

* only perform checks if we have to

* override init

* move pre/forward/post checks to sanitize
```
  36f183eb
- Input data format (#25464) · 6bca43bb
  amyeroberts authored Aug 16, 2023
```
* Add copied from statements for image processors

* Move out rescale and normalize to base image processor

* Remove rescale and normalize from vit (post rebase)

* Update docstrings and tidy up

* PR comments

* Add input_data_format as preprocess argument

* Resolve tests and tidy up

* Remove num_channels argument

* Update doc strings -> default ints not in code formatting
```
  6bca43bb
- More frozen args (#25540) · a6609caf
  Zach Mueller authored Aug 16, 2023
  
  a6609caf
- Fix `MaskFormerModelIntegrationTest` OOM (#25544) · f61f072b
  Yih-Dar authored Aug 16, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  f61f072b
- fix vit hybrid test (#25543) · 0ed23e4d
  Marc Sun authored Aug 16, 2023
```
fix test
```
  0ed23e4d
- Generate: fix default max length warning (#25539) · 3f9cb335
  Joao Gante authored Aug 16, 2023
  
  3f9cb335
- Document the test fetcher (#25521) · e13d5b60
  Sylvain Gugger authored Aug 16, 2023
```
* Document the test fetcher

* Address review comments
```
  e13d5b60
- Marian: post-hack-fix correction (#25459) · 0b568291
  Joao Gante authored Aug 16, 2023
  
  0b568291
- Fix nested configs of Jukebox (#25533) · 5ccf343a
  Sylvain Gugger authored Aug 16, 2023
  
  5ccf343a
- [TYPO] fix typo/format in quicktour.md (#25519) · c385de24
  lishukan authored Aug 16, 2023
```
* fix_all_language_quicktour

* give up ! before bash command

---------
Co-authored-by: lishukan <lishukan@dxy.cn>
```
  c385de24
15 Aug, 2023 6 commits

Use dynamic past key-values shape in TF-Whisper (#25523) · eec5841e
Matt authored Aug 15, 2023

eec5841e

Make training args fully immutable (#25435) · ca514992

Zach Mueller authored Aug 15, 2023

* Make training args fully immutable

* Working tests, PyTorch

* In test_trainer

* during testing

* Use proper dataclass way

* Fix test

* Another one

* Fix tf

* Lingering slow

* Exception

* Clean

ca514992

add __repr__ to the BitsAndBytesConfig class (#25517) · f11518a5
YQ authored Aug 15, 2023
```
add __repr__
```
f11518a5

Bump tornado from 6.3.2 to 6.3.3 in /examples/research_projects/lxmert (#25511) · 7a94ea4c

dependabot[bot] authored Aug 15, 2023

Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.3.2 to 6.3.3.
- [Changelog](https://github.com/tornadoweb/tornado/blob/master/docs/releases.rst)
- [Commits](https://github.com/tornadoweb/tornado/compare/v6.3.2...v6.3.3

)

---
updated-dependencies:
- dependency-name: tornado
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

7a94ea4c

Bump tornado from 6.3.2 to 6.3.3 in /examples/research_projects/visual_bert (#25512) · 2552b8c5

dependabot[bot] authored Aug 15, 2023

Bump tornado in /examples/research_projects/visual_bert

Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.3.2 to 6.3.3.
- [Changelog](https://github.com/tornadoweb/tornado/blob/master/docs/releases.rst)
- [Commits](https://github.com/tornadoweb/tornado/compare/v6.3.2...v6.3.3

)

---
updated-dependencies:
- dependency-name: tornado
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

2552b8c5

Check for case where `auxiliary_head` is `None` in `UperNetPreTrainedModel` (#25514) · df91ff53
Michael Murray authored Aug 14, 2023
```
check for case where auxiliary_head is None in UperNetPreTrainedModel
```
df91ff53

14 Aug, 2023 5 commits
- Conditional DETR type hint fix (#25505) · b42010bb
  Matt authored Aug 14, 2023
  
  b42010bb
- 🚨🚨🚨 Remove softmax for EfficientNetForImageClassification 🚨🚨🚨 (#25501) · c4129196
  amyeroberts authored Aug 14, 2023
```
* Remove softmax for EfficientNet

* Update integration test values

* Fix up
```
  c4129196
- fix gptq nits (#25500) · 06a1d75b
  Marc Sun authored Aug 14, 2023
```
* fix nits

* fix docstring

* fix doc

* fix damp_percent

* fix doc
```
  06a1d75b
- MaskFormer post_process_instance_segmentation bug fix convert out side of loop (#25497) · 80f29a25
  amyeroberts authored Aug 14, 2023
```
Bug fix - convert out side of loop
```
  80f29a25
- Set can_generate for SpeechT5ForTextToSpeech (#25493) · ee7d6694
  Yoach Lacombe authored Aug 14, 2023
```
add can_generate=True to SpeechT5ForTextToSpeech
```
  ee7d6694