Commits · 9a2995ee394b8f45fa3a5c12187e06886b77691a · chenpangpang / transformers

18 Apr, 2022 7 commits
- [Quicktour Audio] Improve && remove ffmpeg dependency (#16723) · 9a2995ee
  Patrick von Platen authored Apr 18, 2022
```
* [Quicktour Audio] Improve && remove ffmpeg dependency

* final fix

* final touches
```
  9a2995ee
- [ViT, BEiT, DeiT, DPT] Improve code (#16799) · d3c9d0e5
  NielsRogge authored Apr 18, 2022
```
* Improve code

* Fix bugs

* Fix another bug

* Clean up DTP as well

* Update DPT model outputs
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
  d3c9d0e5
- Fix syntax error in TorchHub workflow · 3785f466
  Sylvain Gugger authored Apr 18, 2022
  
  3785f466
- Create empty venv on cache miss (#16816) · 6984848e
  Joao Gante authored Apr 18, 2022
  
  6984848e
- Raise error and suggestion when using custom optimizer with Fairscale or Deepspeed (#16786) · 43814483
  Allan Jie authored Apr 18, 2022
```
* optimizer issues related to saving

* remove the "optimizer saving" option

* reformat using make style
```
  43814483
- TF generate refactor - XLA sample (#16713) · b4ddd267
  Joao Gante authored Apr 18, 2022
  
  b4ddd267
- CI: non-remote GH Actions now use a python venv (#16789) · 02de7a8e
  Joao Gante authored Apr 18, 2022
  
  02de7a8e
17 Apr, 2022 1 commit
- Pin Jax to last working release (#16808) · dee6f016
  Sylvain Gugger authored Apr 16, 2022
```
* Pin Jax to last working release

* Try lower

* Try lower
```
  dee6f016
15 Apr, 2022 4 commits

Update README.md (#16797) · 78f346c2
NielsRogge authored Apr 15, 2022

78f346c2
Fix PT TF ViTMAE (#16766) · ee209d4d
Yih-Dar authored Apr 15, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
ee209d4d

[modeling utils] revamp `from_pretrained(..., low_cpu_mem_usage=True)` + tests (#16657) · 5da33f87

Stas Bekman authored Apr 14, 2022

* add low_cpu_mem_usage tests

* wip: revamping

* wip

* install /usr/bin/time

* wip

* cleanup

* cleanup

* cleanup

* cleanup

* cleanup

* fix assert

* put the wrapper back

* cleanup; switch to bert-base-cased

* Trigger CI

* Trigger CI

5da33f87

[trainer / deepspeed] fix hyperparameter_search (#16740) · ce2fef2a

Stas Bekman authored Apr 14, 2022

* [trainer / deepspeed] fix hyperparameter_search

* require optuna

* style

* oops

* add dep in the right place

* create deepspeed-testing dep group

* Trigger CI

ce2fef2a

14 Apr, 2022 9 commits
- Fix issue avoid-missing-comma found at https://codereview.doctor (#16768) · 1b7de41a
  code-review-doctor authored Apr 14, 2022
  
  1b7de41a
- [SpeechEncoderDecoderModel] Fix bug in reshaping labels (#16748) · de8b06f9
  Sanchit Gandhi authored Apr 14, 2022
  
  de8b06f9
- Improve image classification example (#16585) · 048443db
  NielsRogge authored Apr 14, 2022
```
* Improve README

* Make dataset_name argument optional

* Improve local data

* Fix bug

* Improve README some more

* Apply suggestions from code review

* Improve README
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
  048443db
- Kill async pushes when calling push_to_hub with blocking=True (#16755) · 3e4eec47
  Sylvain Gugger authored Apr 14, 2022
  
  3e4eec47
- [deepspeed / m2m_100] make deepspeed zero-3 work with layerdrop (#16717) · c21e1071
  Stas Bekman authored Apr 14, 2022
```
* [deepspeed / m2m_100] make deepspeed 3 work with layerdrop

* fix

* revert last
```
  c21e1071
- Make nightly install dev accelerate (#16783) · 89293a0f
  Zachary Mueller authored Apr 14, 2022
  
  89293a0f
- Fix batch size in evaluation loop (#16763) · b151ddb9
  Sylvain Gugger authored Apr 14, 2022
```
* Fix batch size in evaluation loop

* remove debug statement
```
  b151ddb9
- [Flax `.from_pretrained`] Raise a warning if model weights are not in float32 (#16762) · d8269eb4
  Sanchit Gandhi authored Apr 14, 2022
```
* [Flax] Raise a warning if model weights are not in float32

* apply suggestions and few small changes

* reorder wording for better readability
```
  d8269eb4
- Enabling `Tapex` in table question answering pipeline. (#16663) · 195fbbb6
  Nicolas Patry authored Apr 14, 2022
```
* Enabling `Tapex` in table question answering pipeline.

* Questions are independant for Tapex, making the test respect that.

* Missing extra space.
```
  195fbbb6
13 Apr, 2022 14 commits

[Doctest] added doctest changes for electra (#16675) · 442dc456
Bhadresh Savani authored Apr 14, 2022
```
* added doctest changes for electra

* fixed doctest tests

* updated changes
```
442dc456

Fixup no_trainer examples scripts and add more tests (#16765) · be752d12

Zachary Mueller authored Apr 13, 2022

* Change tracking to store_true

* Remove step param and use it in the log dictionary directly

* use vars(args) when passing args to init_trackers

* Include tracking tests since tensorboard is already a dep

be752d12

[self-scheduled ci] explain where dependencies are (#16757) · 3a16ab25
Stas Bekman authored Apr 13, 2022

3a16ab25

Add self training code for text classification (#16738) · 34ef029d

Tu Vu authored Apr 13, 2022

* Add self-training code for text-classification

* Add self-training code for text-classification

* Add self-training code for text-classification

* Add self-training code for text-classification

* Add self-training code for text-classification

* Delete strata

34ef029d

Add defensive check for config num_labels and id2label (#16709) · 8e0d3b42

Sylvain Gugger authored Apr 13, 2022

* Add defensive check for config num_labels and id2label

* Actually check value...

* Only warning inside init plus better error message

8e0d3b42

Reduce Funnel PT/TF diff (#16744) · 6bed0647

Yih-Dar authored Apr 13, 2022



* Make Funnel Test less flaky
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6bed0647

CI: setup-dependent pip cache (#16751) · 0b8f6972
Joao Gante authored Apr 13, 2022
```
* Setup-dependent pip cache

* Do not restore from old versions
```
0b8f6972
[modeling_utils] better explanation of ignore keys (#16741) · ac43a40e
Stas Bekman authored Apr 13, 2022

ac43a40e

Fix and improve CTRL doctests (#16573) · 0235bc57

Jeremy Fisher authored Apr 13, 2022



* Improve CTRL doctests

* Fix `CTRLForSequenceClassification` flakiness with inconsistent losses

* Remove unused

* Fixup

* Add CTRL to documentation_tests.txt

* Fix control code not being first

* Add output assertions

* Change from sshleifer/tiny-ctrl -> ctrl

* Run `make fixup`

* apply `list` to output logits shape for clarity

* Reduce output loss precision to make assertion more robust

* Add assertion of control code being first

* Fix docstyle

* upper case sentence following control code

* Weird bug fixes

* Add a better generation example
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

0235bc57

Add Doc Test for GPT-J (#16507) · 06b4aac9

Michael Chung authored Apr 13, 2022



* Required the values GPTJ unfortunately cannot run the model =)

* Added the file to the doc tests

* Run Fixup and Style

* Fixed with the test versions of gptj. Ran Style and Fixup.

* Trigger ci

* A Minor Change to License

* Fixed spacing added to the benchmark_utils. Then refactored tests to const variables.

* Removed strings that were included as default parameters anyways.
Co-authored-by: ArEnSc <xx.mike.chung.xx@gmail.com>

06b4aac9

[from_pretrained] refactor find_mismatched_keys (#16706) · 12bfa97a
Stas Bekman authored Apr 13, 2022

12bfa97a

Fix #16660 (tokenizers setters of ids of special tokens) (#16661) · 9f8bfe70

davidleonfdez authored Apr 13, 2022

* Fix setters of *_token_id properties of SpecialTokensMixin

* Test setters of common tokens ids

* Move to a separate test checks of setters of tokens ids

* Add independent test for ByT5

* Add Canine test

* Test speech to text

9f8bfe70

[Doctests] Fix all T5 doc tests (#16646) · b24201fa

Patrick von Platen authored Apr 13, 2022



* [Doctests] Fix all T5 doc tests

* make style

* Update docs/source/en/model_doc/t5.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply Sylvains comments

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b24201fa

Fix decoding score comparison when using logits processors or warpers (#10638) · f7196f2e
Santiago Castro authored Apr 13, 2022
```
* Normalize using a logits warper

* Add a flag in `generate` to support the logit renormalization

* Add in RAG
```
f7196f2e

12 Apr, 2022 5 commits
- TF generate: handle case without cache in beam search (#16704) · eb5bdcdf
  Joao Gante authored Apr 12, 2022
  
  eb5bdcdf
- add Bigbird ONNX config (#16427) · 9c9db751
  Minh Chien Vu authored Apr 13, 2022
```
* add Bigbird ONNX config
```
  9c9db751
- [FlaxWav2Vec2Model] Fix bug in attention mask (#16725) · a9604067
  Sanchit Gandhi authored Apr 12, 2022
```
* [FlaxWav2Vec2Model] Fix bug in attention mask

* more fixes

* add (Flax)SpeechEncoderDecoderModel PT-FX cross-test
```
  a9604067
- [FlaxSpeechEncoderDecoder] Fix input shape bug in weights init (#16728) · 6adefba3
  Sanchit Gandhi authored Apr 12, 2022
```
* [FlaxSpeechEncoderDecoder] Fix input shape bug in weights init

* make style
```
  6adefba3
- Add Doc Tests for Reformer PyTorch (#16565) · 1bac40db
  hiromu authored Apr 13, 2022
```
* start working

* fix: ReformerForQA doctest

* fix: ReformerModelWithLMHead doctest

* fix: ReformerModelForSC doctest

* fix: ReformerModelForMLM doctest

* add: documentation_tests.txt

* make fixup

* change: ReformerModelForSC doctest

* change: checkpoint
```
  1bac40db