Commits · 19420fd99e1f08a052a1d0d267f3496002d03618 · chenpangpang / transformers

03 May, 2022 2 commits

Move test model folders (#17034) · 19420fd9

Yih-Dar authored May 03, 2022



* move test model folders (TODO: fix imports and others)

* fix (potentially partially) imports (in model test modules)

* fix (potentially partially) imports (in tokenization test modules)

* fix (potentially partially) imports (in feature extraction test modules)

* fix import utils.test_modeling_tf_core

* fix path ../fixtures/

* fix imports about generation.test_generation_flax_utils

* fix more imports

* fix fixture path

* fix get_test_dir

* update module_to_test_file

* fix get_tests_dir from wrong transformers.utils

* update config.yml (CircleCI)

* fix style

* remove missing imports

* update new model script

* update check_repo

* update SPECIAL_MODULE_TO_TEST_MAP

* fix style

* add __init__

* update self-scheduled

* fix add_new_model scripts

* check one way to get location back

* python setup.py build install

* fix import in test auto

* update self-scheduled.yml

* update slack notification script

* Add comments about artifact names

* fix for yolos
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

19420fd9

[FlaxBert] Add ForCausalLM (#16995) · cd9274d0

Sanchit Gandhi authored May 03, 2022

* [FlaxBert] Add ForCausalLM

* make style

* fix output attentions

* Add RobertaForCausalLM

* remove comment

* fix fx-to-pt model loading

* remove comment

* add modeling tests

* add enc-dec model tests

* add big_bird

* add electra

* make style

* make repo-consitency

* add to docs

* remove roberta test

* quality

* amend cookiecutter

* fix attention_mask bug in flax bert model tester

* tighten pt-fx thresholds to 1e-5

* add 'copied from' statements

* amend 'copied from' statements

* amend 'copied from' statements

* quality

cd9274d0

02 May, 2022 3 commits

[T5 Tokenizer] Model has no fixed position ids - there is no hardcode… (#16990) · 31616b8d

Patrick von Platen authored May 02, 2022



* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length

* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length

* correct t5 tokenizer

* correct t5 tokenizer

* fix test

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* finish
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

31616b8d

Add YOLOS (#16848) · 1ac69874

NielsRogge authored May 02, 2022



* First draft

* Add YolosForObjectDetection

* Make forward pass work

* Add mid position embeddings

* Add interpolation of position encodings

* Add expected values

* Add YOLOS to tests

* Add integration test

* Support tiny model as well

* Support all models in conversion script

* Remove mid_pe_size attribute

* Make more tests pass

* Add model to README and fix config

* Add copied from statements

* Rename base_model_prefix to vit

* Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP

* Apply suggestions from code review

* Apply more suggestions from code review

* Convert remaining checkpoints

* Improve docstrings

* Add YolosFeatureExtractor

* Add feature extractor to docs

* Add corresponding tests

* Fix style

* Fix docs

* Apply suggestion from code review

* Fix bad rebase

* Fix some more bad rebase

* Fix missing character

* Improve docs and variable names
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

1ac69874

Clean up vision tests (#17024) · 2de2c9ec

NielsRogge authored May 02, 2022



* Clean up tests

* Make fixup
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

2de2c9ec

29 Apr, 2022 2 commits
- TF: XLA bad words logits processor and list of processors (#16974) · fb0ae129
  Joao Gante authored Apr 29, 2022
  
  fb0ae129
- use scale=1.0 in floats_tensor called in speech model testers (#17007) · e952e049
  Yih-Dar authored Apr 29, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e952e049
28 Apr, 2022 1 commit
- set eos_token_id to None to generate until max length (#16989) · 5af5735f
  Yih-Dar authored Apr 28, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  5af5735f
27 Apr, 2022 1 commit
- Fix HubertRobustTest PT/TF equivalence test on GPU (#16943) · 49d5bcb0
  Yih-Dar authored Apr 27, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  49d5bcb0
26 Apr, 2022 1 commit
- Add onnx config for RoFormer (#16861) · aaee4038
  Krishna Sirumalla authored Apr 26, 2022
```
* add roformer onnx config
```
  aaee4038
25 Apr, 2022 7 commits
- Fix issue probably-meant-fstring found at https://codereview.doctor (#16913) · 65687520
  code-review-doctor authored Apr 25, 2022
  
  65687520
- TF: XLA stable softmax (#16892) · e03966e4
  Joao Gante authored Apr 25, 2022
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  e03966e4
- added deit onnx config (#16887) · 8246caf3
  Rushi Chaudhari authored Apr 25, 2022
```
* added deit onnx config
```
  8246caf3
- TF: XLA Logits Warpers (#16899) · 9331b379
  Joao Gante authored Apr 25, 2022
```
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
```
  9331b379
- TF: XLA logits processors - minimum length, forced eos, and forced bos (#16912) · 809dac48
  Joao Gante authored Apr 25, 2022
```
* XLA min len, forced eos, and forced bos
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
```
  809dac48
- Fix PyTorch RAG tests GPU OOM (#16881) · 32adbb26
  Yih-Dar authored Apr 25, 2022
```
* add torch.cuda.empty_cache in some PT RAG tests

* torch.cuda.empty_cache in tearDownModule()

* tearDown()

* add gc.collect()
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  32adbb26
- add bigbird typo fixes (#16897) · 508baf19
  Thomas Chaigneau authored Apr 25, 2022
```
Co-authored-by: ChainYo <t.chaigneau.tc@gmail.com>
```
  508baf19
22 Apr, 2022 3 commits
- TF: XLA repetition penalty (#16879) · 99c8226b
  Joao Gante authored Apr 22, 2022
  
  99c8226b
- Add OnnxConfig for ConvBERT (#16859) · ec81c11a
  Thomas Chaigneau authored Apr 22, 2022
```
* add OnnxConfig for ConvBert
Co-authored-by: ChainYo <t.chaigneau.tc@gmail.com>
```
  ec81c11a
- TF: rework XLA generate tests (#16866) · 6d90d76f
  Joao Gante authored Apr 22, 2022
  
  6d90d76f
21 Apr, 2022 2 commits
- Return input_ids in ImageGPT feature extractor (#16872) · cb555af2
  Sylvain Gugger authored Apr 21, 2022
  
  cb555af2
- Long QuestionAnsweringPipeline fix. (#16778) · 6620f60c
  Nicolas Patry authored Apr 21, 2022
```
* Temporary commit witht the long QA fix.

* Adding slow tests covering this fix.

* Removing fast test as it doesn't fail anyway.
```
  6620f60c
20 Apr, 2022 2 commits

Fixing return type tensor with `num_return_sequences>1`. (#16828) · e13a91fe
Nicolas Patry authored Apr 20, 2022
```
* Fixing return type tensor with `num_return_sequences>1`.

* Nit.
```
e13a91fe

add DebertaV2 fast tokenizer (#15529) · ff06b177

Yang Ming authored Apr 20, 2022

Co-authored-by: alcinos <carion.nicolas@gmail.com>
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
Co-authored-by: Nicolas Carion <carion.nicolas@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

ff06b177

19 Apr, 2022 7 commits

Add support for bitsandbytes (#15622) · 3104036e

Manuel R. Ciosici authored Apr 19, 2022



* Add initial BNB integration

* fixup! Add initial BNB integration

* Add bnb test decorator

* Update Adamw8bit option name

* Use the full bnb package name

* Overide bnb for all embedding layers

* Fix package name

* Formatting

* Remove unnecessary import

* Update src/transformers/trainer.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename AdamwBNB optimizer option

* Add training test checking that bnb memory utilization is lower

* fix merge

* fix merge; fix + extend new test

* cleanup

* expand bnb

* move all require_* candidates to testing_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

3104036e

Improve test_pt_tf_model_equivalence on PT side (#16731) · e6d23a4b

Yih-Dar authored Apr 19, 2022



* Update test_pt_tf_model_equivalence on PT side
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e6d23a4b

TF: Add sigmoid activation function (#16819) · f09c45e0
Joao Gante authored Apr 19, 2022

f09c45e0
Add onnx export of models with a multiple choice classification head (#16758) · 77de8d6c
Ella Charlaix authored Apr 19, 2022
```
* Add export of models with a multiple-choice classification head
```
77de8d6c

Some tests misusing assertTrue for comparisons fix (#16771) · a2392415

code-review-doctor authored Apr 19, 2022

* Fix issue avoid-misusing-assert-true found at https://codereview.doctor



* fix tests

* fix tf
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a2392415

[Flax] improve large model init and loading (#16148) · d3bd9ac7

Suraj Patil authored Apr 19, 2022



* begin do_init

* add params_shape_tree

* raise error if params are accessed when do_init is False

* don't allow do_init=False when keys are missing

* make shape tree a property

* assign self._params at the end

* add test for do_init

* add do_init arg to all flax models

* fix param setting

* disbale do_init for composite models

* update test

* add do_init in FlaxBigBirdForMultipleChoice

* better names and errors

* improve test

* style

* add a warning when do_init=False

* remove extra if

* set params after _required_params

* add test for from_pretrained

* do_init => _do_init

* chage warning to info

* fix typo

* add params in init_weights

* add params to gpt neo init

* add params to init_weights

* update do_init test

* Trigger CI

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update template

* trigger CI

* style

* style

* fix template
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d3bd9ac7

Clean up semantic segmentation tests (#16801) · 494c2a8c
NielsRogge authored Apr 19, 2022
```
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
494c2a8c

18 Apr, 2022 4 commits

Allow passing encoder_ouputs as tuple to EncoderDecoder Models (#16814) · 51e0ebed

jsnfly authored Apr 18, 2022



* Add passing encoder_outputs as tuple to existing test

* Add check for tuple

* Add check for tuple also for speech and vision
Co-authored-by: jsnfly <jsnfly@gmx.de>

51e0ebed

[Data2Vec] Add data2vec vision (#16760) · 8d3f952a

Patrick von Platen authored Apr 18, 2022



* save intermediate

* add vision

* add vision

* save

* finish models

* finish models

* continue

* finish

* up

* up

* up

* tests all pass

* clean up

* up

* up

* fix bugs in beit

* correct docs

* finish

* finish docs

* make style

* up

* more fixes

* fix type hint

* make style

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/data2vec/test_modeling_data2vec_vision.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* fix test
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8d3f952a

[ViT, BEiT, DeiT, DPT] Improve code (#16799) · d3c9d0e5

NielsRogge authored Apr 18, 2022



* Improve code

* Fix bugs

* Fix another bug

* Clean up DTP as well

* Update DPT model outputs
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

d3c9d0e5

TF generate refactor - XLA sample (#16713) · b4ddd267
Joao Gante authored Apr 18, 2022

b4ddd267

15 Apr, 2022 2 commits

[modeling utils] revamp `from_pretrained(..., low_cpu_mem_usage=True)` + tests (#16657) · 5da33f87

Stas Bekman authored Apr 14, 2022

* add low_cpu_mem_usage tests

* wip: revamping

* wip

* install /usr/bin/time

* wip

* cleanup

* cleanup

* cleanup

* cleanup

* cleanup

* fix assert

* put the wrapper back

* cleanup; switch to bert-base-cased

* Trigger CI

* Trigger CI

5da33f87

[trainer / deepspeed] fix hyperparameter_search (#16740) · ce2fef2a

Stas Bekman authored Apr 14, 2022

* [trainer / deepspeed] fix hyperparameter_search

* require optuna

* style

* oops

* add dep in the right place

* create deepspeed-testing dep group

* Trigger CI

ce2fef2a

14 Apr, 2022 2 commits
- Fix issue avoid-missing-comma found at https://codereview.doctor (#16768) · 1b7de41a
  code-review-doctor authored Apr 14, 2022
  
  1b7de41a
- Enabling `Tapex` in table question answering pipeline. (#16663) · 195fbbb6
  Nicolas Patry authored Apr 14, 2022
```
* Enabling `Tapex` in table question answering pipeline.

* Questions are independant for Tapex, making the test respect that.

* Missing extra space.
```
  195fbbb6
13 Apr, 2022 1 commit

Reduce Funnel PT/TF diff (#16744) · 6bed0647

Yih-Dar authored Apr 13, 2022



* Make Funnel Test less flaky
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6bed0647