Commits · b74a955325ef78c6d07b62c4f9be13ef0df170da · chenpangpang / transformers

"tests/vscode:/vscode.git/clone" did not exist on "b8db265bc6d0c9208ee465a12c6497149b4ee725"

19 Apr, 2022 3 commits

Some tests misusing assertTrue for comparisons fix (#16771) · a2392415

code-review-doctor authored Apr 19, 2022

* Fix issue avoid-misusing-assert-true found at https://codereview.doctor



* fix tests

* fix tf
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a2392415

[Flax] improve large model init and loading (#16148) · d3bd9ac7

Suraj Patil authored Apr 19, 2022



* begin do_init

* add params_shape_tree

* raise error if params are accessed when do_init is False

* don't allow do_init=False when keys are missing

* make shape tree a property

* assign self._params at the end

* add test for do_init

* add do_init arg to all flax models

* fix param setting

* disbale do_init for composite models

* update test

* add do_init in FlaxBigBirdForMultipleChoice

* better names and errors

* improve test

* style

* add a warning when do_init=False

* remove extra if

* set params after _required_params

* add test for from_pretrained

* do_init => _do_init

* chage warning to info

* fix typo

* add params in init_weights

* add params to gpt neo init

* add params to init_weights

* update do_init test

* Trigger CI

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update template

* trigger CI

* style

* style

* fix template
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d3bd9ac7

Clean up semantic segmentation tests (#16801) · 494c2a8c
NielsRogge authored Apr 19, 2022
```
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
494c2a8c

18 Apr, 2022 4 commits

Allow passing encoder_ouputs as tuple to EncoderDecoder Models (#16814) · 51e0ebed

jsnfly authored Apr 18, 2022



* Add passing encoder_outputs as tuple to existing test

* Add check for tuple

* Add check for tuple also for speech and vision
Co-authored-by: jsnfly <jsnfly@gmx.de>

51e0ebed

[Data2Vec] Add data2vec vision (#16760) · 8d3f952a

Patrick von Platen authored Apr 18, 2022



* save intermediate

* add vision

* add vision

* save

* finish models

* finish models

* continue

* finish

* up

* up

* up

* tests all pass

* clean up

* up

* up

* fix bugs in beit

* correct docs

* finish

* finish docs

* make style

* up

* more fixes

* fix type hint

* make style

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/data2vec/test_modeling_data2vec_vision.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* fix test
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8d3f952a

[ViT, BEiT, DeiT, DPT] Improve code (#16799) · d3c9d0e5

NielsRogge authored Apr 18, 2022



* Improve code

* Fix bugs

* Fix another bug

* Clean up DTP as well

* Update DPT model outputs
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

d3c9d0e5

TF generate refactor - XLA sample (#16713) · b4ddd267
Joao Gante authored Apr 18, 2022

b4ddd267

15 Apr, 2022 2 commits

[modeling utils] revamp `from_pretrained(..., low_cpu_mem_usage=True)` + tests (#16657) · 5da33f87

Stas Bekman authored Apr 14, 2022

* add low_cpu_mem_usage tests

* wip: revamping

* wip

* install /usr/bin/time

* wip

* cleanup

* cleanup

* cleanup

* cleanup

* cleanup

* fix assert

* put the wrapper back

* cleanup; switch to bert-base-cased

* Trigger CI

* Trigger CI

5da33f87

[trainer / deepspeed] fix hyperparameter_search (#16740) · ce2fef2a

Stas Bekman authored Apr 14, 2022

* [trainer / deepspeed] fix hyperparameter_search

* require optuna

* style

* oops

* add dep in the right place

* create deepspeed-testing dep group

* Trigger CI

ce2fef2a

14 Apr, 2022 2 commits
- Fix issue avoid-missing-comma found at https://codereview.doctor (#16768) · 1b7de41a
  code-review-doctor authored Apr 14, 2022
  
  1b7de41a
- Enabling `Tapex` in table question answering pipeline. (#16663) · 195fbbb6
  Nicolas Patry authored Apr 14, 2022
```
* Enabling `Tapex` in table question answering pipeline.

* Questions are independant for Tapex, making the test respect that.

* Missing extra space.
```
  195fbbb6
13 Apr, 2022 3 commits

Reduce Funnel PT/TF diff (#16744) · 6bed0647

Yih-Dar authored Apr 13, 2022



* Make Funnel Test less flaky
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6bed0647

Fix #16660 (tokenizers setters of ids of special tokens) (#16661) · 9f8bfe70

davidleonfdez authored Apr 13, 2022

* Fix setters of *_token_id properties of SpecialTokensMixin

* Test setters of common tokens ids

* Move to a separate test checks of setters of tokens ids

* Add independent test for ByT5

* Add Canine test

* Test speech to text

9f8bfe70

Fix decoding score comparison when using logits processors or warpers (#10638) · f7196f2e
Santiago Castro authored Apr 13, 2022
```
* Normalize using a logits warper

* Add a flag in `generate` to support the logit renormalization

* Add in RAG
```
f7196f2e

12 Apr, 2022 4 commits
- add Bigbird ONNX config (#16427) · 9c9db751
  Minh Chien Vu authored Apr 13, 2022
```
* add Bigbird ONNX config
```
  9c9db751
- [FlaxWav2Vec2Model] Fix bug in attention mask (#16725) · a9604067
  Sanchit Gandhi authored Apr 12, 2022
```
* [FlaxWav2Vec2Model] Fix bug in attention mask

* more fixes

* add (Flax)SpeechEncoderDecoderModel PT-FX cross-test
```
  a9604067
- TF: remove set_tensor_by_indices_to_value (#16729) · d7f7f29f
  Joao Gante authored Apr 12, 2022
  
  d7f7f29f
- Change the chunk_iter function to handle (#16730) · a192f61e
  Nicolas Patry authored Apr 12, 2022
```
* Change the chunk_iter function to handle

the subtle cases where the last chunk gets ignored since all the
data is in the `left_strided` data.

We need to remove the right striding on the previous item.

* Remove commented line.
```
  a192f61e
11 Apr, 2022 6 commits

Improve PT/TF equivalence test (#16557) · dce33f21

Yih-Dar authored Apr 11, 2022



* add error message

* Use names in the error message

* allow ModelOutput

* rename to check_pt_tf_outputs and move outside

* fix style

* skip past_key_values in a better way

* Add comments

* improve code for label/loss

* make the logic clear by moving the ignore keys out

* fix _postprocessing_to_ignore

* fix _postprocessing_to_ignore: create new outputs from the remaining fields

* ignore past_key_values in TFGPT2 models for now

* make check_pt_tf_outputs better regarding names

* move check_pt_tf_models outside

* rename methods

* remove test_pt_tf_model_equivalence in TFCLIPModelTest

* Reduce TFViTMAEModelTest.test_pt_tf_model_equivalence

* move prepare_pt_inputs_from_tf_inputs outside check_pt_tf_models

* Fix quality

* Clean-up TFLxmertModelTester.test_pt_tf_model_equivalence

* Fix quality

* fix

* fix style

* Clean-up TFLEDModelTest.test_pt_tf_model_equivalence

* Fix quality

* add docstring

* improve comment
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

dce33f21

Enable more test_torchscript (#16679) · c04619ec

Yih-Dar authored Apr 11, 2022



* update _create_and_check_torchscript

* Enable test_torchscript

* clear_class_registry
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

c04619ec

Reduce memory leak in _create_and_check_torchscript (#16691) · 3918d6a9
Yih-Dar authored Apr 11, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
3918d6a9
Rename the method test_torchscript (#16693) · 2109afae
Yih-Dar authored Apr 11, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
2109afae
Generate: min length can't be larger than max length (#16668) · b0bf3011
Joao Gante authored Apr 11, 2022
```
* min length must be smaller than max length

* Update min_length in tests
```
b0bf3011

add a warning in `SpmConverter` for sentencepiece's model using the byte fallback feature (#16629) · 1025a9b7

SaulLu authored Apr 11, 2022

* update proto sentencepiece model

* Revert "update proto sentencepiece model"

This reverts commit b07f671747fec35773d0b3d4788b8b15aefa0229.

* add check

* add test

* Revert "Revert "update proto sentencepiece model""

This reverts commit 46108257b8927b73627ec8f4f3eed53a95fc700d.

* test for log level

* test for log level 2

* warning at the warning level

* clean

* format

* add explanation in docstring

1025a9b7

08 Apr, 2022 1 commit

Add TAPEX (#16473) · 4ef0abb7

NielsRogge authored Apr 08, 2022

* Add TapexTokenizer

* Improve docstrings and provide option to provide answer

* Remove option for pretokenized inputs

* Add TAPEX to README

* Fix copies

* Remove option for pretokenized inputs

* Initial commit: add tapex fine-tuning examples on both table-based question answering and table-based fact verification.

* - Draft a README file for running the script and introducing some background.
- Remove unused code lines in tabfact script.
- Disable the deafult `pad_to_max_length` option which is memory-consuming.

* * Support `as_target_tokenizer` function for TapexTokenizer.
* Fix the do_lower_case behaviour of TapexTokenizer.
* Add unit tests for target scenarios and cased/uncased scenarios for both source and target.

* * Replace the label BartTokenizer with TapexTokenizer's as_target_tokenizer function.
* Fix typos in tapex example README.

* * fix the evaluation script - remove the property `task_name`

* * Make the label space more clear for tabfact tasks

* * Using a new fine-tuning script for tapex-base on tabfact.

* * Remove the lowercase code outside the tokenizer - we use the tokenizer to control whether do_lower_case
* Guarantee the hyper-parameter can be run without out-of-memory on 16GB card and report the new reproduced number on wikisql

* * Remove the default tokenizer_name option.
* Provide evaluation command.

* * Support for WikiTableQuestion dataset.

* Fix a typo in README.

* * Fix the datasets's key name in WikiTableQuestions

* Run make fixup and move test to folder

* Fix quality

* Apply suggestions from code review

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply suggestions from code review

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply some more suggestions from code review

* Improve docstrings

* Overwrite failing test

* Improve comment in example scripts

* Fix rebase

* Add TAPEX to Auto mapping

* Add TAPEX to auto config mappings

* Put TAPEX higher than BART in auto mapping

* Add TAPEX to doc tests
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
Co-authored-by: SivilTaram <qianlxc@outlook.com>
Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

4ef0abb7

07 Apr, 2022 2 commits

RegNet (#16188) · af14c619

Francesco Saverio Zuppichini authored Apr 07, 2022



* base model done

* make style

* done

* added files

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Trigger doc build

* resolved conversations

* resolved conversations

* seer models

* minor changes

* minor changes

* make fixup

* glob variables

* minor changes

* fix copies

* config when possibile

* resolved conflicts

* resolved conflicts

* resolved conflicts

* CI

* conversion script for 10b param

* fixed for 10b model

* minor updates in the doc + make style

* removed unused code

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* removed unused code

* removed unused code

* updated modeling_utils from main
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

af14c619

Remove parent/child tests in auto model tests (#16653) · 389f6615
Sylvain Gugger authored Apr 07, 2022

389f6615

06 Apr, 2022 2 commits
- TF generate refactor - Beam Search (#16374) · 3f43d824
  Joao Gante authored Apr 06, 2022
```
* refactor TF beam search

* refactored generate can now properly use attention masks

* add force bos/eos logit processors
```
  3f43d824
- [FlaxSpeechEncoderDecoderModel] More Rigorous PT-Flax Equivalence Tests (#16589) · 8d57c424
  Sanchit Gandhi authored Apr 06, 2022
  
  8d57c424
05 Apr, 2022 2 commits

Adding new train_step logic to make things less confusing for users (#15994) · 43540052

Matt authored Apr 05, 2022



* Adding new train_step logic to make things less confusing for users

* DO NOT ASK WHY WE NEED THAT SUBCLASS

* Metrics now working, at least for single-output models with type annotations!

* Updates and TODOs for the new train_step

* Make fixup

* Temporary test workaround until T5 has types

* Temporary test workaround until T5 has types

* I think this actually works! Needs a lot of tests though

* MAke style/quality

* Revert changes to T5 tests

* Deleting the aforementioned unmentionable subclass

* Deleting the aforementioned unmentionable subclass

* Adding a Keras API test

* Style fixes

* Removing unneeded TODO and comments

* Update test_step too

* Stop trying to compute metrics with the dummy_loss, patch up test

* Make style

* make fixup

* Docstring cleanup

* make fixup

* make fixup

* Stop expanding 1D input tensors when using dummy loss

* Adjust T5 test given the new compile()

* make fixup

* Skipping test for convnext

* Removing old T5-specific Keras test now that we have a common one

* make fixup

* make fixup

* Only skip convnext test on CPU

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Avoiding TF import issues

* make fixup

* Update compile() to support TF 2.3

* Skipping model.fit() on template classes for now

* Skipping model.fit() on template class tests for now

* Replace ad-hoc solution with find_labels

* make fixup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

43540052

Fix CI: test_inference_for_pretraining in ViTMAEModelTest (#16591) · 765bafb8
Yih-Dar authored Apr 05, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
765bafb8

04 Apr, 2022 3 commits

TF: Finalize `unpack_inputs`-related changes (#16499) · dad5ca83
Joao Gante authored Apr 04, 2022
```
* Add unpack_inputs to remaining models

* removed kwargs to `call()` in TF models

* fix TF T5 tests
```
dad5ca83
add a test checking the format of `convert_tokens_to_string`'s output (#16540) · be9474bd
SaulLu authored Apr 04, 2022
```
* add new tests

* add comment to overridden tests
```
be9474bd

Add utility to find model labels (#16526) · 3951b9f3

Sylvain Gugger authored Apr 04, 2022



* Add utility to find model labels

* Use it in the Trainer

* Update src/transformers/utils/generic.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Quality
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

3951b9f3

01 Apr, 2022 2 commits

Use random_attention_mask for TF tests (#16517) · 2199382d

Yih-Dar authored Apr 01, 2022



* use random_attention_mask for TF tests

* Fix for TFCLIP test (for now).
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2199382d

Add ONNX export for BeiT (#16498) · 9de70f21

Jim Rohrer authored Apr 01, 2022

* Add beit onnx conversion support

* Updated docs

* Added cross reference to ViT ONNX config

9de70f21

30 Mar, 2022 1 commit

Feature Extractor accepts `segmentation_maps` (#15964) · c4deb7b3

Francesco Saverio Zuppichini authored Mar 30, 2022



* feature extractor accepts

* resolved conversations

* added examples in test for ADE20K

* num_classes -> num_labels

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* resolving conversations

* resolving conversations

* removed ADE

* CI

* minor changes in conversion script

* reduce_labels in feature extractor

* minor changes

* correct preprocess for instace segmentation maps

* minor changes

* minor changes

* CI

* debugging

* better padding

* going to update labels inside the model

* going to update labels inside the model

* minor changes

* tests

* removed changes in feature_extractor_utils

* conversation

* conversation

* example in feature extractor

* more docstring in modeling

* test

* make style

* doc
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c4deb7b3

29 Mar, 2022 3 commits

Raise diff tolerance value for TFViTMAEModelTest (#16483) · 2b483230
Yih-Dar authored Mar 29, 2022
```
* Raise diff tolerance value
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
2b483230

Avoid accessing .dataset of a DataLoader in Trainer (#16451) · d7c8ce57

Sander Land authored Mar 29, 2022



* Avoid accessing .dataset of a dataloader

* style

* fix

* cleaning up, reverting some misunderstandings

* black

* add train_dataset argument to get_train_dataloader, and fix other instances of length checks

* flake8

* address comments

* fix bug

* cleanup

* add test

* Update tests/trainer/test_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* under torch

* merge

* stylistic suggestion
Co-authored-by: Sander Land <sander@chatdesk.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d7c8ce57

Add TF ViT MAE (#16255) · 5b40a37b

Sayak Paul authored Mar 29, 2022



* ported TFViTMAEIntermediate and TFViTMAEOutput.

* added TFViTMAEModel and TFViTMAEDecoder.

* feat: added a noise argument in the implementation for reproducibility.

* feat: vit mae models with an additional noise argument for reproducibility.
Co-authored-by: ariG23498 <aritra.born2fly@gmail.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5b40a37b