Commits · c4fa908fa98c3d538462c537d29b7613dd71306e · chenpangpang / transformers

11 Jan, 2022 6 commits
- Adds IBERT to models exportable with ONNX (#14868) · c4fa908f
  Virus authored Jan 11, 2022
```
* Add IBertOnnxConfig and tests

* add all the supported features for IBERT and remove outputs in IbertOnnxConfig

* use OnnxConfig

* fix codestyle

* remove serialization.rst

* codestyle
```
  c4fa908f
- [Wav2Vec2ProcessorWithLM] improve decoder downlaod (#15040) · efb35a41
  Patrick von Platen authored Jan 11, 2022
  
  efb35a41
- Fix cookiecutter (#15100) · 6ea62666
  NielsRogge authored Jan 11, 2022
  
  6ea62666
- fix doc example - TypeError: forward() got an unexpected keyword argument 'input_ids' (#15092) · 68810aa2
  Yih-Dar authored Jan 11, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  68810aa2
- Take gradient accumulation into account when defining samplers (#15095) · ca76618d
  Sylvain Gugger authored Jan 11, 2022
```
* Take gradient accumulation into account when defining samplers

* style
```
  ca76618d
- Add test to check reported training loss (#15096) · 9dc8fb2f
  Sylvain Gugger authored Jan 11, 2022
```
* Add test

* Add tests for the reported train loss
```
  9dc8fb2f
10 Jan, 2022 15 commits

Add TFVisionEncoderDecoderModel (#14148) · b67fd797

Yih-Dar authored Jan 10, 2022



* Start the work on TFVisionEncoderDecoderModel

* Expose TFVisionEncoderDecoderModel

* fix import

* Add modeling_tf_vision_encoder_decoder to _ignore_modules in get_model_modules()

* reorder

* Apply the fix for checkpoint loading as in #14016

* remove attention_mask + fix VISION_DUMMY_INPUTS

* A minimal change to make TF generate() work for vision models as encoder in encoder-decoder setting

* fix wrong condition: shape_list(input_ids) == 2

* add tests

* use personal TFViTModel checkpoint (for now)

* Add equivalence tests + projection layer

* style

* make sure projection layer can run

* Add examples

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Clean comments (need to work on TODOs for PyTorch models)

* Remove TF -> PT in check_pt_tf_equivalence for TFVisionEncoderDecoderModel

* fixes

* Revert changes in PT code.

* Update tests/test_modeling_tf_vision_encoder_decoder.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Add test_inference_coco_en for TF test

* fix quality

* fix name

* build doc

* add main_input_name

* Fix ckpt name in test

* fix diff between master and this PR

* fix doc

* fix style and quality

* fix missing doc

* fix labels handling

* Delete auto.rst

* Add the changes done in #14016

* fix prefix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* make style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

b67fd797

[performance doc] Power and Cooling (#14935) · 37bc0b4e

Stas Bekman authored Jan 10, 2022



* [performance doc] Power and Cooling

* more docs

* Update docs/source/performance.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* reword
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

37bc0b4e

[DOC] fix doc examples for bart-like models (#15093) · 3e9fdcf0
Suraj Patil authored Jan 10, 2022
```
* fix doc examples

* remove double colons
```
3e9fdcf0
Happy New Year! (#15094) · 61d18ae0
Sylvain Gugger authored Jan 10, 2022

61d18ae0
[doc] normalize HF Transformers string (#15023) · 31838d3e
Stas Bekman authored Jan 10, 2022

31838d3e
Use tqdm.auto in Pipeline docs (#14920) · f21bc421
Santiago Castro authored Jan 10, 2022
```
It's better for e.g. notebook.
```
f21bc421
Model summary horizontal banners (#15058) · f012c00a
Mishig Davaadorj authored Jan 10, 2022

f012c00a
Fix style · af9cb949
Sylvain Gugger authored Jan 10, 2022

af9cb949

fix doc example - AttributeError: type object 'RagModel' has no attribute... · 533624c5

Yih-Dar authored Jan 10, 2022


fix doc example - AttributeError: type object 'RagModel' has no attribute 'from_question_encoder_generator_pretrained' (#15076)
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

533624c5

support the trocr small models (#14893) · b2c477fc

Minghao Li authored Jan 10, 2022



* support the trocr small models

* resolve conflict

* Update docs/source/model_doc/trocr.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/model_doc/trocr.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/model_doc/trocr.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* fix unexpected indent in processing_trocr.py

* Update src/transformers/models/trocr/processing_trocr.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* update the docstring of processing_trocr

* remove extra space
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

b2c477fc

Change assignee for tokenizers (#15088) · 42d57549
Lysandre Debut authored Jan 10, 2022

42d57549

Make OpenAIGPTTokenizer work with SpaCy 2.x and 3.x (#15019) · a54961c5

cody-moveworks authored Jan 10, 2022

* Make OpenAIGPTTokenizer work with SpaCy 3.x

SpaCy 3.x introduced an API change to creating the tokenizer that
breaks OpenAIGPTTokenizer. The old API for creating the tokenizer in
SpaCy 2.x no longer works under SpaCy 3.x, but the new API for creating
the tokenizer in SpaCy 3.x DOES work under SpaCy 2.x. Switching to the
new API should allow OpenAIGPTTokenizer to work under both SpaCy 2.x and
SpaCy 3.x versions.

* Add is_spacy_available and is_ftfy_available methods to file utils

* Add spacy and ftfy unittest decorator to testing utils

* Add tests for OpenAIGPTTokenizer that require spacy and ftfy

* Modify CircleCI config to run tests that require spacy and ftfy

* Remove unneeded unittest decorators are reuse test code

* Run make fixup

a54961c5

Update check_repo.py (#15014) · 9fbf7c87
Kamal Raj authored Jan 10, 2022
```
added new line
```
9fbf7c87
fix model table cell text alignment (#14999) · 0a03a868
Yih-Dar authored Jan 10, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
0a03a868

[Wav2Vec2 Speech Event] Add speech event v2 (#15083) · d72343d2

Patrick von Platen authored Jan 10, 2022

* up

* up

* up

* up

* up

* up

* improve

* up

* up

* Update src/transformers/trainer.py

* up

* up

* up

d72343d2

08 Jan, 2022 1 commit

Fix convert for newer megatron-lm bert model (#14082) · 768e6c14

yoquankara authored Jan 09, 2022

* Fix convert for newer megatron-lm models

* Save megatron-bert config in a proper way

* Fix code style

768e6c14

07 Jan, 2022 3 commits

[VisionTextDualEncoder] Add token_type_ids param (#15073) · 623b4f7c

Yih-Dar authored Jan 07, 2022



* fix doc example - TypeError: get_text_features() got an unexpected keyword argument 'token_type_ids'

* add token_type_ids param
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

623b4f7c

[Fix doc examples] Add missing from_pretrained (#15044) · ac224bb0

Yih-Dar authored Jan 07, 2022



* fix doc example - ValueError: Parameter config should be an instance of class `PretrainedConfig`

* Update src/transformers/models/segformer/modeling_segformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

ac224bb0

Resubmit changes after rebase to master (#14982) · f18c6fa9
K.C. Tung authored Jan 07, 2022

f18c6fa9

06 Jan, 2022 8 commits

[VisionTextDualEncoder] Fix doc example · cc406da4
Yih-Dar authored Jan 06, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
cc406da4
Update run_speech_recognition_seq2seq.py (#14967) · b67f345d
flozi00 authored Jan 06, 2022

b67f345d
Add 'with torch.no_grad()' to BertGeneration integration test forward passes (#14963) · f71fb5c3
Tavin Turner authored Jan 06, 2022

f71fb5c3
Remove old asserts. (#15012) · d2183a46
Nicolas Patry authored Jan 06, 2022

d2183a46
Add detectron2 to Github actions (#15053) · 83c552d3
NielsRogge authored Jan 06, 2022

83c552d3
wrapped forward passes in torch.no_grad() (#15037) · 5ab87cd4
Matt Churgin authored Jan 06, 2022

5ab87cd4
Enabling `TF` on `image-classification` pipeline. (#15030) · 5a06118b
Nicolas Patry authored Jan 06, 2022

5a06118b

Add Flax image captioning example (#14864) · 9f89fa02

Yih-Dar authored Jan 06, 2022



* add image captioning example

* update README

* fix style & quality

* simplify

* apply review suggestions

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply review suggestions

* add comments about using np instead jax array

* remove unused lines

* add model creation script

* only support from_pretrained

* fix style

* fix

* not use cache_dir when creating model

* fix tokenizer creation

* update README

* fix quality

* apply suggestion

* simplify some blocks

* Update examples/flax/image-captioning/README.md


* Update examples/flax/image-captioning/run_image_captioning_flax.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* apply suggestion
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

9f89fa02

05 Jan, 2022 6 commits
- [CLIP] Fix TF test (#15042) · 2e9af294
  Suraj Patil authored Jan 05, 2022
  
  2e9af294
- [SpeechEncoderDecoder] Fix from pretrained (#15043) · 443fdaf2
  Patrick von Platen authored Jan 05, 2022
  
  443fdaf2
- [CLIP] Fix PT test (#15041) · ae929dcb
  Patrick von Platen authored Jan 05, 2022
  
  ae929dcb
- Adding QoL for `batch_size` arg (like others enabled everywhere). (#15027) · 65cb94ff
  Nicolas Patry authored Jan 05, 2022
```
* Adding QoL for `batch_size` arg (like others enabled everywhere).

* Typo.
```
  65cb94ff
- Fix doc example: mask_time_indices (numpy) has no attribute 'to' (#15033) · e34dd055
  Yih-Dar authored Jan 05, 2022
```
* fix doc example - AttributeError: 'numpy.ndarray' object has no attribute 'to'

* fix more

* Apply suggestions from code review

* Update src/transformers/models/unispeech/modeling_unispeech.py
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  e34dd055
- [megatron convert] PYTHONPATH requirements (#14956) · 927f6544
  Stas Bekman authored Jan 05, 2022
```
* [megatron convert] PYTHONPATH requirements

* more info
```
  927f6544
04 Jan, 2022 1 commit
- [doc] Update parallelism.mdx (#15018) · 857ab55c
  Kevin Ko authored Jan 05, 2022
```
* Update parallelism.mdx

* Update parallelism.mdx
```
  857ab55c