Commits · 6c8f4c9a938a09749ea1b19a5fa2a8dd27e99a29 · chenpangpang / transformers

28 Jun, 2022 11 commits

Adding GroupViT Models (#17313) · 6c8f4c9a

Jerry Jiarui XU authored Jun 28, 2022



* add group vit and fixed test (except slow)

* passing slow test

* addressed some comments

* fixed test

* fixed style

* fixed copy

* fixed segmentation output

* fixed test

* fixed relative path

* fixed copy

* add ignore non auto configured

* fixed docstring, add doc

* fixed copies

* Apply suggestions from code review

merge suggestions
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* resolve comment, renaming model

* delete unused attr

* use fix copies

* resolve comments

* fixed attn

* remove unused vars

* refactor tests

* resolve final comments

* add demo notebook

* fixed inconsistent default

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* rename stage->stages

* Create single GroupViTEncoderLayer class

* Update conversion script

* Simplify conversion script

* Remove cross-attention class in favor of GroupViTAttention

* Convert other model as well, add processor to conversion script

* addressing final comment

* fixed args

* Update src/transformers/models/groupvit/modeling_groupvit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

6c8f4c9a

Mrbean/codegen onnx (#17903) · b424f0b4
mrbean authored Jun 28, 2022

b424f0b4
Add ONNX support for DETR (#17904) · 76d13de5
regisss authored Jun 28, 2022

76d13de5
In `group_texts` function, drop last block if smaller than `block_size` (#17908) · bfcd5743
Bill Ray authored Jun 28, 2022

bfcd5743
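
The behaviour named in the commit title above can be sketched as follows. This is a minimal, dependency-free illustration and not the code from the PR: the dict-of-lists input shape, the `block_size` argument, and the sample batch are assumptions made for the example.

```python
from itertools import chain


def group_texts(examples, block_size):
    """Concatenate tokenized texts and split them into blocks of block_size.

    Assumption: `examples` maps column names to lists of token lists,
    and every column has the same total number of tokens.
    """
    # Flatten every column into one long token list.
    concatenated = {
        key: list(chain.from_iterable(values)) for key, values in examples.items()
    }
    total_length = len(next(iter(concatenated.values()))) if concatenated else 0
    # Round down to a multiple of block_size, so a trailing block that is
    # smaller than block_size is always dropped (even if it is the only one).
    usable_length = (total_length // block_size) * block_size
    return {
        key: [tokens[i : i + block_size] for i in range(0, usable_length, block_size)]
        for key, tokens in concatenated.items()
    }


# Hypothetical batch: nine tokens in total, block_size of four.
batch = {"input_ids": [[1, 2, 3], [4, 5], [6, 7, 8, 9]]}
blocks = group_texts(batch, block_size=4)
print(blocks)
```

With nine tokens and a block size of four, the ninth token is discarded; an input shorter than `block_size` yields no blocks at all.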
Move logic into pixelshuffle layer (#17899) · f71895a6
amyeroberts authored Jun 28, 2022
```
* Move all pixelshuffle logic into layer

* Rename layer

* Use correct input to function
```
f71895a6
Fix loss computation in TFBertForPreTraining (#17898) · 0094565f
Matt authored Jun 28, 2022

0094565f
Pin black to 22.3.0 to benefit from a stable --preview flag (#17918) · 1dfa03f1
Lysandre Debut authored Jun 28, 2022

1dfa03f1
[M2M100] update conversion script (#17916) · 9eec4e93
Suraj Patil authored Jun 28, 2022

9eec4e93

Fix PyTorch/TF Auto tests (#17895) · db2644b9

Yih-Dar authored Jun 28, 2022



* add loading_info
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

db2644b9

Fix `test_number_of_steps_in_training_with_ipex` (#17889) · f717d47f
Yih-Dar authored Jun 28, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f717d47f
Update expected values in constrained beam search tests (#17887) · 0b0dd977
Yih-Dar authored Jun 28, 2022
```
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
0b0dd977

27 Jun, 2022 9 commits

Fix bug in gpt2's (from-scratch) special scaled weight initialization (#17877) · e02037b3

Andrej authored Jun 27, 2022



* only special scale init each gpt2 c_proj weight once, on exact match

* fix double quotes
Co-authored-by: leandro <leandro.vonwerra@spoud.io>

e02037b3
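
Why an exact name match touches each `c_proj` weight only once can be sketched without any framework. This is an illustration under stated assumptions, not the code from the PR: the nested-dict "module tree", the helper names, and the assumption that the earlier check was a substring test applied per module are all hypothetical.

```python
# Hypothetical stand-in for a module tree: a dict maps child names either to
# sub-modules (dicts) or to parameters (the placeholder string "param").
block = {
    "attn": {"c_attn": {"weight": "param"}, "c_proj": {"weight": "param"}},
    "mlp": {"c_fc": {"weight": "param"}, "c_proj": {"weight": "param"}},
}


def named_parameters(module, prefix=""):
    """Yield dotted parameter names relative to `module`."""
    for name, child in module.items():
        full = f"{prefix}{name}"
        if isinstance(child, dict):
            yield from named_parameters(child, prefix=f"{full}.")
        else:
            yield full


def modules(module, path=""):
    """Yield (path, module) for the module itself and every descendant."""
    yield path, module
    for name, child in module.items():
        if isinstance(child, dict):
            yield from modules(child, f"{path}.{name}" if path else name)


def count_inits(root, matches):
    """Count how often each parameter is hit when the check runs per module."""
    counts = {}
    for path, module in modules(root):
        for name in named_parameters(module):
            if matches(name):
                full = f"{path}.{name}" if path else name
                counts[full] = counts.get(full, 0) + 1
    return counts


def substring_match(name):
    # Assumed shape of the older, looser check.
    return "c_proj" in name and "weight" in name


def exact_match(name):
    # Matches only from the direct parent of the c_proj layer.
    return name == "c_proj.weight"


substring_counts = count_inits(block, substring_match)
exact_counts = count_inits(block, exact_match)
print(substring_counts, exact_counts)
```

In this toy tree the substring test fires once per ancestor of each `c_proj` weight, while the exact test fires only from the direct parent, so each weight is initialized a single time.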

Update README_zh-hans.md (#17861) · 6dd00f6b
JiJi authored Jun 28, 2022

6dd00f6b

bert: add conversion script for BERT Token Dropping TF2 checkpoints (#17142) · 71b2839f

Stefan Schweter authored Jun 27, 2022

* bert: add conversion script for BERT Token Dropping TF2 checkpoints

* bert: rename conversion script for BERT Token Dropping checkpoints

* bert: fix flake errors in BERT Token Dropping conversion script

* bert: make doc-builder happy!!1!11

* bert: fix pytorch_dump_path of BERT Token Dropping conversion script

71b2839f

Fix add new model like frameworks (#17869) · 98742829
Sylvain Gugger authored Jun 27, 2022
```
* Add new model like adds only the selected frameworks object in init

* Small fix
```
98742829
Add type annotations for RoFormer models (#17878) · afb71b67
Ian Castillo authored Jun 27, 2022

afb71b67

fix (#17890) · 9a345384

Yih-Dar authored Jun 27, 2022


Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

9a345384

fix mask (#17837) · 3ec7d4cf
Younes Belkada authored Jun 27, 2022

3ec7d4cf

Add a TF in-graph tokenizer for BERT (#17701) · ee0d001d

Matt authored Jun 27, 2022

* Add a TF in-graph tokenizer for BERT

* Add from_pretrained

* Add proper truncation, option handling to match other tokenizers

* Add proper imports and guards

* Add test, fix all the bugs exposed by said test

* Fix truncation of paired texts in graph mode, more test updates

* Small fixes, add a (very careful) test for savedmodel

* Add tensorflow-text dependency, make fixup

* Update documentation

* Update documentation

* make fixup

* Slight changes to tests

* Add some docstring examples

* Update tests

* Update tests and add proper lowercasing/normalization

* make fixup

* Add docstring for padding!

* Mark slow tests

* make fixup

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* make fixup

* Properly handle tensorflow-text dummies

ee0d001d

Fix TF GPT2 test_onnx_runtime_optimize (#17874) · 401fcca6
Yih-Dar authored Jun 27, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
401fcca6

25 Jun, 2022 1 commit

CLI: handle multimodal inputs (#17839) · cc5c061e
Joao Gante authored Jun 25, 2022

cc5c061e

24 Jun, 2022 13 commits

Properly get tests deps in test_fetcher (#17870) · e8eb699e
Sylvain Gugger authored Jun 24, 2022
```
* Properly get tests deps in test_fetcher

* Remove print
```
e8eb699e
Fix `test_inference_instance_segmentation_head` (#17872) · b03be78a
Yih-Dar authored Jun 24, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
b03be78a
Skip `test_multi_gpu_data_parallel_forward` for `MaskFormer` (#17864) · 494aac65
Yih-Dar authored Jun 24, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
494aac65

Use higher value for hidden_size in Flax BigBird test (#17822) · 0e0f1f46

Yih-Dar authored Jun 24, 2022



* Use higher value for hidden_size in Flax BigBird test

* remove 5e-5
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

0e0f1f46

Fix: torch.utils.checkpoint import error. (#17849) · 2ef94ee0
kumapo authored Jun 25, 2022

2ef94ee0

Add type hints for gptneox models (#17858) · ef28a402

willtai authored Jun 24, 2022

* feat: Add type hints for GPTNeoxForCausalLM and GPTNeoXModel

* fix: removed imported Dict type

* fix: Removed unused List import

ef28a402

[CodeGen] support device_map="auto" for sharded checkpoints (#17871) · 061a73d1
Suraj Patil authored Jun 24, 2022

061a73d1

Add CodeGen model (#17443) · d6b6fb99

rooa authored Jun 24, 2022



* Add CodeGen model

* Add missing key and switch order of super()

* Fix torch.ones init with uint8 instead of bool

* Address comments: copy statements and doc

* update tests

* remove old model parallel

* fix batch gen tests

* fix batch gen test

* update test_gpt2_sample_max_time

* fix codegen test and revert gpt2 test change

* Fix incorrect tie_word_embedding value, typo, URL

* Fix model order in README and styling

* Reorder model list alphabetically

* Set tie_word_embedding to False by default

* Apply suggestions from code review

* Better attn mask name & remove attn masked_bias

* add tokenizer for codegen

* quality

* doc tokenizer

* fix-copies

* add CodeGenTokenizer in converter

* make truncation optional

* add test for truncation

* add copyright

* fix-copies

* fix fast tokenizer decode

* Update src/transformers/models/codegen/tokenization_codegen.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* increase vocab_size in tests
Co-authored-by: patil-suraj <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d6b6fb99

Fix Splinter test (#17854) · 44749001
Yih-Dar authored Jun 24, 2022
```
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
44749001
[tests/VisionEncoderDecoder] import to_2tuple from test utils (#17865) · 73a0496c
Suraj Patil authored Jun 24, 2022

73a0496c

Fix Constrained beam search duplication and weird output issue (#17814) · bc7a6fdc

NaN authored Jun 24, 2022

* fix(ConstrainedBeamSearchScorer.step_sentence_constraint): avoid hypothesis duplication between topk and advance

* fix(GenerationMixin.constrained_beam_search): appropriately assign beam scores instead of token scores

bc7a6fdc

Improve encoder decoder model docs (#17815) · c2c0d9db

Vishwas authored Jun 24, 2022



* Copied all the changes from the last PR

* added in documentation_tests.txt

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: vishwaspai <vishwas.pai@emplay.net>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

c2c0d9db

Improve vision models (#17731) · 09178705

NielsRogge authored Jun 24, 2022



* Improve vision models

* Add a lot of improvements

* Remove to_2tuple from swin tests

* Fix TF Swin

* Fix more tests

* Fix copies

* Improve more models

* Fix ViTMAE test

* Add channel check for TF models

* Add proper channel check for TF models

* Apply suggestion from code review

* Apply suggestions from code review

* Add channel check for Flax models, apply suggestion

* Fix bug

* Add tests for greyscale images

* Add test for interpolation of pos encodings
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

09178705

23 Jun, 2022 6 commits

Auto-build Docker images before on-merge if setup.py was changed (#17573) · 893ab124
Zachary Mueller authored Jun 23, 2022
```
* Auto-build on setup modification

* Modify push-caller

* Make adjustments based on code review
```
893ab124
Properly calculate the total train iterations and recalculate num epochs in... · 75259b44
Zachary Mueller authored Jun 23, 2022
```
Properly calculate the total train iterations and recalculate num epochs in no_trainer scripts (#17856)
```
75259b44
Index RNG states by global rank in saves (#17852) · 7c1b9128
Sylvain Gugger authored Jun 23, 2022

7c1b9128

Nezha Pytorch implementation (#17776) · 7cf52a49

Sijun He authored Jun 24, 2022



* wip

* rebase

* all tests pass

* rebase

* ready for PR

* address comments

* fix styles

* add require_torch to pipeline test

* remove remote image to improve CI consistency

* address comments; fix tf/flax tests

* address comments; fix tf/flax tests

* fix tests; add alias

* repo consistency tests

* Update src/transformers/pipelines/visual_question_answering.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* address comments

* Update src/transformers/pipelines/visual_question_answering.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* merge

* wip

* wip

* wip

* most basic tests pass

* all tests pass now

* relative embedding

* wip

* running make fixup

* remove bert changes

* fix doc

* fix doc

* fix issues

* fix doc

* address comments

* fix CI

* remove redundant copied from

* address comments

* fix broken test
Co-authored-by: Sijun He <sijunhe@Sijuns-MacBook-Pro.local>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

7cf52a49

Change no trainer image_classification test (#17635) · acb709d5
Zachary Mueller authored Jun 23, 2022
```
* Adjust test arguments and use a new example test
```
acb709d5

Update modeling_cvt.py (#17846) · e70abdad

Fx039482 authored Jun 23, 2022

As shown in the Colab notebook, I added the missing type hints for `CvtForImageClassification` and `CvtModel`.

e70abdad