- 29 Jun, 2022 5 commits
-
-
Leon Derczynski authored
-
NielsRogge authored
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Santiago Castro authored
* Fix the Conda package build
* Update build.sh
* Update release-conda.yml
-
Michal Szutenberg authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 28 Jun, 2022 14 commits
-
-
Nicolas Patry authored
Fixing a regression with `return_all_scores` introduced in #17606. The legacy test actually tested `return_all_scores=False` (the actual default) instead of `return_all_scores=True` (the unusual legacy case). This commit adds the correct legacy test, actually fixes the regression (which also involves list outputs), and reduces the diffed code.
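A minimal sketch of the pipeline behaviour this flag controls, assuming a standard text-classification checkpoint (the model name below is only an example):

```python
from transformers import pipeline

# Illustrative checkpoint; any text-classification model behaves the same way.
classifier = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)

# Default (`return_all_scores=False`): one dict per input with the top label only.
print(classifier("I love this movie"))

# Legacy `return_all_scores=True`: a list of dicts, one per label, for each input.
print(classifier("I love this movie", return_all_scores=True))
```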
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Jerry Jiarui XU authored
* add group vit and fixed test (except slow)
* passing slow test
* addressed some comments
* fixed test
* fixed style
* fixed copy
* fixed segmentation output
* fixed test
* fixed relative path
* fixed copy
* add ignore non auto configured
* fixed docstring, add doc
* fixed copies
* Apply suggestions from code review (merge suggestions)
* resolve comment, renaming model
* delete unused attr
* use fix copies
* resolve comments
* fixed attn
* remove unused vars
* refactor tests
* resolve final comments
* add demo notebook
* fixed inconsistent default
* Apply suggestions from code review
* Apply suggestions from code review
* rename stage->stages
* Create single GroupViTEncoderLayer class
* Update conversion script
* Simplify conversion script
* Remove cross-attention class in favor of GroupViTAttention
* Convert other model as well, add processor to conversion script
* addressing final comment
* fixed args
* Update src/transformers/models/groupvit/modeling_groupvit.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
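A rough usage sketch for the newly added model; the checkpoint name and the zero-shot, CLIP-style interface shown here are assumptions based on the classes mentioned above:

```python
import requests
import torch
from PIL import Image
from transformers import AutoProcessor, GroupViTModel

ckpt = "nvidia/groupvit-gcc-yfcc"  # assumed public checkpoint name
processor = AutoProcessor.from_pretrained(ckpt)
model = GroupViTModel.from_pretrained(ckpt)

image = Image.open(
    requests.get("http://images.cocodataset.org/val2017/000000039769.jpg", stream=True).raw
)
inputs = processor(
    text=["a photo of a cat", "a photo of a dog"],
    images=image,
    return_tensors="pt",
    padding=True,
)

# GroupViT exposes a CLIP-like text/image similarity head.
with torch.no_grad():
    outputs = model(**inputs)
print(outputs.logits_per_image.softmax(dim=-1))
```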
-
mrbean authored
-
regisss authored
-
Bill Ray authored
-
amyeroberts authored
* Move all pixelshuffle logic into layer
* Rename layer
* Use correct input to function
-
Matt authored
-
Lysandre Debut authored
-
Suraj Patil authored
-
Yih-Dar authored
* add loading_info

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 27 Jun, 2022 9 commits
-
-
Andrej authored
* only apply the special scaled init to each GPT-2 `c_proj` weight once, matching the parameter name exactly
* fix double quotes

Co-authored-by: leandro <leandro.vonwerra@spoud.io>
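A sketch of the idea behind the fix (the helper below is illustrative, not the exact `_init_weights` code): GPT-2 re-initializes residual projection weights with a std scaled by 1/sqrt(2 * n_layer), and that scaling should run exactly once per `c_proj.weight`:

```python
import math
from torch import nn

def init_residual_projections(module: nn.Module, initializer_range: float, n_layer: int) -> None:
    """Illustrative helper: GPT-2-style scaled init for residual projections."""
    for name, param in module.named_parameters():
        # Exact name match, not a substring check, so the scaling is applied once per weight.
        if name == "c_proj.weight":
            param.data.normal_(mean=0.0, std=initializer_range / math.sqrt(2 * n_layer))
```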
-
JiJi authored
-
Stefan Schweter authored
* bert: add conversion script for BERT Token Dropping TF2 checkpoints
* bert: rename conversion script for BERT Token Dropping checkpoints
* bert: fix flake errors in BERT Token Dropping conversion script
* bert: make doc-builder happy
* bert: fix pytorch_dump_path of BERT Token Dropping conversion script
-
Sylvain Gugger authored
* The "add new model like" command now adds only the selected frameworks' objects in the init
* Small fix
-
Ian Castillo authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Younes Belkada authored
-
Matt authored
* Add a TF in-graph tokenizer for BERT
* Add from_pretrained
* Add proper truncation, option handling to match other tokenizers
* Add proper imports and guards
* Add test, fix all the bugs exposed by said test
* Fix truncation of paired texts in graph mode, more test updates
* Small fixes, add a (very careful) test for savedmodel
* Add tensorflow-text dependency, make fixup
* Update documentation
* Update documentation
* make fixup
* Slight changes to tests
* Add some docstring examples
* Update tests
* Update tests and add proper lowercasing/normalization
* make fixup
* Add docstring for padding!
* Mark slow tests
* make fixup
* Fall back to BertTokenizerFast if BertTokenizer is unavailable
* Fall back to BertTokenizerFast if BertTokenizer is unavailable
* make fixup
* Properly handle tensorflow-text dummies
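A rough sketch of what the in-graph tokenizer enables, assuming the class is exposed as `TFBertTokenizer` and the `tensorflow-text` package is installed (the checkpoint name is just an example):

```python
import tensorflow as tf
from transformers import TFBertTokenizer

tokenizer = TFBertTokenizer.from_pretrained("bert-base-uncased")

# Tokenization runs as TensorFlow ops, so it can be traced into a tf.function
# (and therefore exported inside a SavedModel together with the model).
@tf.function
def tokenize(texts):
    return tokenizer(texts)

print(tokenize(tf.constant(["Hello world!", "In-graph tokenization."])))
```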
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 25 Jun, 2022 1 commit
-
-
Joao Gante authored
-
- 24 Jun, 2022 11 commits
-
-
Sylvain Gugger authored
* Properly get tests deps in test_fetcher
* Remove print
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* Use higher value for hidden_size in Flax BigBird test
* remove 5e-5

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
kumapo authored
-
willtai authored
* feat: Add type hints for GPTNeoXForCausalLM and GPTNeoXModel
* fix: removed imported Dict type
* fix: Removed unused List import
-
Suraj Patil authored
-
rooa authored
* Add CodeGen model
* Add missing key and switch order of super()
* Fix torch.ones init with uint8 instead of bool
* Address comments: copy statements and doc
* update tests
* remove old model parallel
* fix batch gen tests
* fix batch gen test
* update test_gpt2_sample_max_time
* fix codegen test and revert gpt2 test change
* Fix incorrect tie_word_embedding value, typo, URL
* Fix model order in README and styling
* Reorder model list alphabetically
* Set tie_word_embedding to False by default
* Apply suggestions from code review
* Better attn mask name & remove attn masked_bias
* add tokenizer for codegen
* quality
* doc tokenizer
* fix-copies
* add CodeGenTokenizer in converter
* make truncation optional
* add test for truncation
* add copyright
* fix-copies
* fix fast tokenizer decode
* Update src/transformers/models/codegen/tokenization_codegen.py
* increase vocab_size in tests

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: patil-suraj <surajp815@gmail.com>
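A rough usage sketch for the new model; the checkpoint name ("Salesforce/codegen-350M-mono") is an assumed public example, and the auto classes are used for brevity:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

ckpt = "Salesforce/codegen-350M-mono"  # assumed example checkpoint
tokenizer = AutoTokenizer.from_pretrained(ckpt)
model = AutoModelForCausalLM.from_pretrained(ckpt)

# CodeGen is a causal LM trained on code, so a code prompt is completed left to right.
inputs = tokenizer("def hello_world():", return_tensors="pt")
with torch.no_grad():
    generated = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(generated[0], skip_special_tokens=True))
```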
-
Yih-Dar authored
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Suraj Patil authored
-
NaN authored
* fix(ConstrainedBeamSearchScorer.step_sentence_constraint): avoid hypothesis duplication between topk and advance
* fix(GenerationMixin.constrained_beam_search): appropriately assign beam scores instead of token scores
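For context, constrained beam search is reached through `generate()` when words are forced into the output; a minimal sketch, with the model choice and forced word as arbitrary examples:

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")

inputs = tokenizer(
    "translate English to German: The house is wonderful.", return_tensors="pt"
)

# Forcing a word routes generation through constrained beam search
# (ConstrainedBeamSearchScorer under the hood).
force_words_ids = tokenizer(["Haus"], add_special_tokens=False).input_ids

outputs = model.generate(
    **inputs,
    force_words_ids=force_words_ids,
    num_beams=5,
    max_new_tokens=32,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```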
-