Commits · b424f0b4a301abcbf3c282114159371ee44c3e01 · chenpangpang / transformers

28 Jun, 2022 7 commits
- Mrbean/codegen onnx (#17903) · b424f0b4
  mrbean authored Jun 28, 2022
  
  b424f0b4
- Add ONNX support for DETR (#17904) · 76d13de5
  regisss authored Jun 28, 2022
  
  76d13de5
- Move logic into pixelshuffle layer (#17899) · f71895a6
  amyeroberts authored Jun 28, 2022
```
* Move all pixelshuffle logic into layer

* Rename layer

* Use correct input to function
```
  f71895a6
- Fix loss computation in TFBertForPreTraining (#17898) · 0094565f
  Matt authored Jun 28, 2022
  
  0094565f
- Pin black to 22.3.0 to benefit from a stable --preview flag (#17918) · 1dfa03f1
  Lysandre Debut authored Jun 28, 2022
  
  1dfa03f1
- [M2M100] update conversion script (#17916) · 9eec4e93
  Suraj Patil authored Jun 28, 2022
  
  9eec4e93
- Fix PyTorch/TF Auto tests (#17895) · db2644b9
  Yih-Dar authored Jun 28, 2022
```
* add loading_info
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  db2644b9
27 Jun, 2022 6 commits

Fix bug in gpt2's (from-scratch) special scaled weight initialization (#17877) · e02037b3

Andrej authored Jun 27, 2022



* only special scale init each gpt2 c_proj weight once, on exact match

* fix double quotes
Co-authored-by: leandro <leandro.vonwerra@spoud.io>

e02037b3
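
A minimal sketch of why the exact match matters, assuming the initializer runs once per module in a nested tree and each run walks that module's parameters by relative name. The module tree and the helper names below are hypothetical illustrations, not the library's code.

```python
from collections import Counter

# Hypothetical module tree: a dict is a module, a string is a parameter leaf.
BLOCK = {
    "attn": {
        "c_attn": {"weight": "param", "bias": "param"},
        "c_proj": {"weight": "param", "bias": "param"},
    },
    "mlp": {
        "c_fc": {"weight": "param", "bias": "param"},
        "c_proj": {"weight": "param", "bias": "param"},
    },
}


def named_parameters(module, path, prefix=""):
    """Yield (name relative to `module`, absolute path) for every parameter below it."""
    for key, child in module.items():
        if isinstance(child, dict):
            yield from named_parameters(child, f"{path}.{key}", f"{prefix}{key}.")
        else:
            yield f"{prefix}{key}", f"{path}.{key}"


def modules(module, path):
    """Yield (path, module) for the module itself and every submodule."""
    yield path, module
    for key, child in module.items():
        if isinstance(child, dict):
            yield from modules(child, f"{path}.{key}")


def count_inits(root, matches):
    """Count how often each parameter would receive the special scaled init."""
    counts = Counter()
    for path, module in modules(root, "block"):
        for name, absolute in named_parameters(module, path):
            if matches(name):
                counts[absolute] += 1
    return counts


# Substring test: also fires from ancestor modules, so a weight is re-initialized.
loose = count_inits(BLOCK, lambda n: "c_proj" in n and "weight" in n)
# Exact test: fires only from the direct parent of c_proj.
exact = count_inits(BLOCK, lambda n: n == "c_proj.weight")
```

In this toy tree the substring test touches each `c_proj.weight` twice, while the exact match touches it once; a deeper tree would repeat the init once per ancestor.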

bert: add conversion script for BERT Token Dropping TF2 checkpoints (#17142) · 71b2839f

Stefan Schweter authored Jun 27, 2022

* bert: add conversion script for BERT Token Dropping TF2 checkpoints

* bert: rename conversion script for BERT Token Dropping checkpoints

* bert: fix flake errors in BERT Token Dropping conversion script

* bert: make doc-builder happy!!1!11

* bert: fix pytorch_dump_path of BERT Token Dropping conversion script

71b2839f

Fix add new model like frameworks (#17869) · 98742829
Sylvain Gugger authored Jun 27, 2022
```
* Add new model like adds only the selected frameworks object in init

* Small fix
```
98742829
Add type annotations for RoFormer models (#17878) · afb71b67
Ian Castillo authored Jun 27, 2022

afb71b67
fix mask (#17837) · 3ec7d4cf
Younes Belkada authored Jun 27, 2022

3ec7d4cf

Add a TF in-graph tokenizer for BERT (#17701) · ee0d001d

Matt authored Jun 27, 2022

* Add a TF in-graph tokenizer for BERT

* Add from_pretrained

* Add proper truncation, option handling to match other tokenizers

* Add proper imports and guards

* Add test, fix all the bugs exposed by said test

* Fix truncation of paired texts in graph mode, more test updates

* Small fixes, add a (very careful) test for savedmodel

* Add tensorflow-text dependency, make fixup

* Update documentation

* Update documentation

* make fixup

* Slight changes to tests

* Add some docstring examples

* Update tests

* Update tests and add proper lowercasing/normalization

* make fixup

* Add docstring for padding!

* Mark slow tests

* make fixup

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* make fixup

* Properly handle tensorflow-text dummies

ee0d001d

25 Jun, 2022 1 commit
- CLI: handle multimodal inputs (#17839) · cc5c061e
  Joao Gante authored Jun 25, 2022
  
  cc5c061e
24 Jun, 2022 6 commits

Fix: torch.utils.checkpoint import error. (#17849) · 2ef94ee0
kumapo authored Jun 25, 2022

2ef94ee0

Add type hints for gptneox models (#17858) · ef28a402

willtai authored Jun 24, 2022

* feat: Add type hints for GPTNeoxForCausalLM and GPTNeoXModel

* fix: removed imported Dict type

* fix: Removed unused List import

ef28a402

[CodeGen] support device_map="auto" for sharded checkpoints (#17871) · 061a73d1
Suraj Patil authored Jun 24, 2022

061a73d1

Add CodeGen model (#17443) · d6b6fb99

rooa authored Jun 24, 2022



* Add CodeGen model

* Add missing key and switch order of super()

* Fix torch.ones init with uint8 instead of bool

* Address comments: copy statements and doc

* update tests

* remove old model parallel

* fix batch gen tests

* fix batch gen test

* update test_gpt2_sample_max_time

* fix codegen test and revert gpt2 test change

* Fix incorrect tie_word_embedding value, typo, URL

* Fix model order in README and styling

* Reorder model list alphabetically

* Set tie_word_embedding to False by default

* Apply suggestions from code review

* Better attn mask name & remove attn masked_bias

* add tokenizer for codegen

* quality

* doc tokenizer

* fix-copies

* add CodeGenTokenizer in converter

* make truncation optional

* add test for truncation

* add copyright

* fix-copies

* fix fast tokenizer decode

* Update src/transformers/models/codegen/tokenization_codegen.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* increase vocab_size in tests
Co-authored-by: patil-suraj <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d6b6fb99

Fix Constrained beam search duplication and weird output issue (#17814) · bc7a6fdc

NaN authored Jun 24, 2022

* fix(ConstrainedBeamSearchScorer.step_sentence_constraint): avoid hypothesis duplication between topk and advance

* fix(GenerationMixin.constrained_beam_search): appropriately assign beam scores instead of token scores

bc7a6fdc

Improve vision models (#17731) · 09178705

NielsRogge authored Jun 24, 2022



* Improve vision models

* Add a lot of improvements

* Remove to_2tuple from swin tests

* Fix TF Swin

* Fix more tests

* Fix copies

* Improve more models

* Fix ViTMAE test

* Add channel check for TF models

* Add proper channel check for TF models

* Apply suggestion from code review

* Apply suggestions from code review

* Add channel check for Flax models, apply suggestion

* Fix bug

* Add tests for greyscale images

* Add test for interpolation of pos encodings
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

09178705
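
A minimal sketch of the kind of channel check the bullets above describe, assuming pixel values arrive as a `(batch, channels, height, width)` shape and the expected count comes from the model configuration. The function name and the error wording are hypothetical.

```python
def check_num_channels(pixel_shape, num_channels):
    """Raise if the channel dimension of the input disagrees with the configuration."""
    if len(pixel_shape) != 4:
        raise ValueError(
            f"Expected a (batch, channels, height, width) shape, got {pixel_shape}."
        )
    if pixel_shape[1] != num_channels:
        raise ValueError(
            f"Input has {pixel_shape[1]} channel(s) but the configuration expects "
            f"{num_channels}. A greyscale image needs num_channels=1."
        )


# An RGB batch passes when the configuration expects three channels.
check_num_channels((2, 3, 224, 224), num_channels=3)
```

Failing early with a readable message is friendlier than letting a convolution raise a shape error deep inside the embedding layer.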

23 Jun, 2022 11 commits

Index RNG states by global rank in saves (#17852) · 7c1b9128
Sylvain Gugger authored Jun 23, 2022

7c1b9128

Nezha Pytorch implementation (#17776) · 7cf52a49

Sijun He authored Jun 24, 2022



* wip

* rebase

* all tests pass

* rebase

* ready for PR

* address comments

* fix styles

* add require_torch to pipeline test

* remove remote image to improve CI consistency

* address comments; fix tf/flax tests

* address comments; fix tf/flax tests

* fix tests; add alias

* repo consistency tests

* Update src/transformers/pipelines/visual_question_answering.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* address comments

* Update src/transformers/pipelines/visual_question_answering.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* merge

* wip

* wip

* wip

* most basic tests pass

* all tests pass now

* relative embedding

* wip

* running make fixup

* remove bert changes

* fix doc

* fix doc

* fix issues

* fix doc

* address comments

* fix CI

* remove redundant copied from

* address comments

* fix broken test
Co-authored-by: Sijun He <sijunhe@Sijuns-MacBook-Pro.local>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

7cf52a49

Update modeling_cvt.py (#17846) · e70abdad

Fx039482 authored Jun 23, 2022

As shown in the Colab notebook, I added the missing type hints for `CvtForImageClassification` and `CvtModel`.

e70abdad

BLOOM minor changes on tokenizer (#17823) · 18c263c4

Younes Belkada authored Jun 23, 2022



* few fixes:

- hardcode tokenizer padding side
- remove unused args

* few fixes:

- added new attribute on TokenizerTesterMixin
- added new slow test
- remove unused arg on tokenizer class

* make style

* Update src/transformers/models/bloom/tokenization_bloom_fast.py
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

* make quality

* apply changes

- remove new attribute
- redefine test on the class

* add comments
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

18c263c4

Fix an error message in BigBird (#17840) · 5bc779ae
Yih-Dar authored Jun 23, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
5bc779ae
Fix properties of unset special tokens in non verbose mode (#17797) · 3eed5530
Guillaume Klein authored Jun 23, 2022
```
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
```
3eed5530
change message (#17836) · b2fdbacc
SaulLu authored Jun 23, 2022

b2fdbacc

Add missing type hints for QDQBertModel (#17783) · d37a68e6

willtai authored Jun 23, 2022

* Feat: add missing type hints for QDQBertModel

* fix: ran black and isort

* feat: Add missing output type for QDQBertModel

* feat: Add type hints for QDQBertLMHeadModel and models starting with QDQBertFor

* fix: add missing return type for QDQBertModel

* fix: remove wrong return type for QDQBertEmbeddings

* fix: readded config argument to load_tf_weights_in_qdqbert

* fix: add BertConfig type to BertEmbeddings config due to check error in CI

* fix: removed config type hints to avoid copy checks

d37a68e6

Update type hints modeling_yoso.py (#17827) · 4297f44b

Fx039482 authored Jun 23, 2022

* Update modeling_yoso.py

* make fixup

* Update modeling_yoso.py

That should be it copied from previous PR

4297f44b

TF: generate without `tf.TensorArray` (#17801) · 5cce3076
Joao Gante authored Jun 23, 2022

5cce3076

add doctests for DETR (#17786) · ab223fc1

Quentin authored Jun 23, 2022

* add: check labels for detr object detection doctests

* add: check shapes

* add: add detr to documentation_tests.py

* fix: make fixup output

* fix: add a comment

ab223fc1

22 Jun, 2022 5 commits

Offload fixes (#17810) · df8e6804
Sylvain Gugger authored Jun 22, 2022
```
* Offload fixes

* Add a test
```
df8e6804

CLI: use hub's `create_commit` (#17755) · 0d0c392c

Joao Gante authored Jun 22, 2022

* use create_commit

* better commit message and description

* touch setup.py to trigger cache update

* add hub version gating

0d0c392c

initial commit (#17818) · 56b83cf0
Arthur authored Jun 22, 2022

56b83cf0

Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict` (#17805) · 13570381

Eran Hirsch authored Jun 22, 2022

* Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict`

* Add all generate parameters to `Seq2SeqTrainer`, and also to `QuestionAnsweringSeq2SeqTrainer` which overrides it

* Remove `self._num_beams` from trainer classes

* Run fixup; fix "Constraint" not exposed; fix synced_gpus to actually read from param

* Use kwargs

* Copy kwargs before making changes to it

* Fix style issues unused imports

13570381
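
A minimal sketch of the "copy kwargs before making changes" step, assuming generation options are passed as a plain dict and missing values fall back to trainer-level defaults. The function and parameter names are hypothetical, not the trainer's actual signature.

```python
def resolve_gen_kwargs(gen_kwargs, default_max_length, default_num_beams):
    """Return generation kwargs with defaults filled in, leaving the caller's dict untouched."""
    resolved = gen_kwargs.copy()  # shallow copy so the caller's dict is never mutated
    if resolved.get("max_length") is None:
        resolved["max_length"] = default_max_length
    if resolved.get("num_beams") is None:
        resolved["num_beams"] = default_num_beams
    return resolved


user_kwargs = {"num_beams": 4}
resolved = resolve_gen_kwargs(user_kwargs, default_max_length=128, default_num_beams=1)
```

Without the copy, the defaults written during one `evaluate` call would leak into the dict the caller reuses for the next call.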

Flax sharded (#17760) · 16c6eb7c
Arthur authored Jun 22, 2022

16c6eb7c

21 Jun, 2022 4 commits

Fix `top_k_top_p_filtering` having unexpected behavior (#17744) · 3b00b623

unifyh authored Jun 22, 2022

- Fix `top_k_top_p_filtering` not passing `filter_value` to `TopPLogitsWarper`, causing any top-p filtered logits to be -inf instead of the specified value

- Add corresponding test

3b00b623
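
A minimal pure-Python sketch of the behavior the fix restores, assuming a single list of logits and a two-stage filter. These helpers are simplified stand-ins written for illustration; they are not the library's warper classes.

```python
import math


def top_k_filter(logits, top_k, filter_value=-math.inf):
    """Keep the top_k largest logits and replace the rest with filter_value."""
    if top_k <= 0 or top_k >= len(logits):
        return list(logits)
    threshold = sorted(logits, reverse=True)[top_k - 1]
    return [x if x >= threshold else filter_value for x in logits]


def top_p_filter(logits, top_p, filter_value=-math.inf):
    """Keep the smallest set of most likely tokens whose cumulative probability reaches top_p."""
    if top_p >= 1.0:
        return list(logits)
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]  # numerically stable softmax
    total = sum(exps)
    order = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
    keep, cumulative = set(), 0.0
    for i in order:
        keep.add(i)
        cumulative += exps[i] / total
        if cumulative >= top_p:
            break
    return [x if i in keep else filter_value for i, x in enumerate(logits)]


def top_k_top_p_filtering(logits, top_k=0, top_p=1.0, filter_value=-math.inf):
    """Apply both filters, forwarding the same filter_value to each stage."""
    logits = top_k_filter(logits, top_k, filter_value)
    return top_p_filter(logits, top_p, filter_value)


filtered = top_k_top_p_filtering([2.0, 1.0, 0.0, -1.0], top_p=0.5, filter_value=-100.0)
```

Dropping the `filter_value` argument from the second stage reproduces the bug: tokens removed by top-p would come back as `-inf` even though the caller asked for a finite value.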

Remove duplicate code (#17708) · 3ccff0d4
Kyungmin Lee authored Jun 22, 2022

3ccff0d4

Improve error message Union not allowed (#17769) · 26a6a426

Bram Vanroy authored Jun 21, 2022



* Improve error message Union not allowed

* make style

* Update src/transformers/hf_argparser.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

26a6a426
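
A minimal sketch of a descriptive error for unsupported `Union` fields, assuming the parser inspects dataclass fields and accepts only `Optional[X]`. The helper name and the message text are hypothetical and written with the standard library only.

```python
from dataclasses import dataclass, fields
from typing import Optional, Union, get_args, get_origin


def check_no_ambiguous_union(dataclass_type):
    """Raise a descriptive error for any field typed as a Union other than Optional[X]."""
    for field in fields(dataclass_type):
        if get_origin(field.type) is Union:
            args = get_args(field.type)
            if len(args) != 2 or type(None) not in args:
                raise ValueError(
                    f"Field '{field.name}' has type {field.type}, but only "
                    "`Optional[X]` (that is, `Union[X, None]`) can be parsed from "
                    "the command line. Pick a single type for this field."
                )


@dataclass
class GoodArgs:
    seed: Optional[int] = None


@dataclass
class BadArgs:
    lr: Union[int, str] = 1


check_no_ambiguous_union(GoodArgs)  # passes silently
```

Naming the offending field and its type in the message saves the user from hunting through a large arguments dataclass.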

Add final_layer_norm to OPT model (#17785) · abc400b0

Thomas Wang authored Jun 21, 2022



* Add final_layer_norm to OPT model

* Add JAX and TF version

* Fix Keras name

* Woops

* Allow for non breaking change

* Apply suggestions from code review

* add tests
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

abc400b0