Commits · 3ec7d4cfe4808ff034db12f2bc781d6173a9932c · chenpangpang / transformers

27 Jun, 2022 3 commits

fix mask (#17837) · 3ec7d4cf
Younes Belkada authored Jun 27, 2022

3ec7d4cf

Add a TF in-graph tokenizer for BERT (#17701) · ee0d001d

Matt authored Jun 27, 2022

* Add a TF in-graph tokenizer for BERT

* Add from_pretrained

* Add proper truncation, option handling to match other tokenizers

* Add proper imports and guards

* Add test, fix all the bugs exposed by said test

* Fix truncation of paired texts in graph mode, more test updates

* Small fixes, add a (very careful) test for savedmodel

* Add tensorflow-text dependency, make fixup

* Update documentation

* Update documentation

* make fixup

* Slight changes to tests

* Add some docstring examples

* Update tests

* Update tests and add proper lowercasing/normalization

* make fixup

* Add docstring for padding!

* Mark slow tests

* make fixup

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* Fall back to BertTokenizerFast if BertTokenizer is unavailable

* make fixup

* Properly handle tensorflow-text dummies

ee0d001d

Fix TF GPT2 test_onnx_runtime_optimize (#17874) · 401fcca6
Yih-Dar authored Jun 27, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
401fcca6

25 Jun, 2022 1 commit
- CLI: handle multimodal inputs (#17839) · cc5c061e
  Joao Gante authored Jun 25, 2022
  
  cc5c061e
24 Jun, 2022 13 commits

Properly get tests deps in test_fetcher (#17870) · e8eb699e
Sylvain Gugger authored Jun 24, 2022
```
* Properly get tests deps in test_fetcher

* Remove print
```
e8eb699e
Fix `test_inference_instance_segmentation_head` (#17872) · b03be78a
Yih-Dar authored Jun 24, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
b03be78a
Skip `test_multi_gpu_data_parallel_forward` for `MaskFormer` (#17864) · 494aac65
Yih-Dar authored Jun 24, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
494aac65

Use higher value for hidden_size in Flax BigBird test (#17822) · 0e0f1f46

Yih-Dar authored Jun 24, 2022



* Use higher value for hidden_size in Flax BigBird test

* remove 5e-5
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

0e0f1f46

Fix: torch.utils.checkpoint import error. (#17849) · 2ef94ee0
kumapo authored Jun 25, 2022

2ef94ee0

Add type hints for gptneox models (#17858) · ef28a402

willtai authored Jun 24, 2022

* feat: Add type hints for GPTNeoxForCausalLM and GPTNeoXModel

* fix: removed imported Dict type

* fix: Removed unused List import

ef28a402

[CodeGen] support device_map="auto" for sharded checkpoints (#17871) · 061a73d1
Suraj Patil authored Jun 24, 2022

061a73d1

Add CodeGen model (#17443) · d6b6fb99

rooa authored Jun 24, 2022



* Add CodeGen model

* Add missing key and switch order of super()

* Fix torch.ones init with uint8 instead of bool

* Address comments: copy statements and doc

* update tests

* remove old model parallel

* fix batch gen tests

* fix batch gen test

* update test_gpt2_sample_max_time

* fix codgen test and revert gpt2 test change

* Fix incorrect tie_word_embedding value, typo, URL

* Fix model order in README and styling

* Reorder model list alphabetically

* Set tie_word_embedding to False by default

* Apply suggestions from code review

* Better attn mask name & remove attn masked_bias

* add tokenizer for codegen

* quality

* doc tokenizer

* fix-copies

* add CodeGenTokenizer in converter

* make truncation optional

* add test for truncation

* add copyright

* fix-copies

* fix fast tokenizer decode

* Update src/transformers/models/codegen/tokenization_codegen.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* increase vocab_size in tests
Co-authored-by: patil-suraj <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d6b6fb99

Fix Splinter test (#17854) · 44749001
Yih-Dar authored Jun 24, 2022
```
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
44749001
[tests/VisionEncoderDecoder] import to_2tuple from test utils (#17865) · 73a0496c
Suraj Patil authored Jun 24, 2022

73a0496c

Fix Constrained beam search duplication and weird output issue (#17814) · bc7a6fdc

NaN authored Jun 24, 2022

* fix(ConstrainedBeamSearchScorer.step_sentence_constraint): avoid hypothesis duplication between topk and advance

* fix(GenerationMixin.constrained_beam_search): appropriately assign beam scores instead of token scores

bc7a6fdc

Improve encoder decoder model docs (#17815) · c2c0d9db

Vishwas authored Jun 24, 2022



* Copied all the changes from the last PR

* added in documentation_tests.txt

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/encoder-decoder.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: vishwaspai <vishwas.pai@emplay.net>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

c2c0d9db

Improve vision models (#17731) · 09178705

NielsRogge authored Jun 24, 2022



* Improve vision models

* Add a lot of improvements

* Remove to_2tuple from swin tests

* Fix TF Swin

* Fix more tests

* Fix copies

* Improve more models

* Fix ViTMAE test

* Add channel check for TF models

* Add proper channel check for TF models

* Apply suggestion from code review

* Apply suggestions from code review

* Add channel check for Flax models, apply suggestion

* Fix bug

* Add tests for greyscale images

* Add test for interpolation of pos encodigns
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

09178705

23 Jun, 2022 17 commits

Auto-build Docker images before on-merge if setup.py was changed (#17573) · 893ab124
Zachary Mueller authored Jun 23, 2022
```
* Auto-build on setup modification

* Modify push-caller

* Make adjustments based on code review
```
893ab124
Properly calculate the total train iterations and recalculate num epochs in... · 75259b44
Zachary Mueller authored Jun 23, 2022
```
Properly calculate the total train iterations and recalculate num epochs in no_trainer scripts (#17856)
```
75259b44
Index RNG states by global rank in saves (#17852) · 7c1b9128
Sylvain Gugger authored Jun 23, 2022

7c1b9128

Nezha Pytorch implementation (#17776) · 7cf52a49

Sijun He authored Jun 24, 2022



* wip

* rebase

* all tests pass

* rebase

* ready for PR

* address comments

* fix styles

* add require_torch to pipeline test

* remove remote image to improve CI consistency

* address comments; fix tf/flax tests

* address comments; fix tf/flax tests

* fix tests; add alias

* repo consistency tests

* Update src/transformers/pipelines/visual_question_answering.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* address comments

* Update src/transformers/pipelines/visual_question_answering.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* merge

* wip

* wip

* wip

* most basic tests passes

* all tests pass now

* relative embedding

* wip

* running make fixup

* remove bert changes

* fix doc

* fix doc

* fix issues

* fix doc

* address comments

* fix CI

* remove redundant copied from

* address comments

* fix broken test
Co-authored-by: Sijun He <sijunhe@Sijuns-MacBook-Pro.local>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

7cf52a49

Change no trainer image_classification test (#17635) · acb709d5
Zachary Mueller authored Jun 23, 2022
```
* Adjust test arguments and use a new example test
```
acb709d5

Update modeling_cvt.py (#17846) · e70abdad

Fx039482 authored Jun 23, 2022

As shown in the colab notebook I added the missing type hints for " CvtForImageClassification
CvtModel
"

e70abdad

Fix broken test for models with batchnorm (#17841) · 1a7ef334

Matt authored Jun 23, 2022

* Fix tests that broke when models used batchnorm

* Initializing the model twice does not actually...
...give you the same weights each time.
I am good at machine learning.

* Fix speed regression

1a7ef334

BLOOM minor changes on tokenizer (#17823) · 18c263c4

Younes Belkada authored Jun 23, 2022



* few fixes:

- hardcode tokenizer padding side
- remove unused args

* few fixes:

- added new attribute on TokenizerTesterMixin
- added new slow test
- remove unused arg on tokenizer class

* make style

* Update src/transformers/models/bloom/tokenization_bloom_fast.py
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

* make quality

* apply changes

- remove new attribute
- redefine test on the class

* add comments
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

18c263c4

Improve performance docs (#17750) · 6f29029b

Leandro von Werra authored Jun 23, 2022



* add skeleton files

* fix cpu inference link

* add hint to make clear that single gpu section contains general info

* add new files to ToC

* update toctree to have subsection for performance

* add "coming soon" to the still empty sections

* fix missing title

* fix typo

* add reference to empty documents

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

6f29029b

Fix an error message in BigBird (#17840) · 5bc779ae
Yih-Dar authored Jun 23, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
5bc779ae
Fix properties of unset special tokens in non verbose mode (#17797) · 3eed5530
Guillaume Klein authored Jun 23, 2022
```
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
```
3eed5530
change message (#17836) · b2fdbacc
SaulLu authored Jun 23, 2022

b2fdbacc

Add missing type hints for QDQBertModel (#17783) · d37a68e6

willtai authored Jun 23, 2022

* Feat: add missing type hints for QDQBertModel

* fix: ran black and isort

* feat: Add missing output type for QDQBertModel

* feat: Add type hints for QDQBertLMHeadModel and models starting with QDQBertFor

* fix: add missing return type for QDQBertModel

* fix: remove wrong return type for QDQBertEmbeddings

* fix: readded config argument to load_tf_weights_in_qdqbert

* fix: add BertConfig type to BertEmbeddings config due t checko error in ci

* fix: removed config type hints to avoid copy checks

d37a68e6

Update type hints modeling_yoso.py (#17827) · 4297f44b

Fx039482 authored Jun 23, 2022

* Update modeling_yoso.py

* make fixup

* Update modeling_yoso.py

That should be it copied from previous PR

4297f44b

TF: generate without `tf.TensorArray` (#17801) · 5cce3076
Joao Gante authored Jun 23, 2022

5cce3076

add doctests for DETR (#17786) · ab223fc1

Quentin authored Jun 23, 2022

* add: check labels for detr object detection doctests

* add: check shapes

* add: add detr to documentation_tests.py

* fix: make fixup output

* fix: add a comment

ab223fc1

Fix push CI artifact path (#17788) · 8d634b70
Yih-Dar authored Jun 23, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8d634b70

22 Jun, 2022 6 commits

Offload fixes (#17810) · df8e6804
Sylvain Gugger authored Jun 22, 2022
```
* Offload fixes

* Add a test
```
df8e6804

CLI: use hub's `create_commit` (#17755) · 0d0c392c

Joao Gante authored Jun 22, 2022

* use create_commit

* better commit message and description

* touch setup.py to trigger cache update

* add hub version gating

0d0c392c

Bump numpy from 1.21.0 to 1.22.0 in /examples/research_projects/lxmert (#17817) · c366ce10

dependabot[bot] authored Jun 22, 2022

Bumps [numpy](https://github.com/numpy/numpy) from 1.21.0 to 1.22.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst)
- [Commits](https://github.com/numpy/numpy/compare/v1.21.0...v1.22.0

)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

c366ce10

Bump numpy in /examples/research_projects/visual_bert (#17816) · af0d21e7

dependabot[bot] authored Jun 22, 2022

Bumps [numpy](https://github.com/numpy/numpy) from 1.21.0 to 1.22.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst)
- [Commits](https://github.com/numpy/numpy/compare/v1.21.0...v1.22.0

)

---
updated-dependencies:
- dependency-name: numpy
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

af0d21e7

initial commit (#17818) · 56b83cf0
Arthur authored Jun 22, 2022

56b83cf0

Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer`... · 13570381

Eran Hirsch authored Jun 22, 2022

Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict` (#17805)

* Add logits_processor parameter, used by `generate`, to `Seq2SeqTrainer` methods `evaluate` and `predict`

* Add all generate parameters to `Seq2SeqTrainer`, and also to `QuestionAnsweringSeq2SeqTrainer` which overrides it

* Remove `self._num_beams` from trainer classes

* - Run fixup
- Fix "Constraint" not exposed
- Fix synced_gpus to actually read from param

* Use kwargs

* Copy kwargs before making changes to it

* Fix style issues unused imports

13570381