Commits · 4ad2f68e3439a98ba76317a6453f910c4a631284 · chenpangpang / transformers

10 May, 2022 8 commits

Fix template init (#17163) · 4ad2f68e
Sylvain Gugger authored May 10, 2022

4ad2f68e

Add MLFLOW_FLATTEN_PARAMS support in MLflowCallback (#17148) · e99f0efe

Nicolas Brousse authored May 10, 2022

* add support for MLFLOW_FLATTEN_PARAMS

* ensure key is str

* fix style and update warning msg

* Empty commit to trigger CI

* fix bug in check_inits.py

* add unittest for flatten_dict utils

* fix 'NoneType' object is not callable on __del__

* add generic flatten_dict unittest to SPECIAL_MODULE_TO_TEST_MAP

* fix style

e99f0efe
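
Note on e99f0efe: the commit bullets mention a `flatten_dict` utility and "ensure key is str". A minimal sketch of that kind of flattening follows; the signature, the `"."` delimiter, and the recursion are assumptions for illustration, not the code from the commit.

```python
from collections.abc import MutableMapping


def flatten_dict(d, parent_key="", delimiter="."):
    """Flatten a nested mapping into a single level with joined string keys.

    Illustrative only: the signature and delimiter are assumptions.
    """
    flat = {}
    for key, value in d.items():
        # Keys are always converted to str so they can be joined and logged.
        full_key = f"{parent_key}{delimiter}{key}" if parent_key else str(key)
        if isinstance(value, MutableMapping):
            # Recurse into nested mappings, carrying the joined prefix along.
            flat.update(flatten_dict(value, full_key, delimiter))
        else:
            flat[full_key] = value
    return flat
```

Nested parameters such as `{"optim": {"lr": 0.1}}` would then be logged under the single key `optim.lr`.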

missing file (#17164) · 976835d5
Stas Bekman authored May 10, 2022

976835d5
Fixing the output of code examples in the preprocessing chapter (#17162) · 259eeb6d
Patrick Haller authored May 10, 2022

259eeb6d

[Deepspeed] add many more models to the model zoo test (#12695) · f8615044

Stas Bekman authored May 10, 2022

* model zoo take 2

* add deberta

* new param for zero2

* doc update

* doc update

* add layoutlm

* bump deepspeed

* add deberta-v2, funnel, longformer

* new models

* style

* add t5_v1

* update TAPAS status

* reorg problematic models

* move doc to another PR

* style

* fix checkpoint check test

* making progress on more models running

* cleanup

* new version

* cleanup

f8615044

[trainer] sharded _load_best_model (#17150) · 9aeacfe0
Stas Bekman authored May 10, 2022
```
* [trainer] sharded _load_best_model

probably needs a test?

* undo delete
```
9aeacfe0
train args defaulting None marked as Optional (#17156) · 1766fa21
Dom Miketa authored May 10, 2022
```
Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>
```
1766fa21
LogSumExp trick `question_answering` pipeline. (#17143) · 6d80c92c
Nicolas Patry authored May 10, 2022
```
* LogSumExp trick `question_answering` pipeline.

* Adding a failing test.
```
6d80c92c
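
Note on 6d80c92c: the LogSumExp trick named in the title is the standard way to normalize scores without overflowing `exp`. The sketch below shows the general technique in plain Python; the function names are illustrative and it is not the pipeline's actual code.

```python
import math


def logsumexp(scores):
    """Compute log(sum(exp(s))) stably by factoring out the maximum score."""
    m = max(scores)
    # exp(s - m) is at most 1, so the sum cannot overflow.
    return m + math.log(sum(math.exp(s - m) for s in scores))


def stable_softmax(scores):
    """Turn raw scores into probabilities using the LogSumExp trick."""
    lse = logsumexp(scores)
    return [math.exp(s - lse) for s in scores]
```

A naive `math.exp(1000.0)` raises `OverflowError`, while `stable_softmax([1000.0, 999.0])` evaluates without error.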

09 May, 2022 10 commits

Fix all docs for accelerate install directions (#17145) · d719bcd4
Zachary Mueller authored May 09, 2022

d719bcd4

Fix MLflowCallback end_run() and add support for tags and nested runs (#17130) · 766d4bf7

Nicolas Brousse authored May 09, 2022

* ensure mlflow.end_run() is executed at end of training when mlflow.start_run() was executed by the callback

* add debug msg

* add support for MLFLOW_TAGS, MLFLOW_RUN_ID, and MLFLOW_NESTED_RUN

* update to support python 3.6+

* Validate env variables using ENV_VARS_TRUE_VALUES

* Empty-Commit

766d4bf7

Add the auto_find_batch_size capability from Accelerate into Trainer (#17068) · 2fbb2379

Zachary Mueller authored May 09, 2022


Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

- Adds auto_batch_size finder 
- Moves training loop to an inner training loop

2fbb2379
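
Note on 2fbb2379: the commit describes a batch-size finder wrapped around an inner training loop. One common way to build such a finder is to retry with a halved batch size after an out-of-memory failure. The sketch below assumes that approach; `run_with_shrinking_batch_size` is a hypothetical helper, and `MemoryError` stands in for whatever out-of-memory error the real framework raises.

```python
def run_with_shrinking_batch_size(inner_loop, starting_batch_size=128):
    """Call inner_loop(batch_size), halving batch_size after each memory failure.

    Hypothetical helper for illustration; not the Trainer or Accelerate API.
    """
    batch_size = starting_batch_size
    while batch_size > 0:
        try:
            return inner_loop(batch_size)
        except MemoryError:
            # Assumed failure signal: retry the whole inner loop with half the batch.
            batch_size //= 2
    raise RuntimeError("No executable batch size found.")
```

Keeping the training loop in a separate inner function is what makes the retry possible: the outer wrapper can call it again from scratch with a smaller batch size.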

[WIP] Fix Pyright static type checking by replacing if-else imports with try-except (#16578) · df735d13

Dom Miketa authored May 09, 2022



* rebase and isort

* modify cookiecutter init

* fix cookiecutter auto imports

* fix clean_frameworks_in_init

* fix add_model_to_main_init

* blackify

* replace unnecessary f-strings

* update yolos imports

* fix roberta import bug

* fix yolos missing dependency

* fix add_model_like and cookiecutter bug

* fix repository consistency error

* modify cookiecutter, fix add_new_model_like

* remove stale line
Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>

df735d13
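
Note on df735d13: the title describes replacing if-else guarded imports with try-except. A rough sketch of that pattern, using only the standard library, is shown below. The exception class, `is_available`, and `optional_exports` are hypothetical names chosen for illustration and may not match the repository's code.

```python
import importlib.util


class OptionalDependencyNotAvailable(Exception):
    """Raised when an optional backend is not installed (illustrative name)."""


def is_available(module_name):
    # find_spec returns None when a top-level module cannot be located.
    return importlib.util.find_spec(module_name) is not None


def optional_exports(module_name, exports):
    """Return the names to expose only if the optional dependency is present."""
    try:
        if not is_available(module_name):
            raise OptionalDependencyNotAvailable()
    except OptionalDependencyNotAvailable:
        # Dependency missing: expose nothing (a real init might register dummies).
        return []
    else:
        return list(exports)
```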

Fix quality and repo consistency · 7783fa6b
Sylvain Gugger authored May 09, 2022

7783fa6b

PyTorch FSDP integration in Trainer (#17136) · 05fc1766

Sourab Mangrulkar authored May 09, 2022



* PyTorch FSDP integration in Trainer

* reformatting

make style and make quality are now compliant.

* Updating dependency check

* Trigger CI
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

05fc1766

add `mobilebert` onnx configs (#17029) · dc3645dc

Manan Dey authored May 09, 2022

* update docs of length_penalty

* Revert "update docs of length_penalty"

This reverts commit 466bf4800b75ec29bd2ff75bad8e8973bd98d01c.

* add mobilebert onnx config

* address suggestions

* Update auto.mdx

* Update __init__.py

* Update features.py

dc3645dc

Add type hints for BigBirdPegasus and Data2VecText PyTorch models (#17123) · a021f2b9

robsmith155 authored May 09, 2022

* Add type hints for remaining BigBirdPegasus models

Here I added type hints to the BigBirdPegasusForCausalLM class.

* Add missing type hints for Data2VecText models

Added type hints to the Data2VecTextForCausalLM, Data2VecTextForMaskedLM,
Data2VecTextForMultipleChoice, Data2VecTextForQuestionAnswering,
Data2VecTextForSequenceClassification, and
Data2VecTextForTokenClassification classes.

a021f2b9

LayoutLMv2Processor: ensure 1-to-1 mapping between images and samples in case... · e9fd583c

ghlai9665 authored May 09, 2022

LayoutLMv2Processor: ensure 1-to-1 mapping between images and samples in case of overflowing tokens (#17092)

* add get_overflowing_images function to ensure 1-to-1 mapping between samples and images in LayoutLMv2Processor

* make style

* add test for overflowing_tokens, change assert to ValueError, avoiding unrelated formatting changes

* change line length by passing --preview into black

e9fd583c

split single_gpu and multi_gpu (#17083) · 3212afa6

Yih-Dar authored May 09, 2022



* split single_gpu and multi_gpu

* update needs in send_result
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

3212afa6

06 May, 2022 2 commits
- Added BigBirdPegasus onnx config (#17104) · 215e0681
  Ritik Nandwal authored May 06, 2022
```
* Add onnx configuration for bigbird-pegasus

* Modify docs
```
  215e0681
- Fix self-push CI report path in cat (#17111) · 351cdbdf
  Yih-Dar authored May 06, 2022
```
* fix report cat path

* fix report cat path
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  351cdbdf
05 May, 2022 6 commits
- Fix link to example scripts (#17103) · cad61b68
  Steven Liu authored May 05, 2022
  
  cad61b68
- fix missing "models" in pipeline test module (#17090) · a59eb349
  Yih-Dar authored May 05, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  a59eb349
- Remove torchhub test (#17097) · dd16a113
  Sylvain Gugger authored May 05, 2022
  
  dd16a113
- Fix MLflowCallback and add support for MLFLOW_EXPERIMENT_NAME (#17091) · c849a61e
  Nicolas Brousse authored May 05, 2022
```
* Fix use of mlflow.active_run() and add proper support for MLFLOW_EXPERIMENT_NAME

* Fix code style (make style)
```
  c849a61e
- Add type hints for BERTGeneration (#17047) · 99289c08
  robsmith155 authored May 05, 2022
```
Added type hints for the BERTGenerationEncoder and BERTGenerationDecoder
classes.
```
  99289c08
- type hints for pytorch models (#17064) · 45360e1a
  Robot Jelly authored May 05, 2022
```
* type hints for pytorch models

* fixed import error

* fixed some errors
```
  45360e1a
04 May, 2022 14 commits

Added spanish translation of autoclass_tutorial. (#17069) · db377a0b

Daniel Espejel authored May 04, 2022

* Added spanish translation of autoclass_tutorial.
Added 'local' and 'title' fields for autoclass_tutorial.

* Fixed autoclass_tutorial title in _toctree.yml and autoclass_tutorial.mdx

db377a0b

minor change on TF Data2Vec test (#17085) · 6dc4c36a
Yih-Dar authored May 04, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6dc4c36a
📝 open fresh PR for pipeline doctests (#17073) · 23619ef6
Steven Liu authored May 04, 2022

23619ef6
Fix DeBERTa `token_type_ids` (#17082) · 870e6f29
Patrick Deutschmann authored May 04, 2022

870e6f29

Allow saved_model export of TFCLIPModel in save_pretrained (#16886) · 279bc584

Sean Moriarity authored May 04, 2022



* CLIP Serving

* Add type hints per code review

* Use black, flake8, and isort

* Update src/transformers/models/clip/modeling_tf_clip.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Rollback serving_output and add TODO

* Remove irrelevant portions of failing tests

* Revert "Rollback serving_output and add TODO"

This reverts commit a4abfa6ba3b7875a13538dbc2ddc4eb17dfcca8d.

* Rollback to original test/serving_output

* Fix unused var

* Apply suggestions from code review

* Update formatting with black

* Fix style again from rebase

* Update tests/models/clip/test_modeling_tf_clip.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Sean Moriarity <sean.l.moriarity.mil@army.mil>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

279bc584

Update to build via git for accelerate (#17084) · ef203902
Zachary Mueller authored May 04, 2022

ef203902
Deprecate model templates (#17062) · bb8d4052
Sylvain Gugger authored May 04, 2022
```
* Deprecate model templates

* Address review comments
```
bb8d4052

Type hint complete Albert model file. (#16682) · 9c5ae87f

karthikrangasai authored May 04, 2022



* Type hint complete Albert model file.

* Update typing.

* Update src/transformers/models/albert/modeling_albert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

9c5ae87f

Bump notebook from 6.4.1 to 6.4.10 in /examples/research_projects/lxmert (#16634) · 2bf95e2b

dependabot[bot] authored May 04, 2022

Bumps [notebook](http://jupyter.org) from 6.4.1 to 6.4.10.

---
updated-dependencies:
- dependency-name: notebook
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

2bf95e2b

Bump notebook in /examples/research_projects/visual_bert (#16635) · 7a229ef4

dependabot[bot] authored May 04, 2022

Bumps [notebook](http://jupyter.org) from 6.4.1 to 6.4.10.

---
updated-dependencies:
- dependency-name: notebook
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

7a229ef4

Add Data2Vec for Vision in TF (#17008) · 049e7917

Sayak Paul authored May 04, 2022



* add utilities till TFData2VecVisionLayer.

* chore: pass window_size to attention layer.

* feat: add TFData2VecVisionRelativePositionBias.

* feat: initial implementation ready for tf data2vec.

* fix: relative position bias index, table to be fixed.

* chore: implementation added, tests remaining.

* add: tests, other PR files.

* fix: code quality.

* fix: import structure in init.

* chore: run make fix-copies.

* chore: address PR feedback (round I).

* chore: styling nit.

* fix: tests due to removal of to_2tuple().

* chore: rebase with upstream main and move the test.

* Update src/transformers/models/auto/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/auto/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix: layer call.

* chore: remove from_pt=True and rerun test.

* chore: remove cast and tf.divide.

* chore: minor edits to the test script.

* Update src/transformers/models/data2vec/modeling_tf_data2vec_vision.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* fix: expand() on TF tensors with broadcast_to().

* fix: test import.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

049e7917

Make sure telemetry arguments are not returned as unused kwargs (#17063) · d76d2a2a
Sylvain Gugger authored May 04, 2022
```
* Make sure telemetry arguments are not returned as unused kwargs

* Fix test
```
d76d2a2a

Remove masked image modeling from BEIT ONNX export (#16980) · 675e2d16

lewtun authored May 04, 2022



* Add masked image modelling to task mapping

* Refactor ONNX features to be listed alphabetically

* Add warning about BEiT masked image modeling
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

675e2d16

Skip RoFormer ONNX test if rjieba not installed (#16981) · 4bb1d0ec

lewtun authored May 04, 2022

* Skip RoFormer ONNX test if rjieba not installed

* Update deps table

* Skip RoFormer serialization test

* Fix RoFormer vocab

* Add rjieba to CircleCI

4bb1d0ec