Commits · e99f0efedc30512e308e0684d3fe3afa4d374e34 · chenpangpang / transformers

"vscode:/vscode.git/clone" did not exist on "0e75aeefaf4beeb7a5bb6a1f05b83ab99e045a24"

10 May, 2022 5 commits

Add MLFLOW_FLATTEN_PARAMS support in MLflowCallback (#17148) · e99f0efe

Nicolas Brousse authored May 10, 2022

* add support for MLFLOW_FLATTEN_PARAMS

* ensure key is str

* fix style and update warning msg

* Empty commit to trigger CI

* fix bug in check_inits.py

* add unittest for flatten_dict utils

* fix 'NoneType' object is not callable on __del__

* add generic flatten_dict unittest to SPECIAL_MODULE_TO_TEST_MAP

* fix style

e99f0efe

[Deepspeed] add many more models to the model zoo test (#12695) · f8615044

Stas Bekman authored May 10, 2022

* model zoo take 2

* add deberta

* new param for zero2

* doc update

* doc update

* add layoutlm

* bump deepspeed

* add deberta-v2, funnel, longformer

* new models

* style

* add t5_v1

* update TAPAS status

* reorg problematic models

* move doc to another PR

* style

* fix checkpoint check test

* making progress on more models running

* cleanup

* new version

* cleanup

f8615044

[trainer] sharded _load_best_model (#17150) · 9aeacfe0
Stas Bekman authored May 10, 2022
```
* [trainer] sharded _load_best_model

probably needs a test?

* undo delete
```
9aeacfe0
train args defaulting None marked as Optional (#17156) · 1766fa21
Dom Miketa authored May 10, 2022
```
Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>
```
1766fa21
LogSumExp trick `question_answering` pipeline. (#17143) · 6d80c92c
Nicolas Patry authored May 10, 2022
```
* LogSumExp trick `question_answering` pipeline.

* Adding a failing test.
```
6d80c92c

09 May, 2022 8 commits

Fix MLflowCallback end_run() and add support for tags and nested runs (#17130) · 766d4bf7

Nicolas Brousse authored May 09, 2022

* ensure mlflow.end_run() is executed at end of training when mlflow.start_run() was executed by the callback

* add debug msg

* add support for MLFLOW_TAGS, MLFLOW_RUN_ID, and MLFLOW_NESTED_RUN

* update to support python 3.6+

* Validate env variables using ENV_VARS_TRUE_VALUES

* Empty-Commit

766d4bf7

Add the auto_find_batch_size capability from Accelerate into Trainer (#17068) · 2fbb2379

Zachary Mueller authored May 09, 2022


Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

- Adds auto_batch_size finder 
- Moves training loop to an inner training loop

2fbb2379

[WIP] Fix Pyright static type checking by replacing if-else imports with try-except (#16578) · df735d13

Dom Miketa authored May 09, 2022



* rebase and isort

* modify cookiecutter init

* fix cookiecutter auto imports

* fix clean_frameworks_in_init

* fix add_model_to_main_init

* blackify

* replace unnecessary f-strings

* update yolos imports

* fix roberta import bug

* fix yolos missing dependency

* fix add_model_like and cookiecutter bug

* fix repository consistency error

* modify cookiecutter, fix add_new_model_like

* remove stale line
Co-authored-by: Dom Miketa <dmiketa@exscientia.co.uk>

df735d13

Fix quality and repo consistency · 7783fa6b
Sylvain Gugger authored May 09, 2022

7783fa6b

PyTorch FSDP integration in Trainer (#17136) · 05fc1766

Sourab Mangrulkar authored May 09, 2022



* PyTorch FSDP integration in Trainer

* reformatting

make style and make quality are now compliant.

* Updating dependency check

* Trigger CI
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

05fc1766

add `mobilebert` onnx configs (#17029) · dc3645dc

Manan Dey authored May 09, 2022

* update docs of length_penalty

* Revert "update docs of length_penalty"

This reverts commit 466bf4800b75ec29bd2ff75bad8e8973bd98d01c.

* add mobilebert onnx config

* address suggestions

* Update auto.mdx

* Update __init__.py

* Update features.py

dc3645dc

Add type hints for BigBirdPegasus and Data2VecText PyTorch models (#17123) · a021f2b9

robsmith155 authored May 09, 2022

* Add type hints for remaining BigBirdPegasus models

Here I added type hints to the BigBirdPegasusForCausalLM class.

* Add missing type hints for Data2VecText models

Added type hints to the Data2VecTextForCausalLM, Data2VecTextForMaskedLM,
Data2VecTextForMultipleChoice, Data2VecTextForQuestionAnswering,
Data2VecTextForSequenceClassification, and
Data2VecTextForTokenClassification classes.

a021f2b9

LayoutLMv2Processor: ensure 1-to-1 mapping between images and samples in case... · e9fd583c

ghlai9665 authored May 09, 2022

LayoutLMv2Processor: ensure 1-to-1 mapping between images and samples in case of overflowing tokens (#17092)

* add get_overflowing_images function to ensure 1-to-1 mapping between samples and images in LayoutLMv2Processor

* make style

* add test for overflowing_tokens, change assert to ValueError, avoiding unrelated formatting changes

* change line length by passing --preview into black

e9fd583c

06 May, 2022 1 commit
- Added BigBirdPegasus onnx config (#17104) · 215e0681
  Ritik Nandwal authored May 06, 2022
```
* Add onnx configuration for bigbird-pegasus

* Modify docs
```
  215e0681
05 May, 2022 3 commits
- Fix MLflowCallback and add support for MLFLOW_EXPERIMENT_NAME (#17091) · c849a61e
  Nicolas Brousse authored May 05, 2022
```
* Fix use of mlflow.active_run() and add proper support for MLFLOW_EXPERIMENT_NAME

* Fix code style (make style)
```
  c849a61e
- Add type hints for BERTGeneration (#17047) · 99289c08
  robsmith155 authored May 05, 2022
```
Added type hints for the BERTGenerationEncoder and BERTGenerationDecoder
classes.
```
  99289c08
- type hints for pytorch models (#17064) · 45360e1a
  Robot Jelly authored May 05, 2022
```
* type hints for pytorch models

* fixed import error

* fixed some errors
```
  45360e1a
04 May, 2022 9 commits

minor change on TF Data2Vec test (#17085) · 6dc4c36a
Yih-Dar authored May 04, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6dc4c36a
Fix DeBERTa `token_type_ids` (#17082) · 870e6f29
Patrick Deutschmann authored May 04, 2022

870e6f29

Allow saved_model export of TFCLIPModel in save_pretrained (#16886) · 279bc584

Sean Moriarity authored May 04, 2022



* CLIP Serving

* Add type hints per code review

* Use black, flake8, and isort

* Update src/transformers/models/clip/modeling_tf_clip.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Rollback serving_output and add TODO

* Remove irrelevant portions of failing tests

* Revert "Rollback serving_output and add TODO"

This reverts commit a4abfa6ba3b7875a13538dbc2ddc4eb17dfcca8d.

* Rollback to original test/serving_output

* Fix unused var

* Apply suggestions from code review

* Update formatting with black

* Fix style again from rebase

* Update tests/models/clip/test_modeling_tf_clip.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Sean Moriarity <sean.l.moriarity.mil@army.mil>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

279bc584

Deprecate model templates (#17062) · bb8d4052
Sylvain Gugger authored May 04, 2022
```
* Deprecate model templates

* Address review comments
```
bb8d4052

Type hint complete Albert model file. (#16682) · 9c5ae87f

karthikrangasai authored May 04, 2022



* Type hint complete Albert model file.

* Update typing.

* Update src/transformers/models/albert/modeling_albert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

9c5ae87f

Add Data2Vec for Vision in TF (#17008) · 049e7917

Sayak Paul authored May 04, 2022



* add utilities till TFData2VecVisionLayer.

* chore: pass window_size to attention layer.

* feat: add TFData2VecVisionRelativePositionBias.

* feat: initial implementation ready for tf data2vec.

* fix: relative position bias index, table to be fixed.

* chore: implementation added, tests remaining.

* add: tests, other PR files.

* fix: code quality.

* fix: import structure in init.

* chore: run make fix-copies.

* chore: address PR feedback (round I).

* chore: styling nit.

* fix: tests due to removal of to_2tuple().

* chore: rebase with upstream main and move the test.

* Update src/transformers/models/auto/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/auto/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix: layer call.

* chore: remove from_pt=True and rerun test.

* chore: remove cast and tf.divide.

* chore: minor edits to the test script.

* Update src/transformers/models/data2vec/modeling_tf_data2vec_vision.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* fix: expand() on TF tensors with broadcast_to().

* fix: test import.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

049e7917

Make sure telemetry arguments are not returned as unused kwargs (#17063) · d76d2a2a
Sylvain Gugger authored May 04, 2022
```
* Make sure telemetry arguments are not returned as unused kwargs

* Fix test
```
d76d2a2a

Remove masked image modeling from BEIT ONNX export (#16980) · 675e2d16

lewtun authored May 04, 2022



* Add masked image modelling to task mapping

* Refactor ONNX features to be listed alphabetically

* Add warning about BEiT masked image modeling
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

675e2d16

Skip RoFormer ONNX test if rjieba not installed (#16981) · 4bb1d0ec

lewtun authored May 04, 2022

* Skip RoFormer ONNX test if rjieba not installed

* Update deps table

* Skip RoFormer serialization test

* Fix RoFormer vocab

* Add rjieba to CircleCI

4bb1d0ec

03 May, 2022 5 commits

Remove device parameter from create_extended_attention_mask_for_decoder (#16894) · 39f8eafc
Pavel Belevich authored May 03, 2022

39f8eafc
Fix RNG reload in resume training from epoch checkpoint (#17055) · 1c9fcd0e
Sylvain Gugger authored May 03, 2022
```
* Fix RNG reload in resume training from epoch checkpoint

* Fix test
```
1c9fcd0e
Make Trainer compatible with sharded checkpoints (#17053) · a8fa2f91
Sylvain Gugger authored May 03, 2022
```
* Make Trainer compatible with sharded checkpoints

* Add doc
```
a8fa2f91

Move test model folders (#17034) · 19420fd9

Yih-Dar authored May 03, 2022



* move test model folders (TODO: fix imports and others)

* fix (potentially partially) imports (in model test modules)

* fix (potentially partially) imports (in tokenization test modules)

* fix (potentially partially) imports (in feature extraction test modules)

* fix import utils.test_modeling_tf_core

* fix path ../fixtures/

* fix imports about generation.test_generation_flax_utils

* fix more imports

* fix fixture path

* fix get_test_dir

* update module_to_test_file

* fix get_tests_dir from wrong transformers.utils

* update config.yml (CircleCI)

* fix style

* remove missing imports

* update new model script

* update check_repo

* update SPECIAL_MODULE_TO_TEST_MAP

* fix style

* add __init__

* update self-scheduled

* fix add_new_model scripts

* check one way to get location back

* python setup.py build install

* fix import in test auto

* update self-scheduled.yml

* update slack notification script

* Add comments about artifact names

* fix for yolos
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

19420fd9

[FlaxBert] Add ForCausalLM (#16995) · cd9274d0

Sanchit Gandhi authored May 03, 2022

* [FlaxBert] Add ForCausalLM

* make style

* fix output attentions

* Add RobertaForCausalLM

* remove comment

* fix fx-to-pt model loading

* remove comment

* add modeling tests

* add enc-dec model tests

* add big_bird

* add electra

* make style

* make repo-consitency

* add to docs

* remove roberta test

* quality

* amend cookiecutter

* fix attention_mask bug in flax bert model tester

* tighten pt-fx thresholds to 1e-5

* add 'copied from' statements

* amend 'copied from' statements

* amend 'copied from' statements

* quality

cd9274d0

02 May, 2022 9 commits

[T5 Tokenizer] Model has no fixed position ids - there is no hardcode… (#16990) · 31616b8d

Patrick von Platen authored May 02, 2022



* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length

* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length

* correct t5 tokenizer

* correct t5 tokenizer

* fix test

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* finish
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

31616b8d

Make the sacremoses dependency optional (#17049) · 30ca5299
Lysandre Debut authored May 02, 2022
```
* Make sacremoses optional

* Pickle
```
30ca5299
Allow all imports from transformers (#17050) · bb2e088b
Lysandre Debut authored May 02, 2022

bb2e088b

Add YOLOS (#16848) · 1ac69874

NielsRogge authored May 02, 2022



* First draft

* Add YolosForObjectDetection

* Make forward pass work

* Add mid position embeddings

* Add interpolation of position encodings

* Add expected values

* Add YOLOS to tests

* Add integration test

* Support tiny model as well

* Support all models in conversion script

* Remove mid_pe_size attribute

* Make more tests pass

* Add model to README and fix config

* Add copied from statements

* Rename base_model_prefix to vit

* Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP

* Apply suggestions from code review

* Apply more suggestions from code review

* Convert remaining checkpoints

* Improve docstrings

* Add YolosFeatureExtractor

* Add feature extractor to docs

* Add corresponding tests

* Fix style

* Fix docs

* Apply suggestion from code review

* Fix bad rebase

* Fix some more bad rebase

* Fix missing character

* Improve docs and variable names
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

1ac69874

[Trainer] Move logic for checkpoint loading into separate methods for easy overriding (#17043) · daecae1f
calpt authored May 02, 2022

daecae1f
Fix typo in RetriBERT docstring (#17018) · 9586e222
Martin Pömsl authored May 02, 2022

9586e222
[Flax(Speech)EncoderDecoder] Fix bug in `decoder_module` (#17036) · 93b802c4
Sanchit Gandhi authored May 02, 2022
```
* [FlaxSpeechEncoderDecoder] Fix bug in `decoder_module`

* [FlaxEncoderDecoder] Fix bug in `decoder_module`
```
93b802c4
Fix style · 1ae182d9
Sylvain Gugger authored May 02, 2022

1ae182d9

Fx with meta (#16836) · 2c2a2169

Michael Benayoun authored May 02, 2022

* Add meta proxy

* Uses meta data to trace data dependent control-flow

* Remove commented class

* Handles torch creating functions

* Added type annotation to fix tracing

* Tracing works for everything but T5 and GPT-J

* Almost all previously supported models pass

* All architectures can be traced except T5

* Intermediate commit to have a trace of the comparison operators for HFProxy

* Everything works, except loss computation

* Everything works

* Removed unused import

* Overriden methods do not use underlying ops (linear and torch.matmul), and model attributes are copied to the traced version

* Fix torch_matmul_override

* Change attributes reference to deepcopy

* Remove breakpoint and add torch_index_override

* Small fix

* Fix typo

* Replace asserts by explicit exceptions

2c2a2169