- 15 Nov, 2022 25 commits
-
-
Yih-Dar authored
* Allow trainer to return loss for CLIP-like models * Apply suggestions * update * update * update Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
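For context, a minimal sketch of what "returning a loss" means for CLIP-like models: with `return_loss=True` the model computes its own contrastive loss, which is the value the Trainer can now pick up. The checkpoint and captions below are only illustrative.

```python
import requests
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

inputs = processor(
    text=["a photo of two cats", "a photo of a dog"],
    images=[image, image],
    return_tensors="pt",
    padding=True,
)

# return_loss=True makes the model emit its contrastive loss in outputs.loss,
# which is what the Trainer looks for when training CLIP-like models.
outputs = model(**inputs, return_loss=True)
print(outputs.loss)
```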
-
Zachary Mueller authored
* Update reqs to include min gather_for_metrics Accelerate version * Other reqs
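For readers wondering what the version floor buys: `gather_for_metrics` gathers predictions across processes and drops the samples duplicated to pad the last batch. A self-contained sketch with a toy model (the real examples do the same inside their evaluation loops):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

accelerator = Accelerator()

# Toy setup so the loop is self-contained: a linear "model" and random data.
model = torch.nn.Linear(4, 2)
dataset = TensorDataset(torch.randn(10, 4), torch.randint(0, 2, (10,)))
dataloader = DataLoader(dataset, batch_size=3)

model, dataloader = accelerator.prepare(model, dataloader)

model.eval()
all_preds, all_labels = [], []
for features, labels in dataloader:
    with torch.no_grad():
        logits = model(features)
    preds = logits.argmax(dim=-1)
    # gather_for_metrics gathers across processes and trims the duplicated
    # samples on the last batch -- the reason for the bumped Accelerate version.
    preds, labels = accelerator.gather_for_metrics((preds, labels))
    all_preds.append(preds)
    all_labels.append(labels)

accuracy = (torch.cat(all_preds) == torch.cat(all_labels)).float().mean()
print(accuracy)
```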
-
Ambuj Pawar authored
* WIP: Added CLIP resources from HuggingFace blog * ADD: Notebooks documentation to clip * Add link straight to notebook Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Change notebook links to colab Co-authored-by:
Ambuj Pawar <your_email@abc.example> Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Saad Mahmud authored
* Add to DeBERTa resources * Fix mistakes with chapter number * Add fill-mask pipeline * Add sequence, token and QA pipeline * Change token classification pipeline order * Remove flax script and notebook links
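As a pointer for the pipelines mentioned above, a minimal sequence-classification sketch with a public DeBERTa checkpoint (the MNLI-finetuned checkpoint name is an assumption; any fine-tuned DeBERTa works the same way):

```python
from transformers import pipeline

# DeBERTa fine-tuned on MNLI, used here on a premise/hypothesis pair.
classifier = pipeline("text-classification", model="microsoft/deberta-large-mnli")

result = classifier({
    "text": "A soccer game with multiple males playing.",
    "text_pair": "Some men are playing a sport.",
})
print(result)  # e.g. an entailment label with its score
```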
-
Matt authored
* Slightly alter Keras dummy loss * Slightly alter Keras dummy loss * Add sample weight to test_keras_fit * Fix test_keras_fit for datasets * Skip the sample_weight stuff for models where the model tester has no batch_size
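For readers unfamiliar with the "dummy loss" path: when a TF model is compiled without an explicit loss, Keras falls back to the loss the model computes internally from the `labels` in the inputs, and `sample_weight` is now exercised on that path too. A rough sketch under those assumptions (checkpoint and data are placeholders):

```python
import numpy as np
from transformers import AutoTokenizer, TFAutoModelForSequenceClassification

checkpoint = "distilbert-base-uncased"  # any TF checkpoint would do; placeholder only
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = TFAutoModelForSequenceClassification.from_pretrained(checkpoint, num_labels=2)

batch = dict(tokenizer(["great movie", "terrible movie"], padding=True, return_tensors="np"))
batch["labels"] = np.array([1, 0])

# No loss= argument: Keras falls back to the model's internal ("dummy") loss.
model.compile(optimizer="adam")

# sample_weight is forwarded to that internal loss as well.
model.fit(batch, sample_weight=np.array([1.0, 0.5]), epochs=1)
```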
-
Suraj Patil authored
* allow loading projection in text and vision model * begin tests * finish test for CLIPTextModelTest * style * add slow tests * add new classes for projection heads * remove with_projection * add in init * add in doc * fix tests * fix some more tests * fix copies * fix docs * remove leftover from fix-copies * add the head models in IGNORE_NON_AUTO_CONFIGURED * fix docstr * fix tests * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * add docstr for models Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
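The new projection-head classes expose the projected embeddings directly; a minimal sketch with the usual public CLIP checkpoint:

```python
import torch
from transformers import CLIPTokenizer, CLIPTextModelWithProjection

tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-base-patch32")
model = CLIPTextModelWithProjection.from_pretrained("openai/clip-vit-base-patch32")

inputs = tokenizer(["a photo of a cat", "a photo of a dog"], padding=True, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# text_embeds are the projected embeddings living in the shared image-text space,
# obtained without instantiating the full CLIPModel.
print(outputs.text_embeds.shape)
```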
-
Sylvain Gugger authored
* Try PT1.13 by removing torch scatter * Skip failing tests * Style * Remove testing extras for repo utils * Try with all decorators * Try to wipe the cache * Fix all tests? * Try this way * Fix comma * Update to main * Try with less deps * Quality
-
Muhammad Sakib Khan Inan authored
* Init Update * ClearML Callbacks integration * update corrections * args reporting updated * {'tensorboard': False, 'pytorch': False} * ClearML Tests added * add clearml * output_uri=True in Task.init * reformatted integrations.py * reformatted and fixed * IF-ELSE statement issue on "has_clearml" resolved * Add clearml in main callback docs * Add additional clearml documentation * Update src/transformers/integrations.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Accept suggestion Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Accept suggestion Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Small change in comments * Make style clearml * Accept suggestion Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Victor Sonck <victor.sonck@gmail.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
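In user terms, the integration is switched on through `report_to`; a fragmentary sketch of just that part (it assumes the `clearml` package is installed and configured, and everything else about the Trainer setup is omitted):

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="outputs",
    report_to="clearml",  # routes metrics and artifacts to the new ClearML callback
    logging_steps=50,
)
```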
-
NielsRogge authored
* Fix bug * Add another fix * Add print statement * Apply fix * Fix feature extractor * Fix feature extractor * Add print statements * Add print statements * Remove print statements * Add instance segmentation integration test * Add integration test for semantic segmentation * Add draft for panoptic segmentation integration test * Fix integration test for panoptic segmentation * Remove slow annotator Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
TilmannR authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Kendall authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Matt authored
* Add missing ESM autoclass * Correct ESMFold checkpoint
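For orientation, a small sketch of loading ESM through the auto classes; the checkpoint name is an assumption (one of the small public ESM-2 checkpoints), and the corrected ESMFold checkpoint referenced in the docs is, to our understanding, `facebook/esmfold_v1`.

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

checkpoint = "facebook/esm2_t6_8M_UR50D"  # small public ESM-2 checkpoint (assumed)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForMaskedLM.from_pretrained(checkpoint)

# A short protein sequence; ESM tokenizes amino acids character by character.
inputs = tokenizer("MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
print(logits.shape)  # (batch, sequence_length, vocab_size)
```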
-
Arthur authored
-
Yong woo Song authored
* chore: typo on docstring in tokenization_electra * chore: typo on docstring in tokenization_electra * update for check copies
-
amyeroberts authored
* Add transforms for object detection * Update src/transformers/image_transforms.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Better var names & docstring * Remove unused var desc in docstring * Update src/transformers/image_transforms.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
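To give a feel for what object-detection transforms of this kind do, here is an illustrative, self-contained box-format conversion; the function name and exact API in `image_transforms.py` may differ, so this is not the library code:

```python
import numpy as np

def center_to_corners(boxes: np.ndarray) -> np.ndarray:
    """Convert boxes from (center_x, center_y, width, height) to
    (x_min, y_min, x_max, y_max) -- the kind of transform object-detection
    pipelines need between model outputs and drawing/evaluation code."""
    center_x, center_y, width, height = boxes.T
    return np.stack(
        [center_x - width / 2, center_y - height / 2,
         center_x + width / 2, center_y + height / 2],
        axis=-1,
    )

boxes = np.array([[0.5, 0.5, 0.2, 0.4]])  # normalized (cx, cy, w, h)
print(center_to_corners(boxes))           # [[0.4 0.3 0.6 0.7]]
```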
-
Younes Belkada authored
* first commit * add more comments * add router v1 * clean up - remove `tf` modeling files * clean up - remove `tf` modeling files * clean up * v0 routers * added more routers - Implemented `ExpertsChooseMaskedRouter` - added tests - 2 more routers to implement * last router * improved docstring - completed the docstring in `router.py` - added more args in the config * v0 sparse mlp * replace wrong naming * forward pass run * update MOE layer * small router update * fixup * consistency * remove scatter router * remove abstract layer * update test and model for integration testing * v1 conversion * update * hardcode hack * all keys match * add gin conversion, without additional libraries * update conversion script * delete router file * update tests wrt router deletion * fix router issues * update expert code * update, logits match, code needs refactoring * Refactor code Co-authored-by:
Younes Belkada <younesbelkada@users.noreply.github.com> * add generate tests Co-authored-by:
younesbelkada <younesbelkada@gmail.com> * add support for router loss Co-authored-by:
Younes Belkada <younesbelkada@users.noreply.github.com> * fix forward error * refactor a bit * remove `FlaxSwitchTransformers` modules * more tests pass * Update code Co-authored-by:
Younes Belkada <younesbelkada@users.noreply.github.com> * fixup * fix tests * fix doc * fix doc + tokenization * fix tokenizer test * fix test * fix loss output * update code for backward pass * add loss support * update documentation * fix documentation, clean tokenizer * more doc fix, cleanup example_switch * fix failing test * fix test * fix test * fix loss issue * move layer * update doc and fix router capacity usage * fixup * add sparse mlp index for documentation on hub * fixup * test sparse mix architecture * Apply suggestions from code review * Update docs/source/en/model_doc/switch_transformers.mdx * fixup on update * fix tests * fix another test * attempt fix * Update src/transformers/models/switch_transformers/configuration_switch_transformers.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/convert_switch_transformers_original_flax_checkpoint_to_pytorch.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * try * all tests pass * fix jitter noise * Apply suggestions from code review * doc tests pass * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/switch_transformers/modeling_switch_transformers.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * remove assert * change config order * fix readme japanese * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove parallelizable tests + add one liners * remove ONNX config * fix nits - add `T5Tokenizer` in auto mapping - remove `Switch Transformers` from ONNX supported models * remove `_get_router` * remove asserts * add check in test for `router_dtype` * add `SwitchTransformersConfig` in `run_pipeline_test` * Update tests/pipelines/test_pipelines_summarization.py * add huge model conversion script * fix slow tests - add better casting for `Linear8bitLt` - remove `torchscript` tests * add make dir * style on new script * fix nits - doctest - remove `_keys_to_ignore_on_load_unexpected` * Update src/transformers/models/switch_transformers/configuration_switch_transformers.py * add google as authors * fix year * remove last `assert` statements * standardize vertical spaces * fix failing import * fix another failing test * Remove strange `àuthorized_keys` * removing todo and padding that is never used Co-authored-by:
Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by:
ybelkada <younes@huggingface.co> Co-authored-by:
Younes Belkada <younesbelkada@users.noreply.github.com> Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Arthur Zucker <arthur@huggingface.co>
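Once merged, the model is usable like any other seq2seq model in the library; a rough sketch (the `google/switch-base-8` checkpoint name is an assumption based on the released Switch checkpoints):

```python
from transformers import AutoTokenizer, SwitchTransformersForConditionalGeneration

tokenizer = AutoTokenizer.from_pretrained("google/switch-base-8")
model = SwitchTransformersForConditionalGeneration.from_pretrained("google/switch-base-8")

# Switch Transformers is T5-style, so the usual text-to-text prompting applies;
# the sparse MoE routing happens inside the feed-forward layers.
input_ids = tokenizer(
    "A <extra_id_0> walks into a bar and orders a <extra_id_1> with two straws.",
    return_tensors="pt",
).input_ids
outputs = model.generate(input_ids, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```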
-
amyeroberts authored
-
Younes Belkada authored
* add `accelerate` support for `ViT` family - add `_no_split_modules` - manually cast to the right `dtype`: to change * enable `float16` for `deit` * fix `make fixup` * add `slow` test for `fp16` inference * another safety check * Update src/transformers/models/deit/modeling_deit.py
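In user terms, `_no_split_modules` and the dtype casts are what let a ViT checkpoint be loaded with device_map="auto" (and, on GPU, torch_dtype=torch.float16); a sketch assuming `accelerate` is installed, using the standard public checkpoint:

```python
import requests
import torch
from PIL import Image
from transformers import ViTFeatureExtractor, ViTForImageClassification

# device_map="auto" requires `accelerate`; on GPU, torch_dtype=torch.float16 can be
# added as well thanks to the manual dtype casts mentioned above.
model = ViTForImageClassification.from_pretrained(
    "google/vit-base-patch16-224", device_map="auto"
)
feature_extractor = ViTFeatureExtractor.from_pretrained("google/vit-base-patch16-224")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
inputs = feature_extractor(images=image, return_tensors="pt").to(model.device)

with torch.no_grad():
    logits = model(**inputs).logits
print(model.config.id2label[logits.argmax(-1).item()])
```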
-
Arthur authored
* Update modeling tests * update tokenization test * typo * nit * fix expected attention outputs * Apply suggestions from code review Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update tests from review Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> * remove problematic kwargs passed to the padding function Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Arthur authored
* update relative positional embedding * make fix copies * add `use_cache` to list of arguments * fixup * one-line function * add `test_decoder_model_past_with_large_inputs_relative_pos_emb` * add relative pos embedding test for more models * style
-
Yih-Dar authored
* Fix ImageSegmentationPipelineTests * Use 0.9 * no zip * links to show images * links to show images * rebase Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
bofeng huang authored
-
Wonhyeong Seo authored
* docs: fix: set overflowing image width to auto-scale * docs: fix: new language Korean is also affected * docs: fix: unnecessary line break in index page
-
- 14 Nov, 2022 15 commits
-
-
Nicolas Patry authored
* Very crude matching algorithm. * Fixing tests. * Removing comments * Adding warning + fix short matches. * Cleanup tests. * Quality. * Less noisy. * Fixup.
-
Joao Gante authored
-
amyeroberts authored
-
Ming Liu authored
There is a typo in the original hyperlink. Below is the original version: Based on the script [`run_translation_no_trainer.py`](https://github.com/huggingface/transformers/blob/main/examples/pytorch/translation/**run_translationn_no_trainer.py**).
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sanchit Gandhi authored
* merge conflicts * bos and eos in datacollator * (temp) hardcode removal of attention mask * freeze encoder * actually freeze encoder * set max length / num beams according to gen kwargs * (temp) fix tests * don't pop attn mask * override return attention mask config from Hub * Hub configs updated
🤗 * final fixes * update type annotations * backward comp
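The "max length / num beams according to gen kwargs" part corresponds to the generation arguments of `Seq2SeqTrainingArguments`; a fragmentary sketch of just that configuration (values are made up, the rest of the fine-tuning setup is omitted):

```python
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="whisper-finetuned",
    predict_with_generate=True,   # evaluation decodes with generate()
    generation_max_length=225,    # picked up as the max length during eval
    generation_num_beams=1,       # and as the beam count
)
```
-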
Wonhyeong Seo authored
Part of #20183 docs: add relevant labels to i18n issue template fix: typo on completion count
-
Wonhyeong Seo authored
docs: i18n: first draft of index page
docs: fix: first revision of index page
docs: i18n: missed section - supported frameworks
docs: fix: second revision of index page review by @ArthurZucker
refactor: remove untranslated files from korean
docs: fix: remove untranslated references from toctree.yml
feat: enable korean docs in gh actions
docs: feat: add in_translation page as placeholder
docs: bug: testing if internal toc need alphabet chars
docs: fix: custom english anchor for non-alphanumeric headings review by @sgugger
docs: i18n: translate comments on install methods in _config.py
docs: refactor: more concise wording for translations
-
Arthur authored
-
Younes Belkada authored
* fix slow test * Update tests/models/roc_bert/test_modeling_roc_bert.py
-
Joao Gante authored
-
IMvision12 authored
* initial commit * Update modeling_whisper.py * Fixing Tests * modeling_vision_text_dual_encoder * modeling_vision_encoder_decoder * Update modeling_vit.py * Update modeling_vit_msn.py * Update modeling_trajectory_transformer.py * style * Update modeling_time_series_transformer.py * Update modeling_time_series_transformer.py * Update modeling_segformer.py * Update modeling_plbart.py * Update modeling_dpt.py * Update modeling_deit.py * Update modeling_dpt.py * Update modeling_esm.py * Update modeling_fnet.py * Update modeling_fnet.py * Update modeling_fnet.py * Update modeling_flava.py * Update modeling_flava.py * Update modeling_layoutlmv3.py * Update modeling_levit.py
-
Nicolas Patry authored
Proposal Remove the weird `inspect` in ASR pipeline and make WhisperEncoder just nice to use. (#19571) * Proposal Remove the weird `inspect` in ASR pipeline and make WhisperEncoder just nice to use. It seems that accepting `attention_mask` is kind of an invariant of our models. For Seq2Seq ASR models, we had a special comment on how it actually was important to send it. `inspecting` seems like a pretty brittle way to handle this case. My suggestion is to simply add it as a kwarg and just ignore it, with the docstring explaining why it's ignored. * Fixup. * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Doc fixing. Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
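The practical upshot for users: Whisper checkpoints behave like the other ASR models in the pipeline; a minimal sketch (the tiny checkpoint and the local audio path are assumptions):

```python
from transformers import pipeline

asr = pipeline("automatic-speech-recognition", model="openai/whisper-tiny")

# The pipeline now passes attention_mask uniformly; Whisper simply ignores it,
# with the docstring explaining why, instead of the pipeline inspecting signatures.
result = asr("sample.flac")  # path to any local audio file (assumed to exist)
print(result["text"])
```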
-
code-with-rajeev authored
Fixed a grammatical error.
-
Bartosz Szmelczynski authored
* First draft * Remove scatter dependency * Add require_torch * update vectorized sum test, add clone call * remove artifacts * fix style * fix style v2 * remove "scatter" mentions from the code base * fix isort error Co-authored-by:
Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local> Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
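After this change TAPAS runs without the external `torch-scatter` wheel; a short sketch (the WTQ-finetuned checkpoint name is the usual public one, used here as an assumption):

```python
import pandas as pd
from transformers import TapasForQuestionAnswering, TapasTokenizer

# No torch-scatter installation needed anymore; the scatter ops are vectorized in plain PyTorch.
model_name = "google/tapas-base-finetuned-wtq"
tokenizer = TapasTokenizer.from_pretrained(model_name)
model = TapasForQuestionAnswering.from_pretrained(model_name)

table = pd.DataFrame({"Actor": ["Brad Pitt", "Leonardo DiCaprio"], "Movies": ["87", "53"]})
inputs = tokenizer(table=table, queries=["How many movies has Brad Pitt acted in?"], return_tensors="pt")
outputs = model(**inputs)
print(outputs.logits.shape)
```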
-