Commits · 683cbc4c340b7e3d24981ac1c8ac90fe776cda36 · chenpangpang / transformers

15 Nov, 2022 14 commits

fixed spelling error in testing.mdx (#20220) · 683cbc4c
Kendall authored Nov 15, 2022

683cbc4c
fix device issue (#20227) · 6ed6ed29
Yih-Dar authored Nov 15, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6ed6ed29
Add missing ESM autoclass (#20177) · d3d5fa3e
Matt authored Nov 15, 2022
```
* Add missing ESM autoclass

* Correct ESMFold checkpoint
```
d3d5fa3e
Remove `authorized_missing_keys`in favor of _keys_to_ignore_on_load_missing (#20228) · 92cfe8b0
Arthur authored Nov 15, 2022

92cfe8b0

Typo on doctring in ElectraTokenizer (#20192) · 2d920010

Yong woo Song authored Nov 15, 2022

* chore: typo on docstring in tokenization_electra

* chore: typo on docstring in tokenization_electra

* update for check copies

2d920010

Add object detection + segmentation transforms (#20003) · 4c7e8d09

amyeroberts authored Nov 15, 2022



* Add transforms for object detection

* Update src/transformers/image_transforms.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Better var names & docstring

* Remove unused var desc in docstring

* Update src/transformers/image_transforms.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

4c7e8d09

Add Switch transformers (#19323) · 163ac3d3

Younes Belkada authored Nov 15, 2022



* first commit

* add more comments

* add router v1

* clean up

- remove `tf` modeling files

* clean up

- remove `tf` modeling files

* clean up

* v0 routers

* added more router

- Implemented `ExpertsChooseMaskedRouter`

- added tests
- 2 more routers to implement

* last router

* improved docstring

- completed the docstring in `router.py`
- added more args in the config

* v0 sparse mlp

* replace wrong naming

* forward pass run

* update MOE layer

* small router update

* fixup

* consistency

* remove scatter router

* remove abstract layer

* update test and model for integration testing

* v1 conversion

* update

* hardcode hack

* all keys match

* add gin conversion, without additional libraries

* update conversion sctipy

* delete router file

* update tests wrt router deletion

* fix router issues

* update expert code

* update, logits match, code needsREFACTORING

* Refactor code
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>

* add generate tests
Co-authored-by: younesbelkada <younesbelkada@gmail.com>

* add support for router loss
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>

* fix forward error

* refactor a bit

* remove `FlaxSwitchTransformers` modules

* more tests pass

* Update code
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>

* fixup

* fix tests

* fix doc

* fix doc + tokenization

* fix tokenizer test

* fix test

* fix loss output

* update code for backward pass

* add loss support

* update documentation

* fix documentation, clean tokenizer

* more doc fix, cleanup example_switch

* fix failing test

* fix test

* fix test

* fix loss issue

* move layer

* update doc and fix router capacity usage

* fixup

* add sparse mlp index for documentation on hub

* fixup

* test sparse mix architecture

* Apply suggestions from code review

* Update docs/source/en/model_doc/switch_transformers.mdx

* fixup on update

* fix tests

* fix another test

* attempt fix

* Update src/transformers/models/switch_transformers/configuration_switch_transformers.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/switch_transformers/convert_switch_transformers_original_flax_checkpoint_to_pytorch.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* try

* all tests pass

* fix jitter noise

* Apply suggestions from code review

* doc tests pass

* Update src/transformers/models/switch_transformers/modeling_switch_transformers.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/switch_transformers/modeling_switch_transformers.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* remove assert

* change config order

* fix readme japanese

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove parallelizable tests + add one liners

* remove ONNX config

* fix nits

- add `T5Tokenizer` in auto mapping
- remove `Switch Transformers` from ONNX supported models

* remove `_get_router`

* remove asserts

* add check in test for `router_dtype`

* add `SwitchTransformersConfig` in `run_pipeline_test`

* Update tests/pipelines/test_pipelines_summarization.py

* add huge model conversion script

* fix slow tests

- add better casting for `Linear8bitLt`
- remove `torchscript` tests

* add make dir

* style on new script

* fix nits

- doctest
- remove `_keys_to_ignore_on_load_unexpected`

* Update src/transformers/models/switch_transformers/configuration_switch_transformers.py

* add google as authors

* fix year

* remove last `assert` statements

* standardize vertical spaces

* fix failing import

* fix another failing test

* Remove strange àuthorized_keys`

* removing todo and padding that is never used
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: ybelkada <younes@huggingface.co>
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Arthur Zucker <arthur@huggingface.co>

163ac3d3

Add param_name to size_dict logs & tidy (#20205) · 55ba3190
amyeroberts authored Nov 15, 2022

55ba3190

Add `accelerate` support for `ViT` family (#20174) · f1e8c48c

Younes Belkada authored Nov 15, 2022

* add `accelerate` support for `ViT` family

- add `_no_split_modules`
- manually cast to the right `dtype`: to change

* enable `float16` for `deit`

* fix `make fixup`

* add `slow` test for `fp16` inference

* another safety check

* Update src/transformers/models/deit/modeling_deit.py

f1e8c48c

[WHISPER] Update modeling tests (#20162) · 11b2e45c

Arthur authored Nov 15, 2022



* Update modeling tests

* update tokenization test

* typo

* nit

* fix expected attention outputs

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update tests from review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* remove problematics kwargs passed to the padding function
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

11b2e45c

update relative positional embedding (#20203) · f60eec40

Arthur authored Nov 15, 2022

* update relative positional embedding

* make fix copies

* add `use_cache` to list of arguments

* fixup

* 1line fucntion

* add `test_decoder_model_past_with_large_inputs_relative_pos_emb`

* add relative pos embedding test for more models

* style

f60eec40

Make `ImageSegmentationPipelineTests` less flaky (#20147) · f9909fbf

Yih-Dar authored Nov 15, 2022



* Fix ImageSegmentationPipelineTests

* Use 0.9

* no zip

* links to show images

* links to show images

* rebase
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f9909fbf

Update tokenizer_summary.mdx (#20135) · 9625924c
bofeng huang authored Nov 15, 2022

9625924c

[docs] set overflowing image width to auto-scale (#20197) · 8fadfd50

Wonhyeong Seo authored Nov 15, 2022

* docs: fix: set overflowing image width to auto-scale

* docs: fix: new language Korean is also affected

* docs: fix: unnecessary line break in index page

8fadfd50

14 Nov, 2022 16 commits

Adding chunking for whisper (all seq2seq actually). Very crude matching algorithm. (#20104) · 25c451e5

Nicolas Patry authored Nov 14, 2022

* Very crude matching algorithm.

* Fixing tests.

* Removing comments

* Adding warning + fix short matches.

* Cleanup tests.

* Quality.

* Less noisy.

* Fixup.

25c451e5

Generate: add Bloom fixes for contrastive search (#20213) · 938cb047
Joao Gante authored Nov 14, 2022

938cb047
Downgrade log warning -> info (#20202) · fda12563
amyeroberts authored Nov 14, 2022

fda12563

Update README.md (#20188) · 36b063ed

Ming Liu authored Nov 15, 2022

There is typo in the original hyperlink.

Below is the original version:
Based on the script [`run_translation_no_trainer.py`](https://github.com/huggingface/transformers/blob/main/examples/pytorch/translation/**run_translationn_no_trainer.py**).

36b063ed

mark `test_save_load_fast_init_from_base` as `is_flaky` (#20200) · 536e60d2
Yih-Dar authored Nov 14, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
536e60d2

[Examples] Generalise Seq2Seq ASR to handle Whisper (#19519) · af1a7c8c

Sanchit Gandhi authored Nov 14, 2022

* merge conflicts

* bos and eos in datacollator

* (temp) hardcode removal of attention mask

* freeze encoder

* actually freeze encoder

* set max length / num beams according to gen kwargs

* (temp) fix tests

* don't pop attn mask

* override return attention mask config from Hub

* Hub configs updated 🤗

* final fixes

* update type annotations

* backward comp

af1a7c8c

feat: add i18n issue template (#20199) · 7ecb0391

Wonhyeong Seo authored Nov 15, 2022

Part of #20183
docs: add relevant labels to i18n issue template
fix: typo on completion count

7ecb0391

docs: translated index page to korean (#20180) · 07d8d6e2

Wonhyeong Seo authored Nov 15, 2022

docs: i18n: first draft of index page
docs: fix: first revision of index page
docs: i18n: missed section - supported frameworks
docs: fix: second revision of index page
review by @ArthurZucker

refactor: remove untranslated files from korean
docs: fix: remove untranslated references from toctree.yml
feat: enable korean docs in gh actions
docs: feat: add in_translation page as placeholder
docs: bug: testing if internal toc need alphabet chars
docs: fix: custom english anchor for non-alphanumeric headings
review by @sgugger

docs: i18n: translate comments on install methods in _config.py
docs: refactor: more concise wording for translations

07d8d6e2

add _keys_to_ignore_on_load_unexpected = [r"pooler"] (#20210) · c149d366
Arthur authored Nov 14, 2022

c149d366
[ROC_BERT] Make CI happy (#20175) · 8dcf494e
Younes Belkada authored Nov 14, 2022
```
* fix slow test

* Update tests/models/roc_bert/test_modeling_roc_bert.py
```
8dcf494e
Generate: TF sample doctest result update (#20208) · 7b55bb45
Joao Gante authored Nov 14, 2022

7b55bb45

Pytorch type hints (#20112) · d24e84d9

IMvision12 authored Nov 14, 2022

* initial commit

* Update modeling_whisper.py

* Fixing Tests

* modeling_vision_text_dual_encoder

* modeling_vision_encoder_decoder

* Update modeling_vit.py

* Update modeling_vit_msn.py

* Update modeling_trajectory_transformer.py

* style

* Update modeling_time_series_transformer.py

* Update modeling_time_series_transformer.py

* Update modeling_segformer.py

* Update modeling_plbart.py

* Update modeling_dpt.py

* Update modeling_deit.py

* Update modeling_dpt.py

* Update modeling_esm.py

* Update modeling_fnet.py

* Update modeling_fnet.py

* Update modeling_fnet.py

* Update modeling_flava.py

* Update modeling_flava.py

* Update modeling_layoutlmv3.py

* Update modeling_levit.py

d24e84d9

Proposal Remove the weird `inspect` in ASR pipeline and make WhisperEncoder... · 03bc6ece

Nicolas Patry authored Nov 14, 2022


Proposal Remove the weird `inspect` in ASR pipeline and make WhisperEncoder just nice to use. (#19571)

* Proposal Remove the weird `inspect` in ASR pipeline and make
WhisperEncoder just nice to use.

It seems that accepting `attention_mask` is kind of an invariant of our
models. For Seq2Seq ASR models, we had a special comment on how it
actually was important to send it.

`inspecting` seems pretty brittle way to handle this case.
My suggestion is to simply add it as an kwarg that and just ignoring
it with the docstring explaining why it's ignored.

* Fixup.

* Update src/transformers/models/whisper/modeling_whisper.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Doc fixing .
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

03bc6ece

Update README.md (#19530) · 2308f3d4
code-with-rajeev authored Nov 14, 2022
```
Fixed a grammatical error.
```
2308f3d4

Fix tapas scatter (#20149) · 78a471ff

Bartosz Szmelczynski authored Nov 14, 2022



* First draft

* Remove scatter dependency

* Add require_torch

* update vectorized sum test, add clone call

* remove artifacts

* fix style

* fix style v2

* remove "scatter" mentions from the code base

* fix isort error
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

78a471ff

add MobileNetV2 model (#17845) · f711d683

Matthijs Hollemans authored Nov 14, 2022

* add model files etc for MobileNetV2

* rename files for MobileNetV1

* initial implementation of MobileNetV1

* fix conversion script

* cleanup

* write docs

* tweaks

* fix conversion script

* extract hidden states

* fix test cases

* make fixup

* fixup it all

* rename V1 to V2

* fix checkpoints

* fixup

* implement first block + weight conversion

* add remaining layers

* add output stride and dilation

* fixup

* add tests

* add deeplabv3+ head

* a bit of fixup

* finish deeplab conversion

* add link to doc

* fix issue with JIT trace

in_height and in_width would be Tensor objects during JIT trace, which caused Core ML conversion to fail on the remainder op. By making them ints, the result of the padding calculation becomes a constant value.

* cleanup

* fix order of models

* fix rebase error

* remove main from doc link

* add image processor

* remove old feature extractor

* fix converter + other issues

* fixup

* fix unit test

* add to onnx tests (but these appear broken now)

* add post_process_semantic_segmentation

* use google org

* remove unused imports

* move args

* replace weird assert

f711d683

11 Nov, 2022 3 commits
- Fix type - update any PIL.Image.Resampling (#20172) · 6cc06d17
  amyeroberts authored Nov 11, 2022
  
  6cc06d17
- [OWL-ViT] Make model consistent with CLIP (#20144) · cbbeca3d
  NielsRogge authored Nov 11, 2022
```
* Apply fix

* Fix test

* Remove another argument which is not used

* Fix pipeline test

* Add argument back, add deprecation warning

* Add warning add other location

* Use warnings instead

* Add num_channels to config
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
```
  cbbeca3d
- Fix object-detection bug (height, width inversion). (#20167) · d3c05666
  Nicolas Patry authored Nov 11, 2022
  
  d3c05666
10 Nov, 2022 7 commits
- Add Jukebox model (replaces #16875) (#17826) · 61a51f5f
  Arthur authored Nov 10, 2022
  
  61a51f5f
- Skip broken test · 9740a03f
  Sylvain Gugger authored Nov 10, 2022
  
  9740a03f
- [processor] Add 'model input names' property (#20117) · 905e5773
  Sanchit Gandhi authored Nov 10, 2022
```
* [processor] Add 'model input names' property

* add test

* no f string

* add generic property method to mixin

* copy to multimodal

* copy to vision

* tests for all audio

* remove ad-hoc tests

* style

* fix flava test

* fix test

* fix processor code
```
  905e5773
- Fix arg names for our models (#20166) · 68187c46
  Matt authored Nov 10, 2022
```
* Fix arg names for our models

* Clean out the other uses of "residx" in infer()

* make fixup
```
  68187c46
- Generate: fix TF doctests (#20159) · 6dda14dc
  Joao Gante authored Nov 10, 2022
  
  6dda14dc
- Update `OnnxConfig.generate_dummy_inputs` to check `ImageProcessingMixin` (#20157) · e0d7c831
  Yih-Dar authored Nov 10, 2022
```
* Check ImageProcessingMixin in OnnxConfig.generate_dummy_inputs

* Check ImageProcessingMixin in OnnxConfig.generate_dummy_inputs

* Add back
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  e0d7c831
- doc comment fix: Args was in wrong place (#20164) · daf4436e
  Matthijs Hollemans authored Nov 10, 2022
  
  daf4436e