Commits · 163ac3d3eeb4537eb2f5381f1331cafe91bfe4c2 · chenpangpang / transformers

15 Nov, 2022 3 commits

Add Switch transformers (#19323) · 163ac3d3

Younes Belkada authored Nov 15, 2022



* first commit

* add more comments

* add router v1

* clean up

- remove `tf` modeling files

* clean up

- remove `tf` modeling files

* clean up

* v0 routers

* added more router

- Implemented `ExpertsChooseMaskedRouter`

- added tests
- 2 more routers to implement

* last router

* improved docstring

- completed the docstring in `router.py`
- added more args in the config

* v0 sparse mlp

* replace wrong naming

* forward pass run

* update MOE layer

* small router update

* fixup

* consistency

* remove scatter router

* remove abstract layer

* update test and model for integration testing

* v1 conversion

* update

* hardcode hack

* all keys match

* add gin conversion, without additional libraries

* update conversion script

* delete router file

* update tests wrt router deletion

* fix router issues

* update expert code

* update, logits match, code needs REFACTORING

* Refactor code
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>

* add generate tests
Co-authored-by: younesbelkada <younesbelkada@gmail.com>

* add support for router loss
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>

* fix forward error

* refactor a bit

* remove `FlaxSwitchTransformers` modules

* more tests pass

* Update code
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>

* fixup

* fix tests

* fix doc

* fix doc + tokenization

* fix tokenizer test

* fix test

* fix loss output

* update code for backward pass

* add loss support

* update documentation

* fix documentation, clean tokenizer

* more doc fix, cleanup example_switch

* fix failing test

* fix test

* fix test

* fix loss issue

* move layer

* update doc and fix router capacity usage

* fixup

* add sparse mlp index for documentation on hub

* fixup

* test sparse mix architecture

* Apply suggestions from code review

* Update docs/source/en/model_doc/switch_transformers.mdx

* fixup on update

* fix tests

* fix another test

* attempt fix

* Update src/transformers/models/switch_transformers/configuration_switch_transformers.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/switch_transformers/convert_switch_transformers_original_flax_checkpoint_to_pytorch.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* try

* all tests pass

* fix jitter noise

* Apply suggestions from code review

* doc tests pass

* Update src/transformers/models/switch_transformers/modeling_switch_transformers.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/switch_transformers/modeling_switch_transformers.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* remove assert

* change config order

* fix readme japanese

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove parallelizable tests + add one liners

* remove ONNX config

* fix nits

- add `T5Tokenizer` in auto mapping
- remove `Switch Transformers` from ONNX supported models

* remove `_get_router`

* remove asserts

* add check in test for `router_dtype`

* add `SwitchTransformersConfig` in `run_pipeline_test`

* Update tests/pipelines/test_pipelines_summarization.py

* add huge model conversion script

* fix slow tests

- add better casting for `Linear8bitLt`
- remove `torchscript` tests

* add make dir

* style on new script

* fix nits

- doctest
- remove `_keys_to_ignore_on_load_unexpected`

* Update src/transformers/models/switch_transformers/configuration_switch_transformers.py

* add google as authors

* fix year

* remove last `assert` statements

* standardize vertical spaces

* fix failing import

* fix another failing test

* Remove strange `authorized_keys`

* removing todo and padding that is never used
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: ybelkada <younes@huggingface.co>
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Arthur Zucker <arthur@huggingface.co>

163ac3d3

Update tokenizer_summary.mdx (#20135) · 9625924c
bofeng huang authored Nov 15, 2022

9625924c

[docs] set overflowing image width to auto-scale (#20197) · 8fadfd50

Wonhyeong Seo authored Nov 15, 2022

* docs: fix: set overflowing image width to auto-scale

* docs: fix: new language Korean is also affected

* docs: fix: unnecessary line break in index page

8fadfd50

14 Nov, 2022 2 commits

Fix tapas scatter (#20149) · 78a471ff

Bartosz Szmelczynski authored Nov 14, 2022



* First draft

* Remove scatter dependency

* Add require_torch

* update vectorized sum test, add clone call

* remove artifacts

* fix style

* fix style v2

* remove "scatter" mentions from the code base

* fix isort error
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

78a471ff

add MobileNetV2 model (#17845) · f711d683

Matthijs Hollemans authored Nov 14, 2022

* add model files etc for MobileNetV2

* rename files for MobileNetV1

* initial implementation of MobileNetV1

* fix conversion script

* cleanup

* write docs

* tweaks

* fix conversion script

* extract hidden states

* fix test cases

* make fixup

* fixup it all

* rename V1 to V2

* fix checkpoints

* fixup

* implement first block + weight conversion

* add remaining layers

* add output stride and dilation

* fixup

* add tests

* add deeplabv3+ head

* a bit of fixup

* finish deeplab conversion

* add link to doc

* fix issue with JIT trace

`in_height` and `in_width` would be Tensor objects during JIT trace, which caused Core ML conversion to fail on the remainder op. Making them ints turns the result of the padding calculation into a constant value.

* cleanup

* fix order of models

* fix rebase error

* remove main from doc link

* add image processor

* remove old feature extractor

* fix converter + other issues

* fixup

* fix unit test

* add to onnx tests (but these appear broken now)

* add post_process_semantic_segmentation

* use google org

* remove unused imports

* move args

* replace weird assert

f711d683
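
The JIT trace fix in the commit above can be illustrated with a small sketch. The commit does not show the code, so the function name `same_padding` and the TensorFlow-style "SAME" padding formula are assumptions made for illustration. The point is only that coercing the sizes to plain ints makes the remainder an ordinary Python computation, so the padding comes out as constants.

```python
def same_padding(in_height, in_width, kernel_size, stride):
    """Return (left, right, top, bottom) padding in the TensorFlow "SAME" style.

    Hypothetical helper, not the code from the commit.
    """
    # Coerce to plain Python ints first, so the remainder below is ordinary
    # integer arithmetic rather than an op recorded on a traced tensor.
    in_height, in_width = int(in_height), int(in_width)

    def total_pad(size):
        remainder = size % stride
        if remainder == 0:
            return max(kernel_size - stride, 0)
        return max(kernel_size - remainder, 0)

    pad_h, pad_w = total_pad(in_height), total_pad(in_width)
    left, top = pad_w // 2, pad_h // 2
    return left, pad_w - left, top, pad_h - top


print(same_padding(224, 224, kernel_size=3, stride=2))  # (0, 1, 0, 1)
```

With an odd input size such as 225 the same call yields symmetric padding of one pixel on each side.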

10 Nov, 2022 2 commits

Add Jukebox model (replaces #16875) (#17826) · 61a51f5f

Arthur authored Nov 10, 2022

61a51f5f

Add doc tests (#20158) · 9f0c72f9

NielsRogge authored Nov 10, 2022

Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>

9f0c72f9

09 Nov, 2022 3 commits

[CLIPSeg] Add resources (#20118) · 93e14486

NielsRogge authored Nov 09, 2022



* Add resource

* Add tag
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

93e14486

add cv + audio labels (#20114) · a44985b4
Steven Liu authored Nov 09, 2022

a44985b4

Generate: move generation_*.py src files into generation/*.py (#20096) · f270b960

Joao Gante authored Nov 09, 2022

* move generation_*.py src files into generation/*.py

* populate generation.__init__ with lazy loading

* move imports and references from generation.xxx.object to generation.object

f270b960
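
The "lazy loading" step in the commit above means that names exposed by a package are imported only when first accessed. The sketch below is not the library's own helper. It swaps in the standard module-level `__getattr__` hook (PEP 562) as a stand-in, and `make_lazy_module` plus the `json`/`math` mapping are hypothetical names chosen so the example runs with the standard library alone.

```python
import importlib
import types


def make_lazy_module(name, mapping):
    """Build a module whose attributes are imported on first access.

    `mapping` maps a public attribute name to the module that defines it.
    """
    module = types.ModuleType(name)

    def __getattr__(attr):
        # Called only when normal attribute lookup on the module fails.
        if attr not in mapping:
            raise AttributeError(f"module {name!r} has no attribute {attr!r}")
        value = getattr(importlib.import_module(mapping[attr]), attr)
        setattr(module, attr, value)  # cache so later lookups skip this hook
        return value

    module.__getattr__ = __getattr__
    return module


lazy = make_lazy_module("lazy_demo", {"sqrt": "math", "dumps": "json"})
print(lazy.sqrt(9.0))  # 3.0
```

After the first access the resolved object is stored on the module, so the import cost is paid once per name.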

08 Nov, 2022 3 commits

AutoImageProcessor (#20111) · 4eb918e6

amyeroberts authored Nov 08, 2022

* AutoImageProcessor skeleton

* Update references

* Add mapping in init

* Add model image processors to __init__ for importing

* Add AutoImageProcessor tests

* Fix up

* Image Processor documentation

* Remove pdb

* Update docs/source/en/model_doc/mobilevit.mdx

* Update docs

* Don't add whitespace on json files

* Remove fixtures

* Move checking model config down

* Fix up

* Add check for image processor

* Remove FeatureExtractorMixin in docstrings

* Rename model_tmpfile to config_tmpfile

* Don't make None if not in image processor map

4eb918e6

Add RocBert (#20013) · efa889d2

Weiwe Shi authored Nov 08, 2022



* add roc_bert

* update roc_bert readme

* code style

* change name and delete unused file

* update model file

* delete unused log file

* delete tokenizer fast

* reformat code and change model file path

* add RocBertForPreTraining

* update docs

* delete wrong notes

* fix copies

* fix make repo-consistency error

* fix files are not present in the table of contents error

* change RocBert -> RoCBert

* add doc, add detail test
Co-authored-by: weiweishi <weiweishi@tencent.com>

efa889d2

Add CLIPSeg (#20066) · 25896306

NielsRogge authored Nov 08, 2022



* Add first draft

* Update conversion script

* Improve conversion script

* Improve conversion script some more

* Add conditional embeddings

* Add initial decoder

* Fix activation function of decoder

* Make decoder outputs match original implementation

* Make decoder outputs match original implementation

* Add more copied from statements

* Improve model outputs

* Fix auto tokenizer file

* Fix more tests

* Add test

* Improve README and docs, improve conditional embeddings

* Fix more tests

* Remove print statements

* Remove initial embeddings

* Improve conversion script

* Add interpolation of position embeddings

* Finish addition of interpolation of position embeddings

* Add support for refined checkpoint

* Fix refined checkpoint

* Remove unused parameter

* Improve conversion script

* Add support for training

* Fix conversion script

* Add CLIPSegFeatureExtractor

* Fix processor

* Fix CLIPSegProcessor

* Fix conversion script

* Fix most tests

* Fix equivalence test

* Fix README

* Add model to doc tests

* Use better variable name

* Convert other checkpoint as well

* Update config, add link to paper

* Add docs

* Update organization

* Replace base_model_prefix with clip

* Fix base_model_prefix

* Fix checkpoint of config

* Fix config checkpoint

* Remove file

* Use logits for output

* Fix tests
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

25896306

07 Nov, 2022 5 commits

Replace awkward timm link with the expected one (#20109) · 6156bffa
Tom Aarsen authored Nov 07, 2022

6156bffa

Add new terms to the glossary (#20051) · 71f772eb

Steven Liu authored Nov 07, 2022

* add new terms

* apply review

71f772eb

docs: Fixed variables in f-strings (#20087) · d44ac47b

Tom Aarsen authored Nov 07, 2022



* docs: Fixed variables in f-strings

* Replace unknown `block` with known `block_type` in ValueError
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add missing torch import in docs code block
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d44ac47b
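
The kind of bug addressed by the commit above, an f-string that names a variable not in scope, only surfaces when the error path actually runs. A minimal sketch, with a hypothetical function `check_block_type` and made-up allowed values (the real code is not shown in the log):

```python
def check_block_type(block_type):
    allowed = ("down", "mid", "up")  # hypothetical values for illustration
    if block_type not in allowed:
        # Interpolate the name that is actually in scope. Writing {block}
        # here would raise NameError instead of the intended ValueError.
        raise ValueError(f"Unknown block type {block_type!r}, expected one of {allowed}")
    return block_type
```

Calling `check_block_type("side")` raises a `ValueError` whose message names the offending value, which is what the caller needs to see.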

docs: Resolve many typos in the English docs (#20088) · 3222fc64

Tom Aarsen authored Nov 07, 2022

* docs: Fix typo in ONNX parser help: 'tolerence' => 'tolerance'

* docs: Resolve many typos in the English docs

Typos found via 'codespell ./docs/source/en'

3222fc64

Replace unsupported facebookresearch/bitsandbytes (#20093) · b8112edd

Tom Aarsen authored Nov 07, 2022

With https://github.com/TimDettmers/bitsandbytes, which is by the same author and is still being updated

b8112edd

04 Nov, 2022 2 commits

Update documentation on seq2seq models with absolute positional embeddings, to... · 3bd0007e

Jordan Clive authored Nov 04, 2022


Update documentation on seq2seq models with absolute positional embeddings, to be in line with Tips section for BERT and GPT2 (#20068)
Co-authored-by: jordiclive <jordiclive19@imperial.ac.uk>

3bd0007e

Update READMEs for ESMFold and add notebooks (#20067) · 6e1c5786

Matt authored Nov 04, 2022

* Update READMEs for ESMFold and add notebooks

* Fix PyCharm formatting

* make fix-copies

6e1c5786

03 Nov, 2022 3 commits

fix jit trace error for model forward sequence is not aligned with jit.trace... · 2564f0c2

Wang, Yi authored Nov 03, 2022


fix jit trace error for model forward sequence is not aligned with jit.trace tuple input sequence, update related doc (#19891)

* fix jit trace error for classification usecase, update related doc
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* add implementation in torch 1.14.0
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* update_doc
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* update_doc
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

2564f0c2

[Whisper Tokenizer] Make more user-friendly (#19921) · 06d48806

Sanchit Gandhi authored Nov 03, 2022



* [Whisper Tokenizer] Make more user-friendly

* use property

* make indexing rigorous

* small clean-up

* tests

* skip seq2seq tests

* remove multilingual arg

* reorder args

* collapse to one function
Co-authored-by: ArthurZucker <arthur@huggingface.co>

* option to override attributes
Co-authored-by: ArthurZucker <arthur@huggingface.co>

* add to docs

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* make comment more clear
Co-authored-by: sgugger <sylvain@huggingface.co>

* don't add special tokens in get_decoder_prompt_ids

* add test for set_prefix_tokens
Co-authored-by: ArthurZucker <arthur@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: sgugger <sylvain@huggingface.co>

06d48806

Fix some doctests after PR 15775 (#20036) · 9ccea7ac

Yih-Dar authored Nov 03, 2022



* Add skip_special_tokens=True in some doctest

* For T5

* Fix for speech_to_text.mdx
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

9ccea7ac

02 Nov, 2022 3 commits

reorganize glossary (#20010) · aa39967b
Steven Liu authored Nov 02, 2022

aa39967b

Fix doctest (#20023) · fb7cbe23

Yih-Dar authored Nov 02, 2022



* Fix doctest
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

fb7cbe23

Add Image Processors (#19796) · a6b77598

amyeroberts authored Nov 02, 2022



* Add CLIP image processor

* Crop size as dict too

* Update warning

* Actually use logger this time

* Normalize doesn't change dtype of input

* Add perceiver image processor

* Tidy up

* Add DPT image processor

* Add Vilt image processor

* Tidy up

* Add poolformer image processor

* Tidy up

* Add LayoutLM v2 and v3 image processors

* Tidy up

* Add Flava image processor

* Tidy up

* Add deit image processor

* Tidy up

* Add ConvNext image processor

* Tidy up

* Add levit image processor

* Add segformer image processor

* Add in post processing

* Fix up

* Add ImageGPT image processor

* Fixup

* Add mobilevit image processor

* Tidy up

* Add postprocessing

* Fixup

* Add VideoMAE image processor

* Tidy up

* Add ImageGPT image processor

* Fixup

* Add ViT image processor

* Tidy up

* Add beit image processor

* Add mobilevit image processor

* Tidy up

* Add postprocessing

* Fixup

* Fix up

* Fix flava and remove tree module

* Fix image classification pipeline failing tests

* Update feature extractor in trainer scripts

* Update pad_if_smaller to accept tuple and int size

* Update for image segmentation pipeline

* Update src/transformers/models/perceiver/image_processing_perceiver.py
Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>

* Update src/transformers/image_processing_utils.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/beit/image_processing_beit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* PR comments - docstrings; remove accidentally added resize; var names

* Update docstrings

* Add exception if size is not in the right format

* Fix exception check

* Fix up

* Use shortest_edge in tuple in script
Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

a6b77598

01 Nov, 2022 7 commits

fix typo (#20006) · 79c720c0
Steven Liu authored Nov 01, 2022

79c720c0

Add LayoutLMv3 resource (#19932) · ab74ac11

Steven Liu authored Nov 01, 2022

* add layoutlmv3 resource

* add layoutlmv2 resources

* fix button

ab74ac11

Add BERT resources (#19852) · dec8578e

Steven Liu authored Nov 01, 2022

* add resources for bert

* add course chapters

* apply reviews

* add pipeline icons and community resource

* fix buttons

dec8578e

add dataset (#20005) · 1f6885ba
Steven Liu authored Nov 01, 2022

1f6885ba

Update image_classification.mdx (#19996) · c87ae86a
Sayak Paul authored Nov 01, 2022

c87ae86a

Added onnx config whisper (#19525) · c796b6de

Mohit Sharma authored Nov 01, 2022

* Added onnx config whisper

* added whisper support onnx

* add audio input data

* added whisper support onnx

* fixed the seqlength value

* Updated the whisper onnx config

* restore files to old version

* removed attention mask from inputs

* Updated get_dummy_input_onnxruntime docstring

* Updated relative imports and token generation

* update docstring

c796b6de

Add ESMFold (#19977) · 7f9b7b3f

Matt authored Nov 01, 2022



* initial commit

* First draft that gets outputs without crashing!

* Add all the ported openfold dependencies

* testing

* Restructure config files for ESMFold

* Debugging to find output discrepancies

* Mainly style

* Make model runnable without extra deps

* Remove utils and merge them to the modeling file

* Use correct gelu and remove some debug prints

* More cleanup

* Update esm docs

* Update conversion script to support ESMFold properly

* Port some top-level changes from ESMFold repo

* Expand EsmFold docstrings

* Make attention_mask optional (default to all 1s)

* Add inference test for ESMFold

* Use config and not n kwargs

* Add modeling output class

* Remove einops

* Remove chunking in ESM FFN

* Update tests for ESMFold

* Quality

* Repo consistency

* Remove tree dependency from ESMFold

* make fixup

* Add an error in case my structure map function breaks later

* Remove needless code

* Stop auto-casting the LM to float16 so CPU tests pass

* Stop auto-casting the LM to float16 so CPU tests pass

* Final test updates

* Split test file

* Copyright and quality

* Unpin PyTorch to see built doc

* Fix config file to_dict() method

* Add some docstrings to the output

* Skip TF checkpoint tests for ESM until we reupload those

* make fixup

* More docstrings

* Unpin to get even with main

* Flag example to write
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

7f9b7b3f

31 Oct, 2022 1 commit

[Conditional, Deformable DETR] Add postprocessing methods (#19709) · 0b294c23

NielsRogge authored Oct 31, 2022



* Add postprocessing methods

* Update docs

* Add fix

* Add test

* Add test for deformable detr postprocessing

* Add post processing methods for segmentation

* Update code examples

* Add post_process to make the pipeline work

* Apply updates
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

0b294c23

28 Oct, 2022 4 commits

Add wav2vec2 resources (#19931) · 2e35bac4

Steven Liu authored Oct 28, 2022



* add wav2vec2 resources

* apply review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

2e35bac4

add resources for distilbert (#19930) · 9d2788b4
Steven Liu authored Oct 28, 2022

9d2788b4

add resources for bart (#19928) · b0a2c3a2
Steven Liu authored Oct 28, 2022

b0a2c3a2

Add Onnx Config for ImageGPT (#19868) · 0d4c45c5

Raghav Prabhakar authored Oct 28, 2022



* add Onnx Config for ImageGPT

* add generate_dummy_inputs for onnx config

* add TYPE_CHECKING clause

* Update doc for generate_dummy_inputs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

0d4c45c5

27 Oct, 2022 2 commits

Add GPT2 resources (#19879) · e4132952

Steven Liu authored Oct 27, 2022

* add resources for gpt2

* add pipeline icons and community resources

e4132952

Add BLOOM resources (#19881) · d818dd3a

Steven Liu authored Oct 27, 2022

* add bloom resources

* add pipeline icon

d818dd3a