Commits · 0b5bf6abef93220fe1cf35ece99d0b54d6f00f3d · chenpangpang / transformers

25 Feb, 2022 2 commits
- Re-enable doctests for the quicktour (#15828) · 0118c4f6
  Sylvain Gugger authored Feb 25, 2022
```
* Re-enable doctests for the quicktour

* Re-enable doctests for task_summary (#15830)

* Remove &
```
  0118c4f6
- Add model specific output classes to PoolFormer model docs (#15746) · 7566734d
  Tanay Mehta authored Feb 25, 2022
```
* Added model specific output classes to poolformer docs

* Fixed Segformer typo in Poolformer docs
```
  7566734d
23 Feb, 2022 3 commits

Steven Liu authored Feb 23, 2022

* clean commit of changes to NLP tasks

* 🖍 apply feedback

* 📝

 move tf data collator in multiple choice
Co-authored-by: Steven <stevhliu@gmail.com>

fecb08c2

[doc] custom_models: mention security features of the Hub (#15768) · 32f5de10

Julien Chaumond authored Feb 23, 2022



* custom_models: tiny doc addition

* mention security feature earlier in the section
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

32f5de10

Adding ZeroShotImageClassificationPipeline (#12119) · f9582c20

Nicolas Patry authored Feb 23, 2022



* [Proposal] Adding ZeroShotImageClassificationPipeline

- Based on CLIP

* WIP, Resurection in progress.

* Resurrection... achieved.

* Reword handling different `padding_value` for `feature_extractor` and
`tokenizer`.

* Thanks doc-builder !

* Adding docs + global namespace `ZeroShotImageClassificationPipeline`.

* Fixing templates.

* Make the test pass and be robust to floating error.

* Adressing suraj's comments on docs mostly.

* Tf support start.

* TF support.

* Update src/transformers/pipelines/zero_shot_image_classification.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

f9582c20

22 Feb, 2022 2 commits

Time stamps for CTC models (#15687) · c44d3675

Patrick von Platen authored Feb 22, 2022



* [Wav2Vec2 Time Stamps]

* Add first version

* add word time stamps

* Fix

* save intermediate space

* improve

* [Finish CTC Tokenizer]

* remove @

* remove @

* push

* continue with phonemes

* up

* finish PR

* up

* add example

* rename

* finish

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* correct split

* finalize
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c44d3675

added link to our writing-doc document (#15756) · 38bed912
Francesco Saverio Zuppichini authored Feb 22, 2022

38bed912

21 Feb, 2022 1 commit
- TF text classification examples (#15704) · 3956b133
  Joao Gante authored Feb 21, 2022
```
* Working example with to_tf_dataset

* updated text_classification

* more comments
```
  3956b133
18 Feb, 2022 3 commits

Add missing PLBart entry in README (#15721) · 2c2a31ff

Gunjan Chhablani authored Feb 19, 2022

* Add missing PLBart entry in index

* Fix README

* Fix README

* Fix style

* Change to master model doc

2c2a31ff

Add PLBart (#13269) · ae1f8350

Gunjan Chhablani authored Feb 18, 2022

* Init PLBART

* Add missing configuration file

* Add conversion script and configurationf ile

* Fix style

* Update modeling and conversion scripts

* Fix scale embedding in config

* Add comment

* Fix conversion script

* Add classification option to conversion script

* Fix vocab size in config doc

* Add tokenizer files from MBart50

* Allow no lang code in regular tokenizer

* Add PLBart Tokenizer Converters

* Remove mask from multi tokenizer

* Remove mask from multi tokenizer

* Change from MBart-50 to MBart tokenizer

* Fix names and modify src/tgt behavior

* Fix imports for tokenizer

* Remove <mask> from multi tokenizer

* Fix style

* Change tokenizer_class to processor_class

* Add attribute map to config class

* Update modeling file to modified MBart code

* Update configuration file to MBart style configuration

* Fix tokenizer

* Separate tokenizers

* Fix error in tokenization auto

* Copy MBart tests

* Replace with MBart tokenization tests

* Fix style

* Fix language code in multi tokenizer

* Fix configuration docs

* Add entry for plbart_multi in transformers init

* Add dummy objects and fix imports

* Fix modeling tests

* Add TODO in config

* Fix copyright year

* Fix modeling docs and test

* Fix some tokenization tests and style

* Add changes from review

* Fix copies

* Fix docs

* Fix docs

* Fix style

* Fix year

* Add changes from review

* Remove extra changes

* Fix base tokenizer and doc

* Fix style

* Fix modeling and slow tokenizer tests

* Remove Multi-tokenizer Converter and Tests

* Delete QA model and Multi Tokenizer dummy objects

* Fix repo consistency and code quality issues

* Fix example documentation

* Fix style

* Remove PLBartTokenizer from type checking in init

* Fix consistency issue

* Add changes from review

* Fix style

* Remove PLBartTokenizerFast

* Remove FastTokenizer converter

* Fix AutoTokenzier mapping

* Add plbart to toctree and fix consistency issues

* Add language codes tokenizer test

* Fix styling and doc issues

* Add fixes for failing tests

* Fix copies

* Fix failing modeling test

* Change assert to assertTrue in modeling tests

ae1f8350

Adding a model, more doc for pushing to the hub (#15690) · 240cc6cb

Francesco Saverio Zuppichini authored Feb 18, 2022



* doc for adding a model to the hub

* run make style

* resolved conversation

* removed a line

* removed )

* Update docs/source/add_new_model.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/add_new_model.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* make style
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

240cc6cb

17 Feb, 2022 3 commits

Add SimMIM (#15586) · 57882177

NielsRogge authored Feb 17, 2022



* Add first draft

* Make model importable

* Make SwinForMaskedImageModeling importable

* Fix imports

* Add missing inits

* Add support for Swin

* Fix bug

* Fix bug

* Fix another bug

* Fix Swin MIM implementation

* Fix default encoder stride

* Fix Swin

* Add print statements for debugging

* Add image_size data argument

* Fix Swin

* Fix image_size

* Add print statements for debugging

* Fix print statement

* Remove print statements

* Improve reshaping of bool_masked_pos

* Add support for DeiT, fix tests

* Improve docstrings

* Apply new black version

* Improve script

* Fix bug

* Improve README

* Apply suggestions from code review

* Remove DS_Store and add to gitignore

* Apply suggestions from code review + fix BEiT Flax

* Revert BEiT changes

* Improve README

* Fix code quality

* Improve README
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

57882177

Minor fix on README.md (#15688) · 92a537d9

Yih-Dar authored Feb 17, 2022



* fix README

* fix more arxiv links

* make fix-copies
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

92a537d9

Add PoolFormer (#15531) · f84e0dbd

Tanay Mehta authored Feb 17, 2022



* Added all files, PoolFormerFeatureExtractor still failing tests

* Fixed PoolFormerFeatureExtractor not being able to import

* Completed Poolformer doc

* Applied Suggested fixes

* Fixed errors in modeling_auto.py

* Fix feature extractor, convert docs to Markdown, styling of code

* Remove PoolFormer from check_repo and fix integration test

* Remove Poolformer from check_repo

* Fixed configuration_poolformer.py docs and removed inference.py from poolformer

* Ran with black v22

* Added PoolFormer to _toctree.yml

* Updated poolformer doc

* Applied suggested fixes and added on README.md

* Did make fixup and make fix-copies, tests should pass now

* Changed PoolFormer weights conversion script name and fixed README

* Applied fixes in test_modeling_poolformer.py and modeling_poolformer.py

* Added PoolFormerFeatureExtractor to AutoFeatureExtractor API
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>

f84e0dbd

16 Feb, 2022 2 commits

Usage examples for logger (#15657) · b87c044c

Francesco Saverio Zuppichini authored Feb 16, 2022



* logger

* Update docs/source/main_classes/logging.mdx
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update docs/source/main_classes/logging.mdx
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

b87c044c

[t5/t0/mt5 models] faster/leaner custom layer norm (#14656) · bee361c6

Stas Bekman authored Feb 15, 2022

* [t5] faster/leaner custom layer norm

* wip

* apex.normalization.FusedRMSNorm

* cleanup

* cleanup

* add doc

* add catch all

* Trigger CI

* expand

bee361c6

15 Feb, 2022 7 commits

TF generate refactor - Greedy Search (#15562) · 2e12b907

Patrick von Platen authored Feb 15, 2022



* TF generate start refactor

* Add tf tests for sample generate

* re-organize

* boom boom

* Apply suggestions from code review

* re-add

* add all code

* make random greedy pass

* make encoder-decoder random work

* further improvements

* delete bogus file

* make gpt2 and t5 tests work

* finish logits tests

* correct logits processors

* correct past / encoder_outputs drama

* refactor some methods

* another fix

* refactor shape_list

* fix more shape list

* import shape
_list

* finish docs

* fix imports

* make style

* correct tf utils

* Fix TFRag as well

* Apply Lysandre's and Sylvais suggestions

* Update tests/test_generation_tf_logits_process.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Update src/transformers/tf_utils.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* remove cpu according to gante

* correct logit processor
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

2e12b907

Re-export `KeyDataset`. (#15645) · cdf19c50
Nicolas Patry authored Feb 15, 2022
```
* Re-export `KeyDataset`.

* Update the docs locations.
```
cdf19c50
add a network debug script and document it (#15652) · 28e6155d
Stas Bekman authored Feb 15, 2022
```
* add a network debug script and document it

* doc
```
28e6155d

Add section about doc testing (#15659) · f45ac11f

Patrick von Platen authored Feb 15, 2022



* Add doctesting section

* Improve

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

f45ac11f

Fix typo in speech2text2 doc (#15617) · 86a7845c
jonrbates authored Feb 15, 2022
```
Forward looks for inputs, not input_ids
```
86a7845c
Revert "logger doc" · 05a85809
fra authored Feb 15, 2022
```
This reverts commit 41168a49.
```
05a85809
logger doc · 41168a49
fra authored Feb 15, 2022

41168a49

14 Feb, 2022 1 commit

Make Swin work with VisionEncoderDecoderModel (#15527) · b090b790

NielsRogge authored Feb 14, 2022



* Add attribute_map

* Add mention in docs

* Set hidden_size attribute correctly

* Add note about Transformer-based models only
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>

b090b790

11 Feb, 2022 4 commits
- Fix grammar in tokenizer_summary (#15614) · 4f403ea8
  Daniel Erenrich authored Feb 11, 2022
```
"to make ensure" is redundant.
```
  4f403ea8
- [deepspeed docs] misc additions (#15585) · f15c99fa
  Stas Bekman authored Feb 11, 2022
```
* [deepspeed docs] round_robin_gradients

* training and/or eval/predict loss is

* Update docs/source/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  f15c99fa
- 🖍 remove broken link (#15615) · 85aee09e
  Steven Liu authored Feb 11, 2022
  
  85aee09e
- Mark "code in the Hub" API as experimental (#15624) · 6cf06d19
  Sylvain Gugger authored Feb 11, 2022
  
  6cf06d19
10 Feb, 2022 3 commits

Correct JSON format (#15600) · c0864d98
Ngo Quang Huy authored Feb 11, 2022

c0864d98
Add local and TensorFlow ONNX export examples to docs (#15604) · 2e8b85f7
lewtun authored Feb 10, 2022
```
* Add local and TensorFlow ONNX export examples to docs

* Use PyTorch - TensorFlow split
```
2e8b85f7

Add Tensorflow handling of ONNX conversion (#13831) · cb7ed6e0

Alberto Bégué authored Feb 10, 2022



* Add TensorFlow support for ONNX export

* Change documentation to mention conversion with Tensorflow

* Refactor export into export_pytorch and export_tensorflow

* Check model's type instead of framework installation to choose between TF and Pytorch
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Alberto Bégué <alberto.begue@della.ai>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

cb7ed6e0

09 Feb, 2022 6 commits

Expand tutorial for custom models (#15587) · c722753a

Sylvain Gugger authored Feb 09, 2022



* Expand tutorial for custom models

* Style

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

c722753a

Add link (#15588) · a86ee226

NielsRogge authored Feb 09, 2022


Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>

a86ee226

[trainer docs] document how to select specific gpus (#15551) · dee17d56
Stas Bekman authored Feb 09, 2022
```
* [trainer docs] document how to select specific gpus

* expand

* add urls

* add accelerate launcher
```
dee17d56

Constrained Beam Search [without disjunctive decoding] (#15416) · 2b5603f6

Chan Woo Kim authored Feb 10, 2022



* added classes to get started with constrained beam search

* in progress, think i can directly force tokens now but not yet with the round robin

* think now i have total control, now need to code the bank selection

* technically works as desired, need to optimize and fix design choices leading to undersirable outputs

* complete PR #1 without disjunctive decoding

* removed incorrect tests

* Delete k.txt

* Delete test.py

* Delete test.sh

* revert changes to test scripts

* genutils

* full implementation with testing, no disjunctive yet

* shifted docs

* passing all tests realistically ran locally

* removing accidentally included print statements

* fixed source of error in initial PR test

* fixing the get_device() vs device trap

* fixed documentation docstrings about constrained_beam_search

* fixed tests having failing for Speech2TextModel's floating point inputs

* fix cuda long tensor

* added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search

* deleted accidentally added test halting code with assert False

* code reformat

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/test_generation_utils.py

* fixing based on comments on PR

* took out the testing code that should but work fails without the beam search moditification ; style changes

* fixing comments issues

* docstrings for ConstraintListState

* typo in PhrsalConstraint docstring

* docstrings improvements
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

2b5603f6

add model scaling section (#15119) · d923f762

Leandro von Werra authored Feb 09, 2022



* add model scaling section

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* integrate reviewer feedback

* initialize GPU properly

* add note about BnB optimizer

* move doc from `scaling.mdx` to `performance.mdx`

* integrate reviewer feedback

* revert section levels
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d923f762

PoC for a ProcessorMixin class (#15549) · b5c6fdec

Sylvain Gugger authored Feb 09, 2022



* PoC for a ProcessorMixin class

* Documentation

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Roll out to other processors

* Add base feature extractor class in init

* Use args and kwargs
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

b5c6fdec

08 Feb, 2022 3 commits

📝 Add codecarbon callback to docs (#15563) · fcb4f11c
Nathan Raw authored Feb 08, 2022

fcb4f11c

Add TFSpeech2Text (#15113) · 8406fa6d

Joao Gante authored Feb 08, 2022

* Add wrapper classes

* convert inner layers to tf

* Add TF Encoder and Decoder layers

* TFSpeech2Text models

* Loadable model

* TF model with same outputs as PT model

* test skeleton

* correct tests and run the fixup

* correct attention expansion

* TFSpeech2Text pask_key_values with TF format

8406fa6d

electra is added to onnx supported model (#15084) · 87d08afb

aaron authored Feb 08, 2022



* electra is added to onnx supported model

* add google/electra-base-generator for test onnx module
Co-authored-by: Lewis Tunstall <lewis.c.tunstall@gmail.com>

87d08afb