- 10 Oct, 2022 15 commits
-
-
Kaiyu Yang authored
-
amyeroberts authored
* simplify loop * add feature extractor * add model * start conversion * add dropout * initial commit of test files * conversion for all models * update processor for correct padding * update feature extraction * update integration test logits match * fmt: off for the logits * on the fly mel bank * small nit * update test * update tokenizer * nit feature extraction * update * update tokenizer test * add logits processor and update tokenizer to get suppress tokens * style * clean convert * revert to original modeling tf utils * Update * update * nit * clean convert file * update tests and nits * quality * slow generation test * ffn_dim to allow customization * update readme * add to toctree * start fixing integration tests * update tests and code * fix feature extractor * fix config tests common * update code to fix tests * fix feature extractor * nit feature extraction * update test for new feature extractor * style * add abstract * large logits with custom decoder input ids * wrap around is_torch_available * fix feature extractor * correct logits for whisper small.en * nit * fix encoder_attention_mask * some fixes * remove unnecessary inputs * nits * add normalizer file * update test tokenization * fix attention mask not defined * fix generate * remove useless encoder attention mask * update test modeling whisper * update config to add second non-suppress tokens * nits on feature extractor * nit for test tokenizers * update tests * update tests * update tokenization test * fixup * invalidated hf token.
Clean convert openai to whisper * fix logit tests * fixup * Add model to README * Fix doc tests * clean merge * revert toc_tree changes * remove useless LogitProcessor * Update whisper.mdx * update config file doc * update configuration docstring * update test tokenization * update test tokenization * update tokenization whisper Added copied from where needed * update feature extraction * nit test name * style * quality * remove get suppress tokens and update non_speech tokens global variables * Update src/transformers/models/whisper/feature_extraction_whisper.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * clean modeling whisper and test Removed the attention mask arguments that are deprecated * fix large test * Add multilingual audio test, and translate test * style * fix larg multilingual test * nits * add copied from for attention layer * remove attention masks in doc * add english normalizer * Update docs/source/en/model_doc/whisper.mdx Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * update tokenization test * remove copied from in whisper attention : no bias in k_proj only * wrap around dependencies in english normalizer * style * correct import generation logits * for now, wrap feature extractor with torch * remove torch depencies for feature extraction and style * Update src/transformers/models/whisper/convert_openai_whisper_to_tfms.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/whisper/configuration_whisper.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/whisper.mdx Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * fixup * nit * update logits * style * nit * nits and fix final tests * add `is_more_itertools_available` to utils * quality * add begin suppress tokens, suppress tokens to generate args and config * clean SuppressTokensLogitsProcessor in generation logits * Nit naming * add SuppressTokensAtBegin * update tests, suppress tokens to None or correct values * nit and style * update RAG to fit test and generate_logit * add copy-pasted statement on english normalizer * add arguments to config_common_kwargs * Update src/transformers/generation_utils.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/generation_logits_process.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * revert changes based on reviews * update doc and nits * Update src/transformers/models/whisper/configuration_whisper.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * more nits * last nits * update test configuration common * add BART name in decoder attention mask documentation * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * style * nit * nit * add english.json file to git * nits on documentation * nit * nits * last styling * add main toctree file * remove sentence piece dependency * clean init file * fix tokenizer that has no dependencies on sentencepiece * update whisper init file, nit * remove english.json file * add get decoder prompt id * All weights loading * Remove hanging pdb * Fixup and tidy up * Use same copied from as PT model * Remove whitespace changes * Remove torch references * Tie embeddings * Remove logits processor input to generate * Update logit values * revert changes and add forced logit processor * nit * clean normalizer * remove protected * Add logit processors and update generation code & tests * Some tidy up * Update docstring * update * update based on review * Update src/transformers/models/whisper/configuration_whisper.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/whisper/configuration_whisper.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update to reflect changes on the PT model branch * Tidy up * Remove extra whitespace * Fix test - make input ids small enough we can append * Include upstream changes on main * PR comments - add batch tests, remove comments & defaults * Fix model output imports * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/generation_tf_logits_process.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update tests/models/whisper/test_modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update docstring example * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> * Remove changes to adjust_logits_during_generation function * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Tidy up imports that don't require TF * Update tests - skip and no more skip * Update tests/generation/test_generation_tf_logits_process.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> * Add training flags * Add (skipped) XLA generation tests * Add embedding correctness test * Add constant ids for generation tests * Make logits finding a bit tidier * Remove unused args * xla generation enabled * Don't skip XLA tests anymore * Fix tests - add position ids to expected signature and update rag generation * Undo method reorder * Remove added whitespace * Remove copy-paste gradient checkopint ref * Remove * Trigger CI - (issue with refs when pulling) Co-authored-by:
Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
NielsRogge <niels.rogge1@gmail.com> Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> Co-authored-by:
Joao Gante <joao@huggingface.co>
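The Whisper PR above repeatedly mentions "suppress tokens" handled by a dedicated logits processor. As a rough illustration of the idea (a hypothetical plain-Python sketch, not the actual `SuppressTokensLogitsProcessor` from `transformers`), suppressed token ids simply have their scores forced to negative infinity so they can never win decoding:

```python
import math

def suppress_tokens(logits, suppress_ids):
    """Return a copy of `logits` with the given token ids set to -inf,
    so greedy decoding or sampling can never select them."""
    out = list(logits)
    for idx in suppress_ids:
        out[idx] = -math.inf
    return out

scores = [0.5, 2.0, 1.5, -0.3]
filtered = suppress_tokens(scores, suppress_ids=[1, 3])
# token 1 had the highest raw score, but after suppression token 2 wins
best = max(range(len(filtered)), key=filtered.__getitem__)
```

In the real library the same masking is applied to a batch of score tensors at each generation step; the sketch above only shows the per-row rule.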
-
APAVOU Clément authored
* Add `OPTForQuestionAnswering` - added `OPTForQuestionAnswering` class based on `BloomForQuestionAnswering` - added `OPTForQuestionAnswering` in common tests - all common tests pass - make fixup done * added docstrings for OPTForQuestionAnswering * Fix docstrings for OPTForQuestionAnswering
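Extractive QA heads like `OPTForQuestionAnswering` emit start and end logits over the input tokens; the answer is the highest-scoring valid span. A minimal sketch of that span selection (illustrative pure Python with made-up numbers, not the library's implementation):

```python
def best_span(start_logits, end_logits, max_len=15):
    """Pick the (start, end) pair maximizing start_logits[s] + end_logits[e]
    subject to s <= e < s + max_len."""
    best, best_score = (0, 0), float("-inf")
    for s, s_score in enumerate(start_logits):
        for e in range(s, min(s + max_len, len(end_logits))):
            score = s_score + end_logits[e]
            if score > best_score:
                best, best_score = (s, e), score
    return best

start = [0.1, 3.0, 0.2, 0.1]
end = [0.0, 0.5, 2.5, 0.3]
span = best_span(start, end)  # → (1, 2)
```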
-
Aritra Roy Gosthipaty authored
The sequence_masked variable is actually the part of the sequence that is kept unmasked for the encoder. This commit renames the variable.
-
Ryan Chan authored
* Remove dependency of Roberta in Blenderbot * Move Copied from statements to each method of the Roberta classes * Remove copied from line for mask_token.setter * update output from example in docs
-
Mohit Sharma authored
* Add onnx support for VisionEncoderDecoder * Add onnx support for VisionEncoderDecoder * Removed unused import * Rename encoder hidden state Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com> * Update docstrings and removed redundant code * Added test function for enc-dec models * Update doc string text Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com> * fixed code style Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com>
-
Lysandre Debut authored
* Leverage hfh for move cache * Style
-
wei zhao authored
Fix link typos for the following entries: "PyTorch version, Trainer" and "PyTorch version, no Trainer".
-
Rak Alexey authored
* fix MarianMT conversion to ONNX * Update src/transformers/onnx/convert.py Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com> * Update src/transformers/onnx/convert.py Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com> Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com>
-
Darío Hereñú authored
* Fixed duplicated line (paragraph #83) @omarespejel @sgugger * Fixed Datasets map naming (paragraph 42)
-
Darío Hereñú authored
-
Druhin Abrol authored
* remove RobertaConfig inheritance from MarkupLMConfig * Update src/transformers/models/markuplm/configuration_markuplm.py fixed typo in docstring Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Matt authored
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 08 Oct, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 07 Oct, 2022 22 commits
-
-
Sylvain Gugger authored
* Rework pipeline tests * Try to fix Flax tests * Try to put it before * Use a new decorator instead * Remove ignore marker since it doesn't work * Filter pipeline tests * Woopsie * Use the filtered list * Clean up and fake modif * Remove init * Revert fake modif
-
Alara Dirik authored
- Fixes the image segmentation pipeline test failures caused by changes to the postprocessing methods of supported models - Updates the ImageSegmentationPipeline tests - Improves docs, adds 'task' argument to optionally perform semantic, instance or panoptic segmentation
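The optional `task` argument described above selects which post-processing the pipeline runs. A hypothetical dispatch sketch (simplified names and output shapes, not the actual `ImageSegmentationPipeline` code):

```python
def postprocess(outputs, task="panoptic"):
    """Dispatch model outputs to a task-specific post-processing step:
    semantic returns one segmentation map, instance/panoptic return
    one dict per detected segment."""
    handlers = {
        "semantic": lambda o: {"segmentation": o["mask"]},
        "instance": lambda o: [{"mask": o["mask"], "score": o["score"]}],
        "panoptic": lambda o: [{"mask": o["mask"], "label": o["label"]}],
    }
    if task not in handlers:
        raise ValueError(f"unknown segmentation task {task!r}")
    return handlers[task](outputs)

result = postprocess({"mask": "m", "score": 0.9, "label": "cat"}, task="instance")
```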
-
Vishwas authored
* Copied all the code required from transformers.models.bert.modeling_bert to here * Fixed styling issues * Reformatted copied names with Model specific name. * Reverted BertEncoder part as there is already a class called BertGenerationEncoder * Added prefixes in missing places. Co-authored-by:vishwaspai <vishwas.pai@emplay.net>
-
mustapha ajeghrir authored
* camembert tf version independent * fixup * fixup, all working * remove comments * Adding copied from roberta Co-authored-by:Mustapha AJEGHRIR <mustapha.ajeghrir@kleegroup.com>
-
Blip blop authored
* Copied from BertTokenizer() in tokenization_bert * Added BasicTokenizer and WordPieceTokenizer classes * Update src/transformers/models/electra/tokenization_electra.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Added copied from comments for BasicTokenizer and WordPieceTokenizer * Updated the comments for the tokenizer classes * Update src/transformers/models/electra/tokenization_electra.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/electra/tokenization_electra.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Formatted tokenization_electra with `make style` * Fix repo inconsistencies * Update src/transformers/models/electra/tokenization_electra.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Set the logger Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Infrared1029 authored
* removed dependency from bart(slow) * removed dependency from bart(slow) * adding copying comments (copied from bart to led) * updated led docstring * updated led docstring * removed dependency from Bart (fast) * replaced bart with LED in docstrings * comply with flake8 * added more copy comments * fixing copying comments * added comments back * fix copy comments * fixing copied from comments * fixing copied from comments
-
Patrick von Platen authored
* add first generation tutorial * uP * [Clip] Add text model to device map
-
harry7337 authored
Co-authored-by:harry7337 <hari.8jan@gmail.com>
-
Ryan Chan authored
* Remove dependency of Bert from Squeezebert tokenizer * run style corrections * update copies from BertTokenizers * Update changes and style to Squeezebert files * update copies for bert-fast
-
Arthur authored
* update feature extractor params * update attention mask handling
-
Dean Wyatte authored
* validate onnx models with a different input geometry than saved with * only test working features for now * simpler test skipping * rm TODO * expose batch_size/seq_length on vit * skip certain name, feature, framework parameterizations known to fail validation * Trigger CI * Trigger CI
-
David Yang authored
-
ddobokki authored
* edit: casting attention_mask to long in DataCollatorCTCWithPadding * edit: casting attention_mask to long in DataCollatorCTCWithPadding
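The fix above casts the float attention mask produced during padding to an integer ("long") dtype before it reaches the model. A minimal pure-Python analogue of that collator step (the function name is hypothetical; in the real collator the cast is a tensor dtype conversion such as `attention_mask.to(torch.long)`):

```python
def collate_attention_mask(masks, max_len):
    """Pad per-sample float masks to max_len, then cast entries to int.

    Plain floats/ints stand in for tensor dtypes here: padding positions
    get 0, real positions keep 1, and every entry ends up an integer."""
    batch = []
    for m in masks:
        padded = list(m) + [0.0] * (max_len - len(m))
        batch.append([int(v) for v in padded])  # the fix: float -> long
    return batch

mask = collate_attention_mask([[1.0, 1.0, 1.0], [1.0, 1.0]], max_len=4)
```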
-
Amrit Sahu authored
* Add ZeroShotObjectDetectionPipeline (#18445) * Add AutoModelForZeroShotObjectDetection task This commit also adds the following - Add explicit _processor method for ZeroShotObjectDetectionPipeline. This is necessary as pipelines don't auto-infer processors yet and `OwlVitProcessor` wraps tokenizer and feature_extractor together, to process multiple images at once - Add auto tests and other tests for ZeroShotObjectDetectionPipeline * Add batching for ZeroShotObjectDetectionPipeline * Fix docstring of ZeroShotObjectDetectionPipeline * Fix output format: ZeroShotObjectDetectionPipeline
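The output-format fix mentioned above shapes detections into score/label/box records. Roughly, a zero-shot detection pipeline ends by thresholding per-box scores for the candidate labels and sorting the survivors (a hedged sketch with invented data, not the pipeline's actual code):

```python
def filter_detections(boxes, scores, labels, threshold=0.1):
    """Keep detections scoring at or above threshold, sorted by
    descending score, as a list of score/label/box dicts."""
    keep = [
        {"score": s, "label": l, "box": b}
        for b, s, l in zip(boxes, scores, labels)
        if s >= threshold
    ]
    return sorted(keep, key=lambda d: d["score"], reverse=True)

dets = filter_detections(
    boxes=[(0, 0, 10, 10), (5, 5, 20, 20), (1, 1, 2, 2)],
    scores=[0.9, 0.05, 0.4],
    labels=["cat", "dog", "remote"],
)
```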
-
Omar Sanseviero authored
-
Sourab Mangrulkar authored
* HF <-> megatron checkpoint conversion handling reshaping from different tensor and parallel sizes * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * addressing comments * add doc strings and
🐛 fixes Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
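Converting between HF and Megatron checkpoints at different tensor-parallel sizes amounts to merging per-rank weight shards and re-splitting them for the target layout. A simplified pure-Python sketch of that reshaping for a row-sharded weight (hypothetical helper names; real conversion scripts operate on tensors and also handle column-parallel and interleaved layouts):

```python
def split_tp(weight_rows, tp_size):
    """Split a row-sharded weight's rows evenly across tp_size ranks."""
    n = len(weight_rows)
    assert n % tp_size == 0, "rows must divide evenly across ranks"
    chunk = n // tp_size
    return [weight_rows[i * chunk:(i + 1) * chunk] for i in range(tp_size)]

def merge_tp(shards):
    """Inverse: concatenate per-rank shards back into one full weight."""
    return [row for shard in shards for row in shard]

w = [[1, 2], [3, 4], [5, 6], [7, 8]]
shards = split_tp(w, tp_size=2)
# resharding to a different TP size = merge, then split again
resharded = split_tp(merge_tp(shards), tp_size=4)
```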
-
Thomas authored
* Added type hints for TF: TransfoXL * Added type hints for TF: TransfoXL * Change type hints for training * Change type hints for training
-
h authored
-
Bibhabasu Mohapatra authored
* swin transformer onnx support * Updated image dimensions as dynamic Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com> Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com>
-
IMvision12 authored
* Update modeling_tf_xlm.py * Updates * Update src/transformers/models/xlm/modeling_tf_xlm.py * Update src/transformers/models/xlm/modeling_tf_xlm.py * Update src/transformers/models/xlm/modeling_tf_xlm.py * Update src/transformers/models/xlm/modeling_tf_xlm.py * Update src/transformers/models/xlm/modeling_tf_xlm.py Co-authored-by:Matt <Rocketknight1@users.noreply.github.com>
-
Zachary Mueller authored
-
IMvision12 authored
* ConvBert * added comment * Updated * Final_updates * Update tokenization_convbert.py * Update tokenization_convbert_fast.py * Update tokenization_convbert.py * Update tokenization_convbert.py * Update tokenization_convbert_fast.py * Update tokenization_convbert.py * Update tokenization_convbert_fast.py * Updates * Updates * Updated * Final Updates
-
- 06 Oct, 2022 2 commits
-
-
Alara Dirik authored
-
Ilaygoldman authored
The link to https://github.com/vasudevgupta7/bigbird is vulnerable to repojacking (it redirects to the original project, which has since changed its name); the link should be updated to the project's current name. If it is not, an attacker could register the abandoned repository name and serve malicious content to users who trust the link.
-