- 11 Oct, 2022 20 commits
-
-
Quancore authored
* Added tokenize keyword arguments to the feature extraction pipeline * Reverted the truncation parameter * Moved the numpy import to the top
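As an illustration of the keyword this commit adds, a minimal sketch of passing tokenizer arguments through the feature-extraction pipeline; the checkpoint name and the exact `tokenize_kwargs` values are placeholder assumptions, not part of the commit:

```python
from transformers import pipeline

# Tokenizer arguments are forwarded through `tokenize_kwargs` instead of
# being hard-coded in the pipeline. Checkpoint chosen arbitrarily.
extractor = pipeline(
    "feature-extraction",
    model="distilbert-base-uncased",
    tokenize_kwargs={"truncation": True, "max_length": 128},
)

features = extractor("A sentence whose token sequence may need truncating.")
print(len(features[0]))  # number of tokens after truncation
```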
-
David Yang authored
* Make cpm tokenization independent of xlnet * Make bert japanese tokenization independent of bert
-
Joao Gante authored
🚨 🚨 🚨 TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization updated for encoder-decoder models) (#19263) * added test * correct embedding init * some changes in blenderbot (incomplete) * update blenderbot (diff to be used as reference) * update blenderbot_small * update LED * update marian * update T5 and remove TFWrappedEmbeddings * nullcontext() -> ContextManagers() * fix embedding init
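For context on the `nullcontext() -> ContextManagers()` bullet, a self-contained sketch of the `ContextManagers` pattern (the real helper lives in `transformers.utils`); this is an illustration, not the library's exact code:

```python
from contextlib import ExitStack

class ContextManagers:
    """Sketch of the helper: enter a (possibly empty) list of context
    managers as one combined context, replacing ad-hoc nullcontext() use."""

    def __init__(self, context_managers):
        self.context_managers = context_managers
        self.stack = ExitStack()

    def __enter__(self):
        for context_manager in self.context_managers:
            self.stack.enter_context(context_manager)
        return self

    def __exit__(self, *exc_info):
        return self.stack.__exit__(*exc_info)
```
-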
amyeroberts authored
-
Younes Belkada authored
* fix device mismatch * make fixup * added slow tests - added slow tests on `bnb` models to make sure generate works correctly * replace with `self.device` * revert force device assign * Update src/transformers/generation_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * set the warning in `generate` instead of `sample` Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
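A hedged sketch of the device-placement pattern the fix above enforces: resolve the device from the model itself rather than assuming `cuda:0`. The checkpoint and the 8-bit flags (which require `bitsandbytes` and a GPU) are illustrative assumptions:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-350m")
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m", device_map="auto", load_in_8bit=True
)

# Move inputs to wherever the (possibly sharded) model actually lives.
inputs = tokenizer("Hello, my name is", return_tensors="pt")
inputs = {name: tensor.to(model.device) for name, tensor in inputs.items()}

with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```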
-
Ankur Goyal authored
* Implement multiple span support * Address comments * Add tests + fix bugs
-
h authored
* decouples xlm_prophet from prophet and adds copy patterns that pass the copy check * adds copy patterns to copied docstrings too * restores autodoc for XLMProphetNetModel * removes all-casing in a bunch of places to ensure that the model is compatible with all checkpoints on the hub * adds missing model to main init * adds autodocs to make document checker happy * adds missing pretrained model import * adds missing pretrained model import to main init * adds XLMProphetNetPreTrainedModel to the dummy pt objects * removes examples from the source-doc file since docstrings contain them already * adds a missing new line to make check_repo happy
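For readers unfamiliar with the copy patterns this commit relies on, a schematic example of the `# Copied from` marker that the repo's copy check enforces; the target class shown is illustrative:

```python
# The copy check (utils/check_copies.py, run via `make repo-consistency`)
# keeps a marked class or method identical to its source, applying the
# rename given after `with`. The real markers in this PR point at the
# ProphetNet modules.

# Copied from transformers.models.prophetnet.modeling_prophetnet.ProphetNetEncoder with ProphetNet->XLMProphetNet
class XLMProphetNetEncoder:
    ...
```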
-
Yih-Dar authored
* cast positions dtype in XGLMModel * Get the correct dtype at init time * Get the correct dtype at init time Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sofia Oliveira authored
* remove config dependence * remove dependencies from xlm_roberta * Fix style * Fix comments * various fixes * Fix pre-trained model name
-
Arnaud Stiegler authored
* fixing tokenizer * adding all missing classes * fast tokenizer | fixing format * revert to full class copy flag * fixing different casing
-
Joao Gante authored
* correct embedding init
-
lewtun authored
* [Swin] Replace hard-coded batch size to enable dynamic ONNX export
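A minimal sketch of the general fix, not the Swin code itself: avoid baking a tracing-time batch size into the graph and declare the batch axis dynamic at export. The toy module is an assumption for illustration:

```python
import torch

BATCH_SIZE = 2  # tracing-time batch size

class Head(torch.nn.Module):
    def forward(self, x):
        # A hard-coded batch size, e.g. `x.reshape(BATCH_SIZE, -1)`, would be
        # frozen into the exported graph and break inputs of any other batch
        # size; shape-agnostic ops keep the exported model dynamic.
        return torch.flatten(x, start_dim=1)

model = Head().eval()
dummy = torch.randn(BATCH_SIZE, 3, 4)
torch.onnx.export(
    model, dummy, "head.onnx",
    input_names=["input"], output_names=["output"],
    dynamic_axes={"input": {0: "batch"}, "output": {0: "batch"}},
)
```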
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* Fix some doctests Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* Fix TFGroupViT CI Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Darío Hereñú authored
-
Guillem Orellana Trullols authored
* Extend `nested_XXX` functions to mappings/dicts. * Update src/transformers/trainer_pt_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer_pt_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer_pt_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Style updated file Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
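A minimal sketch of how such a `nested_XXX` helper can recurse through mappings as well as lists and tuples; illustrative code, not the exact `trainer_pt_utils` implementation:

```python
from collections.abc import Mapping

import torch

def nested_detach(tensors):
    """Detach every tensor in an arbitrarily nested structure, now
    including dict-like containers alongside lists and tuples."""
    if isinstance(tensors, (list, tuple)):
        return type(tensors)(nested_detach(t) for t in tensors)
    if isinstance(tensors, Mapping):
        return type(tensors)({k: nested_detach(v) for k, v in tensors.items()})
    return tensors.detach()

batch = {"logits": torch.ones(2, requires_grad=True),
         "aux": [torch.zeros(1, requires_grad=True)]}
detached = nested_detach(batch)
print(detached["logits"].requires_grad)  # False
```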
-
Arthur authored
* update feature extractor params * update attention mask handling * fix doc and pipeline test * add warning when skipping test * add whisper translation and transcription test * fix build doc test
-
Dimitre Oliveira authored
* Custom TF signature draft * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Adding tf signature tests * Fixing signature check and adding asserts * fixing model load path * Adjusting signature tests * Formatting file Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by:
Dimitre Oliveira <dimitreoliveira@Dimitres-MacBook-Air.local>
-
- 10 Oct, 2022 20 commits
-
-
Lysandre authored
-
Partho authored
-
Partho authored
-
Partho authored
-
Partho authored
-
Partho authored
-
Partho authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Mikail Duzenli authored
* Fixed a non-working hyperlink in the README.md file. The hyperlink to the community notebooks was outdated. * Fixed a missing double slash in the hyperlink
-
Bartosz Szmelczynski authored
-
Shivang Mishra authored
-
amyeroberts authored
The momentum values for PyTorch and TensorFlow batch normalization layers are not equivalent. The TensorFlow value should be (1 - pytorch_momentum) to ensure the correct updates are applied to the running mean and running variance. We wouldn't observe a difference when loading a pretrained model and performing inference, but evaluation outputs would change after some training steps.
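A small numeric sketch of the relationship described above, assuming the standard update rules of both frameworks:

```python
# PyTorch's BatchNorm updates its running statistics as
#     running = (1 - momentum) * running + momentum * batch_stat
# while Keras BatchNormalization uses
#     running = momentum * running + (1 - momentum) * batch_stat
# so a PyTorch momentum of 0.1 corresponds to a TensorFlow momentum of 0.9.

pt_momentum = 0.1
tf_momentum = 1.0 - pt_momentum

running, batch_stat = 0.0, 1.0
pt_update = (1 - pt_momentum) * running + pt_momentum * batch_stat
tf_update = tf_momentum * running + (1 - tf_momentum) * batch_stat
assert pt_update == tf_update  # identical only with the converted momentum
```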
-
Stefano Bosisio authored
* fix conflicts * start translating * proofread * add toc * fix errors and typos
-
Kaiyu Yang authored
-
amyeroberts authored
* simplify loop * add feature extractor * add model * start conversion * add dropout * initial commit of test files * conversion for all models * update processor for correct padding * update feature extraction * update integration test logits match * fmt: off for the logits * on the fly mel bank * small nit * update test * update tokenizer * nit feature extraction * update * update tokenizer test * adds logit processor and update tokenizer to get suppress tokens * style * clean convert * revert to original modeling tf utils * Update * update * nit * clean convert file * update tests and nits * quality * slow generation test * ffn_dim to allow customization * update readme * add to toctree * start fixing integration tests * update tests and code * fix feature extractor * fix config tests common * update code to fix tests * fix feature extractor * nit feature extraction * update test for new feature extractor * style * add abstract * large logits with custom decoder input ids * wrap around is_torch_available * fix feature extractor * correct logits for whisper small.en * nit * fix encoder_attention_mask * some fixes * remove unnecessary inputs * nits * add normalizer file * update test tokenization * fix attention mask not defined * fix generate * remove useless encoder attention mask * update test modeling whisper * update config to add second non-suppress tokens * nits on feature extractor * nit for test tokenizers * update tests * update tests * update tokenization test * fixup * invalidated hf token. Clean convert openai to whisper * fix logit tests * fixup * Add model to README * Fix doc tests * clean merge * revert toc_tree changes * remove useless LogitProcessor * Update whisper.mdx * update config file doc * update configuration docstring * update test tokenization * update test tokenization * update tokenization whisper Added copied from where needed * update feature extraction * nit test name * style * quality * remove get suppress tokens and update non_speech tokens global variables * Update src/transformers/models/whisper/feature_extraction_whisper.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * clean modeling whisper and test. Removed the deprecated attention mask arguments * fix large test * Add multilingual audio test, and translate test * style * fix large multilingual test * nits * add copied from for attention layer * remove attention masks in doc * add english normalizer * Update docs/source/en/model_doc/whisper.mdx Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * update tokenization test * remove copied from in whisper attention: no bias in k_proj only * wrap around dependencies in english normalizer * style * correct import generation logits * for now, wrap feature extractor with torch * remove torch dependencies for feature extraction and style * Update src/transformers/models/whisper/convert_openai_whisper_to_tfms.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/whisper/configuration_whisper.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/whisper.mdx Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * fixup * nit * update logits * style * nit * nits and fix final tests * add `is_more_itertools_available` to utils * quality * add begin suppress tokens, suppress tokens to generate args and config * clean SuppressTokensLogitsProcessor in generation logits * Nit naming * add SuppressTokensAtBegin * update tests, suppress tokens to None or correct values * nit and style * update RAG to fit test and generate_logit * add copy-pasted statement on english normalizer * add arguments to config_common_kwargs * Update src/transformers/generation_utils.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/generation_logits_process.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * revert changes based on reviews * update doc and nits * Update src/transformers/models/whisper/configuration_whisper.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * more nits * last nits * update test configuration common * add BART name in decoder attention mask documentation * Update src/transformers/models/whisper/modeling_whisper.py Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * style * nit * nit * add english.json file to git * nits on documentation * nit * nits * last styling * add main toctree file * remove sentence piece dependency * clean init file * fix tokenizer that has no dependencies on sentencepiece * update whisper init file, nit * remove english.json file * add get decoder prompt id * All weights loading * Remove hanging pdb * Fixup and tidy up * Use same copied from as PT model * Remove whitespace changes * Remove torch references * Tie embeddings * Remove logits processor input to generate * Update logit values * revert changes and add forced logit processor * nit * clean normalizer * remove protected * Add logit processors and update generation code & tests * Some tidy up * Update docstring * update * update based on review * Update src/transformers/models/whisper/configuration_whisper.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/whisper/configuration_whisper.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update to reflect changes on the PT model branch * Tidy up * Remove extra whitespace * Fix test - make input ids small enough we can append * Include upstream changes on main * PR comments - add batch tests, remove comments & defaults * Fix model output imports * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/generation_tf_logits_process.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update tests/models/whisper/test_modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update docstring example * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> * Remove changes to adjust_logits_during_generation function * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Tidy up imports that don't require TF * Update tests - skip and no more skip * Update tests/generation/test_generation_tf_logits_process.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/whisper/modeling_tf_whisper.py * Update src/transformers/models/whisper/modeling_tf_whisper.py Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> * Add training flags * Add (skipped) XLA generation tests * Add embedding correctness test * Add constant ids for generation tests * Make logits finding a bit tidier * Remove unused args * xla generation enabled * Don't skip XLA tests anymore * Fix tests - add position ids to expected signature and update rag generation * Undo method reorder * Remove added whitespace * Remove copy-paste gradient checkpoint ref * Remove * Trigger CI - (issue with refs when pulling) Co-authored-by:
Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
NielsRogge <niels.rogge1@gmail.com> Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> Co-authored-by:
Joao Gante <joao@huggingface.co>
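A hedged usage sketch for the Whisper model this commit adds; `openai/whisper-tiny` is one of the released checkpoints, and the silent clip below is a stand-in for real 16 kHz audio:

```python
import numpy as np
from transformers import WhisperForConditionalGeneration, WhisperProcessor

processor = WhisperProcessor.from_pretrained("openai/whisper-tiny")
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-tiny")

audio = np.zeros(16000, dtype=np.float32)  # one second of silence
inputs = processor(audio, sampling_rate=16000, return_tensors="pt")

# `get_decoder_prompt_ids` (the "add get decoder prompt id" bullet above)
# pins the language/task tokens instead of letting the model detect them.
prompt_ids = processor.get_decoder_prompt_ids(language="english", task="transcribe")
ids = model.generate(inputs.input_features, forced_decoder_ids=prompt_ids)
print(processor.batch_decode(ids, skip_special_tokens=True)[0])
```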
-
APAVOU Clément authored
* Add `OPTForQuestionAnswering` - added `OPTForQuestionAnswering` class based on `BloomForQuestionAnswering` - added `OPTForQuestionAnswering` in common tests - all common tests pass - make fixup done * added docstrings for OPTForQuestionAnswering * Fix docstrings for OPTForQuestionAnswering
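A hedged usage sketch of the new class. `facebook/opt-350m` carries no trained QA head, so the head below is randomly initialized and the extracted span is meaningless; only the extractive-QA plumbing is shown:

```python
import torch
from transformers import AutoTokenizer, OPTForQuestionAnswering

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-350m")
model = OPTForQuestionAnswering.from_pretrained("facebook/opt-350m")

question = "Who wrote the library?"
context = "The transformers library is written by Hugging Face."
inputs = tokenizer(question, context, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# Pick the most likely start/end positions and decode the span between them.
start = outputs.start_logits.argmax(-1).item()
end = outputs.end_logits.argmax(-1).item()
print(tokenizer.decode(inputs.input_ids[0, start : end + 1]))
```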
-
Aritra Roy Gosthipaty authored
The `sequence_masked` variable actually holds the part of the sequence that is kept unmasked for the encoder. This commit renames the variable accordingly.
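A minimal sketch of MAE-style random masking, assuming the usual shuffle-and-gather formulation, to show which tensor the rename is about:

```python
import torch

def random_masking(sequence, mask_ratio=0.75):
    """Shuffle patch indices, keep the first (1 - mask_ratio) of them for
    the encoder, and return a binary mask over the full sequence (1 = removed)."""
    batch, length, dim = sequence.shape
    len_keep = int(length * (1 - mask_ratio))

    noise = torch.rand(batch, length)          # one score per patch
    ids_shuffle = torch.argsort(noise, dim=1)  # low scores are kept
    ids_keep = ids_shuffle[:, :len_keep]

    # This is the tensor the commit renames: the *unmasked* patches that
    # the encoder actually sees, not the masked-out ones.
    sequence_unmasked = torch.gather(
        sequence, 1, ids_keep.unsqueeze(-1).expand(-1, -1, dim)
    )

    mask = torch.ones(batch, length)
    mask[:, :len_keep] = 0
    mask = torch.gather(mask, 1, torch.argsort(ids_shuffle, dim=1))
    return sequence_unmasked, mask

tokens = torch.randn(1, 16, 8)           # (batch, patches, hidden)
kept, mask = random_masking(tokens)
print(kept.shape, int(mask.sum()))       # torch.Size([1, 4, 8]) 12
```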
-
Ryan Chan authored
* Remove dependency of Roberta in Blenderbot * Move Copied from statements to each method of the Roberta classes * Remove copied from line for mask_token.setter * update output from example in docs
-
Mohit Sharma authored
* Add onnx support for VisionEncoderDecoder * Add onnx support for VisionEncoderDecoder * Removed unused import * Rename encoder hidden state Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com> * Update docstrings and removed redundant code * Added test function for enc-dec models * Update doc string text Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com> * fixed code style Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com>
-
Lysandre Debut authored
* Leverage hfh for move cache * Style
-