Commits · 862888a35834527fed61beaf42373423ffdbd216 · chenpangpang / transformers

19 Jan, 2023 19 commits

Add disclaimer for necessary fake models (#21178) · 862888a3
Sylvain Gugger authored Jan 19, 2023
```
* Add disclaimer for necessary fake models

* Address review comments

* Use for GPT-NeoX as well
```
862888a3

Graphormer model for Graph Classification (#20968) · 87208a05

Clémentine Fourrier authored Jan 19, 2023



* [FT] First commit for graphormer architecture.

The model has no tokenizer, as it uses a collator and preprocessing function for its input management.
Architecture to be tested against original one.
The arch might need to be changed to fit the checkpoint, but a revert to the original arch will make the code less nice to read.
TODO: doc

* [FIX] removed test model

* [FIX] import error

* [FIX] black and flake

* [DOC] added paper refs

* [FIX] [DOC]

* [FIX] black

* [DOC] Updated READMEs

* [FIX] Order of imports + rm Tokenizer calls

* [FIX] Moved assert in class to prevent doc build failure

* [FIX] make fix-copies

* [Doc] update from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* [FIX] Removed Graphormer from Sequence classification model list

* [DOC] Added HF copyright to Cython file

* [DOC] Fixed comments

* [FIX] typos in class doc + removed config classes.

Todo: update doc from paper definitions

* [FIX] Removed dependency to fairseq, and replaced all asserts with Exception management

* [FIX] Homogeneized initialization of weights to pretrained constructor

* [FIX] [CP] Updated multi_hop parameter to get same results as in original implementation

* [DOC] Relevant parameter description in the configuration file

* [DOC] Updated doc and comments in main graphormer file

* [FIX] make style and quality checks

* [DOC] Fix doc format

* [FIX] [WIP] Updated part of the tests, though still a wip

* [FIX] [WIP]

* [FIX] repo consistency

* [FIX] Changed input names for more understandability

* [FIX] [BUG] updated num_classes params for propagation in the model

* simplified collator

* [FIX] Updated tests to follow new naming pattern

* [TESTS] Updated test suite along with model

* |FIX] rm tokenizer import

* [DOC] add link to graphormerdoc

* Changed section in doc from text model to graph model

* Apply suggestions from code review

Spacing, inits
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* [DOC] Explain algos_graphormer functions

* Cython soft import protection

* Rm call to Callable in configuration graphormer

* [FIX] replaced asserts with Exceptions

* Add org to graphormer checkpoints

* Prefixed classes with Graphormer

* Management of init functions

* format

* fixes

* fix length file

* update indent

* relaunching ci

* Errors for missing cython imports

* fix style

* fix style doc
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

87208a05

revert Copyright 2023 · 758bd39e
ydshieh authored Jan 19, 2023

758bd39e

Add Japanese translation index.mdx (#21186) · 705e332b

Kambe Hiroyuki authored Jan 20, 2023

* Add Japanese translation index.mdx

* Fix the year of the license

* Change the models list to Japanese

705e332b

Flax dtype-dependent numerical masking (#21197) · cbaaa2f6
Joao Gante authored Jan 19, 2023

cbaaa2f6
[`CVT`] Fix module initialization issue (#21193) · 0b86e330
Younes Belkada authored Jan 19, 2023
```
fix cvt init
```
0b86e330

Add hallucination filter (#18675) · b9403e95

Karim Foda authored Jan 19, 2023



* Add hallucination penalty

* Make quality changes

* Inverse penalty

* Fix imports & quality

* Fix name spelling issue

* set encoder_repetition_penalty and fix quality

* Fix failing test

* Add to config_common_kwargs

* Fix modelling_rag error

* Update src/transformers/generation_logits_process.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Remove breakpoint

* Make style fixes

* Update encoder_repetition_penalty default value

* Merge latest main changes

* Make fixup changes

* Add EncoderRepetitionPenaltyLogitsProcessor to generation/__init__.py

* Fix repo-inconsistency

* Remove venv

* Remove tensorflow-macos & add tests

* Add documentation

* Fix quality issues

* move encoder_repetition_penalty to config

* Update src/transformers/configuration_utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/configuration_utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Remove encoder_repetition_penalty from tests

* Fix type error

* Fix format error
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

b9403e95

[Whisper] Fix timestamp processor (#21187) · e9b4800d

Arthur authored Jan 19, 2023



* add draft logit processor

* add template functions

* update timesapmt processor parameters

* draft script

* simplify code

* cleanup

* fixup and clean

* update pipeline

* style

* clean up previous idea

* add tokenization utils

* update tokenizer and asr output

* fit whisper type

* style and update test

* clean test

* style test

* update tests

* update error test

* udpate code (not based on review yet)

* update tokenization

* update asr pipeline

* update code

* cleanup and update test

* fmt

* remove text verificatino

* cleanup

* cleanup

* add model test

* update tests

* update code add docstring

* update code and add docstring

* fix pipeline tests

* add draft logit processor

add template functions

update timesapmt processor parameters

draft script

simplify code

cleanup

fixup and clean

update pipeline

style

clean up previous idea

add tokenization utils

update tokenizer and asr output

fit whisper type

style and update test

clean test

style test

update tests

update error test

udpate code (not based on review yet)

update tokenization

update asr pipeline

update code

cleanup and update test

fmt

remove text verificatino

cleanup

cleanup

add model test

update tests

update code add docstring

update code and add docstring

fix pipeline tests

* Small update.

* Fixup.

* Tmp.

* More support.

* Making `forced_decoder_ids` non mandatory for users to set.

* update and fix first bug

* properly process sequence right after merge if last

* tofo

* allow list inputs + compute begin index better

* start adding tests

* add the 3 edge cases

* style

* format sequences

* fixup

* update

* update

* style

* test passes, edge cases should be good

* update last value

* remove Trie

* update tests and expec ted values

* handle bigger chunk_length

* clean tests a bit

* refactor chunk iter and clean pipeline

* update tests

* style

* refactor chunk iter and clean pipeline

* upade

* resolve comments

* Apply suggestions from code review
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* take stride right into account

* update test expected values

* Update code based on review
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

* major refactor

* add correct strides for tests

* Update src/transformers/pipelines/automatic_speech_recognition.py

* fix whisper timestamp test
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

e9b4800d

hertz is already per second (#21188) · 9b42c68f
Matthijs Hollemans authored Jan 19, 2023

9b42c68f
Update examples with image processors (#21155) · 4bc18e7a
amyeroberts authored Jan 19, 2023
```
* Update examples to use image processors

* Small fixes

* Resolve conflicts
```
4bc18e7a
Rename GLPN image processor tests (#21194) · fc8a9350
amyeroberts authored Jan 19, 2023

fc8a9350

Updates to computer vision section of the Preprocess doc (#21181) · 0359e2e1

Maria Khalusova authored Jan 19, 2023



* Extended the CV preprocessing section with more details and refactored the example

* added padding to the CV section, though it is a special case

* Added a tip about post processing methods

* make style

* link update

* Apply suggestions from review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* review feedback
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

0359e2e1

Fix device issue in `UperNetModelIntegrationTest` (#21192) · 5761ceb3
Yih-Dar authored Jan 19, 2023
```
fix device
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
5761ceb3
Trigger CI · 35920c97
Sylvain Gugger authored Jan 19, 2023

35920c97
workaround documentation rendering bug (#21189) · 9b468a7c
Matthijs Hollemans authored Jan 19, 2023

9b468a7c
Update year 2020 to 2023 in one file (#21190) · 464c86ac
Yih-Dar authored Jan 19, 2023
```
* update year
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
464c86ac
Fix `Mask2FormerForUniversalSegmentation` (#21175) · 1d33f55c
Yih-Dar authored Jan 19, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
1d33f55c

Add OneFormer Model (#20577) · 5b949623

Jitesh Jain authored Jan 19, 2023

* Add Oneformer Model

* Add OneFormer Tests

* Add UNIVERSAL_SEGMENTATION_MAPPING

* Fix config

* 🐛 Fix error encountered while writing tests

* 🔨 Fix instance segmentation post processing

* Format Files and Add Documentation

* Add Documentation mdx file

* Run make fixup

* Run make fix-copies

* Remove unnecessary code

* Format modeling_oneformer.py

* Add OneFormer to ImageSegmentationPipeline

* Format files

* Add Demo link to Readme

* Fix fomatting errors

* Fix test failures

* Update Table in index.mdx

* Fix version

* Fix style

* Remove OneFormer from TF

* Fix Imports

* Fix dummy objects

* Fix tests

* Add newline

* Remove OneFormerFeatureExtractor

* Remove CUDA Kernels

* Use AutoBackbone for Swin

* Fix description

* Use Image Processor

* Fix copies

* Fix formatting

* Fix import order

* Fix flake8 errors

* Fix doc errors

* Add Hindi Readme entry

* Update supported backbones

* Update supported backbones

* Undo Changes

* Fix type of config

* Fix isort

* Fix auto.mdx

* Fix swin config

* Replace DinatBackbone with AutoBackbone

* Use SwinBackbone

* Use SwinBackbone

* Fix conversion script

* Fix arguments

* Add argument description

* Fix style

* Add OneFormerProcessor

* Fix OneFormerProcessor Tests

* Fix mapping

* Fix imports

* Fix inits

* Fix style

* Fix comment

* Fix docstring

* Move OneFormer to MultiModal

* Fix Copies

* Remove size divisor

* Fix check_repo.py

* Fix copies

* Add Processor for Testing Pipeline

* Fix padding for tokens

* Fix variables

* Fix formatting with correct black version

* Add Image Processor Test

* Apply suggestions

* Revert common modeling

* Add check for task

* Fix conversion script

* Fix initialization order

* Fix tests

* Undo Pipeline Changes

* Fix layers in MLP

* Fix copies

* Update image paths

* Fix copies

* Apply suggestions

5b949623

[issues template] update deepspeed owners (#21027) · 6d676643

Stas Bekman authored Jan 18, 2023

* [issues template] update deepspeed owners

add the right contact for deepspeed@accelerate

* pr-template

6d676643

18 Jan, 2023 16 commits

Rewrite a couple of lines in the TF XLA doc (#21177) · 00ba7cad

Matt authored Jan 18, 2023

* Rewrite a couple of lines in the TF XLA doc to explain that jit_compile can be used in model.compile() too

* Remove extra )

00ba7cad

Add AWS Neuron torchrun support (#20806) · c59d71b2

jeffhataws authored Jan 18, 2023

* Add XLA torchrun support

* Clarify that currently DDP doesn't work with torch.distributed XLA backend yet

* Enable DDP with torchrun and XLA (now available in PT-XLA 1.13)

* Add check for AWS Neuron availability and AWS Neuron specific compiler flag

* Change the new test's name to TestTrainerDistributedNeuronCore

* Remove "assert" and replace raised exception

* Remove compiler flag as it is optional. If needed, will be another PR.

* Use TORCHELASTIC_RUN_ID to determine whether torchrun is used

c59d71b2

Bump future from 0.18.2 to 0.18.3 in /examples/research_projects/visual_bert (#21173) · f70ee510

dependabot[bot] authored Jan 18, 2023

Bump future in /examples/research_projects/visual_bert

Bumps [future](https://github.com/PythonCharmers/python-future) from 0.18.2 to 0.18.3.
- [Release notes](https://github.com/PythonCharmers/python-future/releases)
- [Changelog](https://github.com/PythonCharmers/python-future/blob/master/docs/changelog.rst)
- [Commits](https://github.com/PythonCharmers/python-future/compare/v0.18.2...v0.18.3

)

---
updated-dependencies:
- dependency-name: future
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

f70ee510

Bump future from 0.18.2 to 0.18.3 in /examples/research_projects/lxmert (#21169) · 0194665c

dependabot[bot] authored Jan 18, 2023

Bumps [future](https://github.com/PythonCharmers/python-future) from 0.18.2 to 0.18.3.
- [Release notes](https://github.com/PythonCharmers/python-future/releases)
- [Changelog](https://github.com/PythonCharmers/python-future/blob/master/docs/changelog.rst)
- [Commits](https://github.com/PythonCharmers/python-future/compare/v0.18.2...v0.18.3

)

---
updated-dependencies:
- dependency-name: future
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

0194665c

Adapt repository creation to latest hf_hub (#21158) · 05e72aa0

Sylvain Gugger authored Jan 18, 2023

* Adapt repository creation to latest hf_hub

* Update all examples

* Fix other tests, add Flax examples

* Address review comments

05e72aa0

Fix doctest CI (#21166) · 32525428

Yih-Dar authored Jan 18, 2023



* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

32525428

using raw string for regex to search <extra_id> (#21162) · 8ad06b7c
Pengfei Liu authored Jan 18, 2023
```
* using raw string for regex to search <extra_id>

* fix the same issue in test file:`tokenization_t5.py`
```
8ad06b7c

fix the issue that the output dict of jit model could not get [:2] (#21146) · 8a17da2f

Wang, Yi authored Jan 18, 2023



"TypeError: unhashable type: 'slice'"
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

8a17da2f

Fix git model for generate with beam search. (#21071) · e1ad1886

Peter Lin authored Jan 18, 2023



* Fix git model for generate with beam search.

* Update comment

* Fix bug on multi batch

* Add generate tests

* Clean up tests

* Fix style
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

e1ad1886

OPT: Fix batched generation with FLAX (#21150) · e15f0d73

Joao Gante authored Jan 18, 2023

* Fix Flax OPT numerical masking

* re-enable test

* add fix to bart and reintroduce copied from in opt

e15f0d73

Fix typos in documentation (#21160) · f4786d7f
Jordi Mas authored Jan 18, 2023
```
* Fix typos in documentation

* Small fix

* Fix formatting
```
f4786d7f

Remove Roberta Dependencies from XLM Roberta Flax and Tensorflow models (#21047) · defdcd28

Samuel Xu authored Jan 18, 2023

* Added flax model code

* Added tf changes

* missed some

* Added copy comments

* Added style hints

* Fixed copy statements

* Added suggested fixes

* Made some fixes

* Style fixup

* Added necessary copy statements

* Fixing copy statements

* Added more copies

* Final copy fix

* Some bugfixes

* Adding imports to init

* Fixed up all make fixup errors

* Fixed doc errors

* Auto model changes

defdcd28

`blip` support for training (#21021) · 023f51fe

Younes Belkada authored Jan 18, 2023

* `blip` support for training

* remove labels creation

* remove unneeded `decoder_input_ids` creation

* final changes

- add colab link to documentation
- reduction = mean for loss

* fix nits

* update link

* clearer error message

023f51fe

Make `test_save_pretrained_signatures` slow test (#21105) · c8849583
Yih-Dar authored Jan 18, 2023
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
c8849583

Add Japanese translation to multilingual.mdx (#21084) · 14154f72

Shogo Hida authored Jan 18, 2023



* Create toctree for Japanese translations
Signed-off-by: Shogo Hida <shogo.hida@gmail.com>

* Copy English version
Signed-off-by: Shogo Hida <shogo.hida@gmail.com>

* Add Japanese translations
Signed-off-by: Shogo Hida <shogo.hida@gmail.com>

* Add Japanese translations
Signed-off-by: Shogo Hida <shogo.hida@gmail.com>
Signed-off-by: Shogo Hida <shogo.hida@gmail.com>

14154f72

🌐 [i18n-KO] Translated `installation.mdx` to Korean (#20948) · 30c12301
Wonhyeong Seo authored Jan 18, 2023
```
docs: ko: installation.mdx
```
30c12301

17 Jan, 2023 5 commits

Fixed num_channels!=3 normalization training (#20630) · 44caf4f6

layjain authored Jan 17, 2023

* Fixed num_channels!=3 normalization training

* empty commit to trigger CI

* Empty-Commit for CircleCI

* Empty-Commit

* Empty Commit try-3: https://discuss.circleci.com/t/github-code-checkout-suddenly-failing/31558



* Empty commit to trigger CI
Co-authored-by: Lay Jain <layjain@basil.csail.mit.edu>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

44caf4f6

Add Epsilon- and Eta-Sampling (#21121) · 865da84a

Sherman Siu authored Jan 17, 2023

* Add epsilon- and eta-sampling.

Add epsilon- and eta-sampling, following the official code from https://github.com/john-hewitt/truncation-sampling and adapting to be more configurable, as required by Huggingface transformers.

* Add unit tests for epsilon- and eta-sampling.

* Black: fix code formatting.

* Fix docstring spacing.

* Clean up newlines.

* Fix implementation bugs and their associated tests.

* Remove epsilon- and eta-sampling parameters from PretrainedConfig.

* Clarify and clean up the documentation.

* Remove parameters for PretrainedConfig test.

865da84a

Refactoring of the text generate API docs (#21112) · 02488103

Maria Khalusova authored Jan 17, 2023

* initial commit, refactoring the text generation api reference

* removed repetitive code examples

* Refactoring the text generation docs to reduce repetition

* make style

02488103

Add: An introductory guide for text generation (#21090) · d386fd64

Maria Khalusova authored Jan 17, 2023



* Part of the "text generation" rework: adding a high-level overview of the text generation strategies

* code samples update via make style

* fixed a few formatting issues

* Apply suggestions from review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fixed spaces, and switched two links to markdown

* Apply Steven's suggestions from review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* new lines after headers to fix link rendering

* review feedback addressed. added links to image captioning and audio transcription examples

* minor capitalization fix

* addressed the review feedback

* Apply suggestions from review
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Applied review suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

d386fd64

Add: tensorflow example for image classification task guide (#21038) · 868d3716

Maria Khalusova authored Jan 17, 2023



* Added TF example for image classification

* Code style polishing

* code style polishing

* minor polishing

* fixed a link in a tip, and a typo in the inference TF content

* Apply Amy's suggestions from review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update docs/source/en/tasks/image_classification.mdx
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* review feedback addressed

* make style

* added PushToHubCallback with save_strategy="no"

* minor polishing

* added PushToHubCallback with save_strategy=no

* minor polishing

* Update docs/source/en/tasks/image_classification.mdx

* added data augmentation
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* make style
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

868d3716