Commits · c21298a69b8f7537224f39706f6d6dd5cae47b98 · chenpangpang / transformers

31 Jan, 2023 5 commits

NielsRogge authored Jan 31, 2023



* Improve docs

* Add DETA resources

---------
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

c21298a6

Do not log the generation config for each prediction step in TrainerSeq2Seq (#21385) · d31497b1
regisss authored Jan 31, 2023
```
Do not log the generation config for each iteration
```
d31497b1

Cleanup the usage of `layer_norm_eps` in some models (#21336) · 98d40fed

Yih-Dar authored Jan 31, 2023



* fix

* fix

* make style

* For CLIP

* For OwlViT

* For XCLIP

* For CLIPSeg

* For GroupViT

* fix docstrings

* fix docstrings

* For AltCLIP

* For ChineseCLIP

* For Blip

* For GiT

* make style

* update

* update

* update

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

98d40fed

Template for framework-agnostic tests (#21348) · 623346ab
Joao Gante authored Jan 31, 2023

623346ab

Add DETA (#20983) · 5451f889

NielsRogge authored Jan 31, 2023

* First draft

* Add initial draft of conversion script

* Convert all weights

* Fix config

* Add image processor

* Fix DetaImageProcessor

* Run make fix copies

* Remove timm dependency

* Fix dummy objects

* Improve loss function

* Remove conv_encoder attribute

* Update conversion scripts

* Improve postprocessing + docs

* Fix copied from statements

* Add tests

* Improve postprocessing

* Improve postprocessing

* Update READMEs

* More improvements

* Fix rebase

* Add is_torchvision_available

* Add torchvision dependency

* Fix typo and README

* Fix bug

* Add copied from

* Fix style

* Apply suggestions

* Fix thanks to @ydshieh

* Fix another dependency check

* Simplify image processor

* Add scipy

* Improve code

* Add threshold argument

* Fix bug

* Set default threshold

* Improve integration test

* Add another integration test

* Update setup.py

* Address review

* Improve deformable attention function

* Improve copied from

* Use relative imports

* Address review

* Replace assertions

* Address review

* Update dummies

* Remove dummies

* Address comments, update READMEs

* Remove custom kernel code

* Add image processor tests

* Add requires_backends

* Add minor comment

* Update scripts

* Update organization name

* Fix defaults, add doc tests

* Add id2label for object 365

* Fix tests

* Update task guide

5451f889

30 Jan, 2023 12 commits
- [`run_(clm|mlm).py` examples] add streaming dataset support (#21343) · 98d88b23
  Stas Bekman authored Jan 30, 2023
```
* [run_clm example] add streaming dataset support

* unrefactor kwargs

* fix

* fix

* require datasets>=2.0.0

* port to mlm
```
  98d88b23
- translate index to zh(#20095) (#21351) · 95be242a
  BFSS authored Jan 31, 2023
```
translate index to zh
Co-authored-by: bfss <bfss@bfss.com>
```
  95be242a
- Adding resource section to GPT-J docs (#21270) · 914e5009
  Adit Krishnan authored Jan 30, 2023
```
* Added resource section to GPT-J docs

* Added most of the links found

* Addressing review comments

* Fixing formatting

* Update docs/source/en/model_doc/gptj.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Fixing one of the labels

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
```
  914e5009
- Fixes path for Graphormer checkpoint (#21367) · 14d989a9
  Clémentine Fourrier authored Jan 30, 2023
```
[FIX] path for Graphormer checkpoint
```
  14d989a9
- Generate: Relaxed `max_length` and `max_new_tokens` coexistence (#21347) · 42b60f8b
  Joao Gante authored Jan 30, 2023
```
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  42b60f8b
- Add cPython files in build (#21372) · 6eb3c66a
  Sylvain Gugger authored Jan 30, 2023
  
  6eb3c66a
- Fix DETR tests after #21144 (#21365) · 59611a0f
  amyeroberts authored Jan 30, 2023
```
* Fix annotation check

* Fix annotation check

* Update type annotations
```
  59611a0f
- Remove duplicate declarations in dummy inputs for TFLongformer (#21352) · 7a2e1320
  Yichao 'Peak' Ji authored Jan 30, 2023
```
Remove duplicate declarations
```
  7a2e1320
- Corrected (#21350) · 96addecf
  简律纯 authored Jan 30, 2023
  
  96addecf
- Fix the issue where the output dict of a jit model could not be indexed with [0] (#21354) · f3a7beff
  Wang, Yi authored Jan 30, 2023
  
  f3a7beff
- Pipeline testing - using tiny models on Hub (#20426) · c749bd40
  Yih-Dar authored Jan 30, 2023
```
* rework pipeline tests

* run pipeline tests

* fix

* fix

* fix

* revert the changes in get_test_pipeline() parameter list

* fix expected error message

* skip a test

* clean up

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  c749bd40
- Fix `GitModelIntegrationTest.test_batched_generation` device issue (#21362) · a582cfce
  Yih-Dar authored Jan 30, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  a582cfce
27 Jan, 2023 6 commits

Automated compatible models list for task guides (#21338) · 73a2ff69

Maria Khalusova authored Jan 27, 2023

* initial commit. added tip placeholders and a script

* removed unused imports, fixed paths

* fixed generated links

* make style

* split language modeling doc into two: causal language modeling and masked language modeling

* added check_task_guides.py to make fix-copies

* review feedback addressed

73a2ff69

Little cleanup: let huggingface_hub manage token retrieval (#21333) · 8f3b4a1d

Lucain authored Jan 27, 2023

* Let huggingface_hub manage token retrieval

* flake8

* code quality

* adapt in every PushToHubMixin children

* add explicit return type

8f3b4a1d

[Whisper] another patch (#21324) · 0dff407d

Arthur authored Jan 27, 2023

* another patch

* fix timestamp test modeling

* let it be negative when the token is None

0dff407d

Fix `RobertaPreLayerNorm` doctest (#21337) · e5eb3e22

Yih-Dar authored Jan 27, 2023



* add mask="<mask>"

* update

* update

* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e5eb3e22

Bump onnx from 1.11.0 to 1.13.0 in /examples/research_projects/decision_transformer (#21331) · 36b668fa

dependabot[bot] authored Jan 27, 2023

Bump onnx in /examples/research_projects/decision_transformer

Bumps [onnx](https://github.com/onnx/onnx) from 1.11.0 to 1.13.0.
- [Release notes](https://github.com/onnx/onnx/releases)
- [Changelog](https://github.com/onnx/onnx/blob/main/docs/Changelog.md)
- [Commits](https://github.com/onnx/onnx/compare/v1.11.0...v1.13.0)

---
updated-dependencies:
- dependency-name: onnx
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

36b668fa

Fix M2M100 positional embedding creation for ONNX (#21328) · 938f437c
Michael Benayoun authored Jan 27, 2023
```
* Fix M2M100 positional embedding creation for ONNX

* Restore READMEs

* Trigger CI
```
938f437c

26 Jan, 2023 10 commits

Update Hebrew language code to he per IANA registry (#21310) · 7d2a5fa7

altryne authored Jan 26, 2023

Here's my original PR to whisper that makes the same change:
https://github.com/openai/whisper/pull/401

Per the [IANA registry](https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry), `iw` was deprecated as the code for Hebrew in 1989, and the preferred code is `he`.

The correct subtag: 
```
%%
Type: language
Subtag: he
Description: Hebrew
Added: 2005-10-16
Suppress-Script: Hebr
%%
```
And the deprecation:
```
%%
Type: language
Subtag: iw
Description: Hebrew
Added: 2005-10-16
Deprecated: 1989-01-01
Preferred-Value: he
Suppress-Script: Hebr
%%
```

7d2a5fa7
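
The registry entries quoted in 7d2a5fa7 say that `iw` is deprecated and that `he` is its preferred value. A minimal sketch of how a caller might normalize such codes is shown below. The names `DEPRECATED_LANGUAGE_CODES` and `normalize_language_code` are hypothetical and are not taken from the commit or from the library; only the `iw` to `he` mapping comes from the registry excerpt.

```python
# Hypothetical lookup table: deprecated subtag -> preferred subtag.
# Only the Hebrew entry is backed by the registry excerpt in the commit message.
DEPRECATED_LANGUAGE_CODES = {"iw": "he"}


def normalize_language_code(code: str) -> str:
    """Return the preferred subtag for a language code.

    Codes that are not listed as deprecated are returned unchanged
    (after trimming whitespace and lowercasing).
    """
    cleaned = code.strip().lower()
    return DEPRECATED_LANGUAGE_CODES.get(cleaned, cleaned)


if __name__ == "__main__":
    for raw in ("iw", "he", "en"):
        print(raw, "->", normalize_language_code(raw))
```

Keeping the mapping in a single table means further deprecated subtags could be added without touching the lookup logic.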

[Doctest] Fix `Perceiver` doctest (#21318) · b225ee6e
Younes Belkada authored Jan 26, 2023
```
fix `Perceiver` doctest
```
b225ee6e

Generate: better `compute_transition_scores` examples (#21323) · 2b8feffa
Joao Gante authored Jan 26, 2023

2b8feffa

Fix `TFEncoderDecoder` tests (#21301) · 449df41f

Yih-Dar authored Jan 26, 2023



remove max_length=None
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

449df41f

check paths in `utils/documentation_tests.txt` (#21315) · 857bad6e

Yih-Dar authored Jan 26, 2023



* check paths in utils/documentation_tests.txt

* check paths in utils/documentation_tests.txt
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

857bad6e

Small QoL for qa. (#21316) · fd0ef8b6
Nicolas Patry authored Jan 26, 2023

fd0ef8b6

[i18n-KO] Translated quicktour page to Korean (#20946) · a01dd381

Wonhyeong Seo authored Jan 26, 2023



docs: ko: quicktour page

review by @ArthurZucker
docs: fix: remove duplicate
Co-Authored-By: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

a01dd381

Fix 2 paths in the doctest list (#21314) · 31336dcf
Yih-Dar authored Jan 26, 2023
```
fix the list
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
31336dcf

Use `model_class.__name__` and compare against `XXX_MAPPING_NAMES` (#21304) · 4e41b87e

Yih-Dar authored Jan 26, 2023



* update

* update all

* clean up

* make quality

* clean up
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4e41b87e

Accept batched tensor of images as input to image processor (#21144) · d18a1cba
amyeroberts authored Jan 26, 2023
```
* Accept a batched tensor of images as input

* Add to all image processors

* Update oneformer
```
d18a1cba

25 Jan, 2023 7 commits

[WHISPER] Small patch (#21307) · 6f3faf38

Arthur authored Jan 25, 2023

* add small patch

* update tests, forced decoder ids do not take priority over the generation config

* fix two new tests

6f3faf38

Small fix to ExponentialDecayLengthPenalty docstring (#21308) · 140c6ede

Nick Hill authored Jan 25, 2023

Currently, it incorrectly states that the exponential_decay_length_penalty tuple parameter is optional.

Also changed the corresponding type hint to be more specific.

140c6ede
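
Commit 140c6ede notes that the `exponential_decay_length_penalty` tuple is required, not optional, and that its type hint was made more specific. The sketch below illustrates what a required, precisely typed tuple parameter can look like. It is not the library's implementation: the helper `scale_eos_score` is hypothetical, and reading the tuple as `(start_index, decay_factor)` is an assumption that the commit message does not spell out.

```python
from typing import Tuple


def scale_eos_score(
    eos_score: float,
    cur_len: int,
    exponential_decay_length_penalty: Tuple[int, float],
) -> float:
    """Hypothetical helper: scale an end-of-sequence score once generation
    has passed a start index.

    The tuple is a required argument (no default value), and the hint
    Tuple[int, float] says exactly which two values are expected.
    Assumption: the tuple holds (start_index, decay_factor).
    """
    start_index, decay_factor = exponential_decay_length_penalty
    if cur_len <= start_index:
        # No penalty is applied up to and including the start index.
        return eos_score
    # Scale the score exponentially in the number of tokens past the start index.
    return eos_score * decay_factor ** (cur_len - start_index)


if __name__ == "__main__":
    print(scale_eos_score(2.0, cur_len=5, exponential_decay_length_penalty=(3, 1.5)))
```

Because the parameter has no default, omitting it raises a `TypeError` at call time, which matches the corrected docstring's statement that the argument is not optional.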

Add BridgeTower model (#20775) · 3a6e4a22

Anahita Bhiwandiwalla authored Jan 25, 2023



* Commit with BTModel and latest HF code

* Placeholder classes for BTForMLM and BTForITR

* Importing Bert classes from transformers

* Removed objectives.py and dist_utils.py

* Removed swin_transformer.py

* Add image normalization, BridgeTowerForImageAndTextRetrieval

* Add center_crop

* Removing bert tokenizer and LCI references

* Tested config loading from HF transformers hub

* Removed state_dict updates and added path to hub

* Enable center crop

* Getting image_size from config, renaming num_heads and num_layers

* Handling max_length in BridgeTowerProcessor

* Add BridgeTowerForMaskedLM

* Add doc string for BridgeTowerConfig

* Add doc strings for BT config, processor, image processor

* Adding docs, removed swin

* Removed convert_bridgetower_original_to_pytorch.py

* Added doc files for bridgetower, removed is_vision

* Add support attention_mask=None and BridgeTowerModelOutput

* Fix formatting

* Fixes with 'make style', 'make quality', 'make fixup'

* Remove downstream tasks from BridgeTowerModel

* Formatting fixes, add return_dict to BT models

* Clean up after doc_test

* Update BTModelOutput return type, fix todo in doc

* Remove loss_names from init

* implement tests and update tuples returned by models

* Add image reference to bridgetower.mdx

* after make fix-copies, make fixup, make style, make quality, make repo-consistency

* Rename class names with BridgeTower prefix

* Fix for image_size in BTImageProcessor

* implement feature extraction bridgetower tests

* Update image_mean and image_std to be list

* remove unused import

* Removed old comments

* Rework CLIP

* update config in tests following config update

* Formatting fixes

* Add copied from for BridgeTowerPredictionHeadTransform

* Update bridgetower.mdx

* Update test_feature_extraction_bridgetower.py

* Update bridgetower.mdx

* BridgeTowerForMaskedLM is conditioned on image too

* Add BridgeTowerForMaskedLM

* Fixes

* Call post_init to init weights

* Move freeze layers into method

* Remove BTFeatureExtractor, add BT under multimodal models

* Remove BTFeatureExtractor, add BT under multimodal models

* Code review feedback - cleanup

* Rename variables

* Formatting and style to PR review feedback

* Move center crop after resize

* Use named parameters

* Style fix for modeling_bridgetower.py

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/bridgetower/modeling_bridgetower.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/bridgetower/modeling_bridgetower.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/models/bridgetower/modeling_bridgetower.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Rename config params, copy BERT classes, clean comments

* Cleanup irtr

* Replace Roberta imports, add BTTextConfig and Model

* Update docs, add visionconfig, consistent arg names

* make fixup

* Comments for forward in BTModel and make fixup

* correct tests

* Remove inconsistent roberta copied from

* Add BridgeTowerTextModel to dummy_pt_objects.py

* Add BridgeTowerTextModel to IGNORE_NON_TESTED

* Update docs for BT Text and Vision Configs

* Treat BridgeTowerTextModel as a private model

* BridgeTowerTextModel as private

* Run make fix-copies

* Adding BTTextModel to PRIVATE_MODELS

* Fix for issue with BT Text and Image configs

* make style changes

* Update README_ja.md

Add から to BridgeTower's description

* Clean up config, .mdx and arg names

* Fix init_weights. Remove nn.Sequential

* Formatting and style fixes

* Re-add tie_word_embeddings in config

* update test implementation

* update style

* remove commented out

* fix style

* Update README with abs for BridgeTower

* fix style

* fix mdx file

* Update bridgetower.mdx

* Update img src in bridgetower.mdx

* Update README.md

* Update README.md

* resolve style failure

* Update _toctree.yml

* Update README_ja.md

* Removed mlp_ratio, rename feats, rename BTCLIPModel

* Replace BTCLIP with BTVisionModel,pass in vision_config to BTVisionModel

* Add test_initialization support

* Add support for output_hidden_states

* Update support for output_hidden_states

* Add support for output_attentions

* Add docstring for output_hidden_states

* update tests

* add bridgetowervisionmodel as private model

* rerun the PR test

* Remove model_type, pass configs to classes, renames

* Change self.device to use weight device

* Remove image_size

* Style check fixes

* Add hidden_size and num_hidden_layers to BridgeTowerTransformer

* Update device setting

* cosmetic update

* trigger test again

* trigger tests again

* Update test_modeling_bridgetower.py

trigger tests again

* Update test_modeling_bridgetower.py

* minor update

* re-trigger tests

* Update docs/source/en/model_doc/bridgetower.mdx
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Remove pad, update max_text_len, doc cleanup, pass eps to LayerNorm

* Added copied to, some more review feedback

* make fixup

* Use BridgeTowerVisionEmbeddings

* Code cleanup

* Fixes for BridgeTowerVisionEmbeddings

* style checks

* re-tests

* fix embedding

* address comment on init file

* retrigger tests

* update import prepare_image_inputs

* update test_image_processing_bridgetower.py to reflect test_image_processing_common.py

* retrigger tests
Co-authored-by: Shaoyen Tseng <shao-yen.tseng@intel.com>
Co-authored-by: Tiep Le <tiep.le@intel.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Tiep Le <97980157+tileintel@users.noreply.github.com>

3a6e4a22

[CI-Daily] replace `past` in prepare inputs for generation (#21296) · 39799fbf
Arthur authored Jan 25, 2023
```
replace `past` in prepare inputs for generation
```
39799fbf

Documentation code sample fixes (#21302) · 23844941

Maria Khalusova authored Jan 25, 2023

* Fixed the following:
pipe -> pipeline
out in pipe(data()) is a list of dict, not a dict

* Fixed the TypeError: __init__() missing 1 required positional argument: 'key'

* Added a tip: code sample requires additional libraries to run

* Fixed custom config's name

* added seqeval to the required libraries

* fixed a missing dependency,
fixed metric naming,
added checkpoint to fix the datacollator

* added checkpoint to fix the datacollator,
added missing dependency

23844941

[Doctest] Fix `Blenderbot` doctest (#21297) · 015443f4
Younes Belkada authored Jan 25, 2023
```
fix blenderbot doctest

- add correct expected value
```
015443f4

Update `OneFormerModelIntegrationTest` expected values (#21295) · cc714d74

Yih-Dar authored Jan 25, 2023



* update values

* update values

* update values

* Update tests/models/oneformer/test_modeling_oneformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

cc714d74