Commits · cec5f7abd1062df18c32109b5c1d19a9bcc14174 · chenpangpang / transformers

07 Dec, 2022 5 commits

Update summarization `run_pipeline_test` (#20623) · cec5f7ab

Yih-Dar authored Dec 07, 2022



* update summarization run_pipeline_test

* update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

cec5f7ab

[`ViTHybrid`] + [`BiT`] cleaner `__init__` (#20649) · 3e4c9e5c
Younes Belkada authored Dec 07, 2022
```
* cleaner `__init__`

* add docstring for `backbone_config`
```
3e4c9e5c
[Trainer] add error when passing `8bit`models (#20651) · aac7b0d2
Younes Belkada authored Dec 07, 2022
```
* add error when passing `8bit`models

* fix

* improve message
```
aac7b0d2

Add BiT + ViT hybrid (#20550) · d151a8c5

NielsRogge authored Dec 07, 2022



* First draft

* More improvements

* Add backbone, first draft of ViT hybrid

* Add AutoBackbone

* More improvements

* Fix bug

* More improvements

* More improvements

* Convert ViT-hybrid

* More improvements

* add patch bit

* Fix style

* Improve code

* cleaned v1

* more cleaning

* more refactoring

* Improve models, add tests

* Add docs and tests

* Make more tests pass

* Improve default backbone config

* Update model_type

* Fix more tests

* Add more copied from statements

* More improvements

* Add push to hub to conversion scripts

* clean

* more cleanup

* clean

* replace to

* fix

* Update src/transformers/models/bit/configuration_bit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* fix base model prefix

* more cleaning

* get rid of stem

* clean

* replace flag

* Update src/transformers/models/bit/configuration_bit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/bit/configuration_bit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* add check

* another check

* fix for hybrid vit

* final fix

* update config

* fix class name

* fix `make fix-copies`

* remove `use_activation`

* Update src/transformers/models/bit/configuration_bit.py

* rm unneeded file

* Add BiT image processor

* rm unneeded file

* add doc

* Add image processor to conversion script

* Add ViTHybrid image processor

* Add resources

* Move bit to correct position

* Fix auto mapping

* Rename hybrid to Hybrid

* Fix name in toctree

* Fix READMEs'

* Improve config

* Simplify GroupNormActivation layer

* fix test + make style

* Improve config

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* remove comment

* remove comment

* replace

* replace

* remove all conv_layer

* refactor norm_layer

* revert x

* add copied from

* last changes + integration tests

* make fixup

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix name

* fix message

* remove assert and refactor

* refactor + make fixup

* refactor - add  + sfety checker

* fix docstring + checkpoint names

* fix merge issues

* fix function name

* fix copies

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix model checkpoint

* fix doctest output

* vit name on doc

* fix name on doc

* fix small nits

* fixed integration tests

* final changes - slow tests pass
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d151a8c5

[MaskFormer] Add support for ResNet backbone (#20483) · b610c47f

NielsRogge authored Dec 07, 2022



* Add SwinBackbone

* Add hidden_states_before_downsampling support

* Fix Swin tests

* Improve conversion script

* Add id2label mappings

* Add vistas mapping

* Update comments

* Fix backbone

* Improve tests

* Extend conversion script

* Add Swin conversion script

* Fix style

* Revert config attribute

* Remove SwinBackbone from main init

* Remove unused attribute

* Use encoder for ResNet backbone

* Improve conversion script and add integration test

* Apply suggestion
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

b610c47f

06 Dec, 2022 16 commits

Pin TensorFlow to the next release (#20635) · 6c1a0b39
Sylvain Gugger authored Dec 06, 2022

6c1a0b39
Clip floating point constants to bf16 range to avoid inf conversion (#20605) · c95f8470
aws-sangeetha authored Dec 06, 2022
```
Co-authored-by: EC2 Default User <ec2-user@ip-172-31-40-169.us-west-2.compute.internal>
```
c95f8470
Fix `natten` installation in docker file (#20632) · f68796bd
Yih-Dar authored Dec 06, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f68796bd
Fix link to speech encoder decoder model in speech recognition readme (#20633) · f821bea0
Francisco Kurucz authored Dec 06, 2022

f821bea0
add missing is_decoder param (#20631) · 4f78bcb2
Steven Liu authored Dec 06, 2022

4f78bcb2
Fix dtype of weights in from_pretrained when device_map is set (#20602) · 7586a1a3
Sylvain Gugger authored Dec 06, 2022

7586a1a3

Update some GH action versions (#20537) · bf9a5882

Yih-Dar authored Dec 06, 2022



* update actions versions

* update actions versions

* update actions versions

* update actions versions
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

bf9a5882

Ci-jukebox (#20613) · acc439ba

Arthur authored Dec 06, 2022



* fix cuda OOM by using single Prior

* only send to device when used

* use custom model

* Skip the big slow test

* Update tests/models/jukebox/test_modeling_jukebox.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

acc439ba

Fix `AutomaticSpeechRecognitionPipelineTests.run_pipeline_test` (#20597) · 9b14c1b6

Yih-Dar authored Dec 06, 2022



* Remove assert exception not triggered

* Fix wrong expected exception string

* fix

* use assertRaisesRegex
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

9b14c1b6

Repo consistency · 6a707cf5
Sylvain Gugger authored Dec 06, 2022

6a707cf5
updating T5 and BART models to support Prefix Tuning (#20601) · 97a51b0c
Sourab Mangrulkar authored Dec 06, 2022
```
* updating T5 and BART models to support Prefix Tuning

* `make fix-copies`

* address comments

* address comments
```
97a51b0c
Check if docstring is None before formating it (#20592) · b9a0ede6
xxyzz authored Dec 06, 2022
```
docstrings could be `None` if Python optimize level is set to 2.
```
b9a0ede6
exclude jit time from the speed metric calculation of evaluation and prediction (#20553) · ae06bce8
Wang, Yi authored Dec 06, 2022
```
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
```
ae06bce8
Adding anchor links to Hindi README (#20606) · 25e10da4
Sourab Mangrulkar authored Dec 06, 2022

25e10da4
Documentation fixes (#20607) · e842e181
Samuel Xu authored Dec 06, 2022

e842e181

Rework the pipeline tutorial (#20437) · 28f3d431

Nicolas Patry authored Dec 06, 2022



* [WIP] Rework the pipeline tutorial

- Switch to `asr` instead of another NLP task.
- It also has simpler to understand results.
- Added a section with interaction with `datasets`.
- Added a section with writing a simple webserver.

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Addressing comments.

* Links.

* Fixing docs format.

* Adding pipeline_webserver to _toctree.

* Warnig -> Tip warnings={true}.

* Fix link ?

* Links ?

* Fixing link, adding chunk batching.

* Oops.

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/pipeline_tutorial.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

28f3d431

05 Dec, 2022 19 commits

Fix test for file not found (#20604) · 5764efe5
Sylvain Gugger authored Dec 05, 2022

5764efe5
Split autoclasses on modality (#20559) · 720e9599
Steven Liu authored Dec 05, 2022
```
* split autoclasses on modality

* apply review

* auto classes
```
720e9599
Fix code sample in preprocess (#20561) · 7d1c1c5b
Steven Liu authored Dec 05, 2022
```
* change to image_processor

* apply review
```
7d1c1c5b

README in Hindi

🇮🇳

(#20097) · 73ec12ea

Sourab Mangrulkar authored Dec 06, 2022

* Created README_hd.md

A Hindi Translation for README

* updated check_copies.py

Added the Proper info for Hindi Translation of README File !

* updated README_hd.md

Fixed some translation issues !

* Update README_hd.md

* Update README_hd.md

* Update README_hd.md

* fixing 🐛 for `make fix-copies`

* run `make fix-copies`

* `make fix-copies` 😅

Co-authored-by: Akshit Gulyan <103456810+AkshitGulyan@users.noreply.github.com>

73ec12ea

Add-whisper-conversion (#20600) · aef9aac3

Arthur authored Dec 05, 2022

* add whisper conversion scrip

* update conversion script

* update arg names

* fix missing encoder_ffn_dim

* fixup

* ast nits

aef9aac3

[Whisper] Fix decoder ids methods (#20599) · 74fb524e
Sanchit Gandhi authored Dec 05, 2022
```
* [Whisper] Fix decoder ids methods

* enum property
```
74fb524e

[Vision] `.to` function for ImageProcessors (#20536) · ef0f85cd

Younes Belkada authored Dec 05, 2022



* add v1 with tests

* add checker

* simplified version

* update docstring

* better version

* fix docstring + change order

* make style

* tests + change conditions

* final tests

* modify docstring

* Update src/transformers/feature_extraction_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* replace by `ValueError`

* fix logic

* apply suggestions

* `dtype` is not needed

* adapt suggestions

* remove `_parse_args_to_device`
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

ef0f85cd

Replace `set-output` by `$GITHUB_OUTPUT` (#20547) · 67d32f46
Yih-Dar authored Dec 05, 2022
```
* remove set-output
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
67d32f46

Fix whisper and speech to text doc (#20595) · 9763f829

Arthur authored Dec 05, 2022

* Fix whisper and speech to text doc
# What does this PR do?
Previously the documentation was badly indented for both models and indicated that
> If `decoder_input_ids` and `decoder_inputs_embeds` are both unset, `decoder_inputs_embeds` takes the value of `inputs_embeds`.`
Which is on valid for the forward pass of the `ForConditionnalGeneration` not for the model alone.

* other fixes

9763f829

clean up unused `classifier_dropout` in config (#20596) · 4430b912
Yih-Dar authored Dec 05, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
4430b912
Fix link to table transformer detection microsoft model (#20560) · eefae413
Francisco Kurucz authored Dec 05, 2022
```
* Fix link to table transformer detection microsoft model

* Fix doc styles
```
eefae413
Fix link to swin transformers v2 microsoft model (#20558) · d5af5a0c
Francisco Kurucz authored Dec 05, 2022

d5af5a0c
Fix link to Swin Model contributor novice03 (#20557) · ac3bccdc
Francisco Kurucz authored Dec 05, 2022

ac3bccdc

Add RemBERT ONNX config (#20520) · 87282cb7

Erin authored Dec 05, 2022



* rembert onnx config

* formatting
Co-authored-by: Ho <erincho@bcd0745f972b.ant.amazon.com>

87282cb7

ESM openfold_utils type hints (#20544) · afe2a466

Matthew Hoffman authored Dec 05, 2022



* add type annotations for esm chunk_utils

use isinstance builtin instead of 'type(x) is y'; add assertions to aid in type inferencing; use bools instead of ints in _get_minimal_slice_set for improved type clarity; refactor to avoid re-assigning to the same variable with a different type

* add type annotations for esm data_transforms

refactor to avoid re-assigning to the same variable with a different type

* add type annotations for esm feats utils

refactor to avoid re-assigning to the same variable with a different type

* add type annotations for esm loss utils

* add/fix type annotations for esm rigit_utils

refactor to avoid re-assigning to the same variable with a different type; fix Callable, Tuple type hints; match conditional structure to other methods; fix return type on Rotation.cat and Rotation.unsqueeze

* add type annotations for esm tensor_utils

overload for tree_map; use insinstance builtin instead of 'type(x) is y'; export dict_multimap, flatten_final_dims, permute_final_dims in openfold_utils

* add type annotations for esm protein utils

add FIXME for attempted string mutation; add missing None check in get_pdb_headers; fix potentially unbound variable 'chain_tag' in to_pdb; modify get_pdb_headers return type

* add type annotations for esm residue constants

hints on collection constants; remove magic trailing comma to reduce number of lines; change list -> tuple for rigid_group_atom_positions for improved hinting

* code style fixup
Co-authored-by: Matt <rocketknight1@gmail.com>

afe2a466

Make convert_to_onnx runable as script again (#20009) · 8ea6694d

Mihai Cernusca authored Dec 05, 2022

* Make convert_to_onnx runable as script again

Fix `convert_graph_to_onnx.py` relative import so it can be run as a script again.

* Trigger CI

8ea6694d

cross platform from_pretrained (#20538) · 84c9bf74

Arthur authored Dec 05, 2022



* add support for `from_pt`

* add tf_flax utility file

* Update src/transformers/modeling_tf_flax_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove flax related modifications

* add test

* remove FLAX related commits

* fixup

* remove safetensor todos

* revert deletion
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

84c9bf74

Ci-whisper-asr (#20588) · 538e5248
Arthur authored Dec 05, 2022
```
* Expected output for the test changed

* fix failing asr test
```
538e5248

Add BioGPT (#20420) · 13e73668

Kamal Raj Kanakarajan authored Dec 05, 2022

* biogpt initial commit

* updated init

* fix faster decoding with use_cache

* 1. fix input_ids and input_embeds with correct device
2. added _keys_to_ignore_on_load_missing
3. updated prepare_inputs_for_generation

* add activation_dropout and scale_embedding

* replace fsmt attention with bart attention

* added test

* run make fix-copies

* doc init and fix build

* updated README with proper information

* 1. added tips to docs
2. updated BioGptTokenizer func

* 1. added tokenizer test
2. refactor tokenizer

* make fixup

* add biogpt fairseq to hf converter

* updated layer names more
similar to original checkpoints

* config update doc string and set defaults

* added "#copied" from bart model and
updated doc strings

* enable model_input_names in tokenizer

* 1.  positionalembedding depending on attention_mask
2. added attention mask to prepare for generation

* added test to verify past and generation

* BioGptLMHeadModel -> BioGptForCausalLM

* fix typo

* tokenization and test
Copyright and updated assertion

* updated Copyright and
one func at time in line

* Copyright updates and
minor doc fix

* replace assertion with ValueError

* rm extra space

* added code syntax

* revert cmnt position change

* add tokenizer to auto

* updated doc string

* tokenizer doc string update

* biogpt hub model update to microsoft/biogpt

* make fixup

* rm cmnt to fix flake8 5.0.4 vs 6 error

13e73668