Commits · bb7949b35a0a2247bc20c7cb6d86764770ca3232 · chenpangpang / transformers

23 Feb, 2022 20 commits

Fix model templates (#15806) · bb7949b3
Lysandre Debut authored Feb 23, 2022
```
* Fix model templates

* Update paths
```
bb7949b3
Docker images should only run on a daily basis · 309e87e2
Lysandre authored Feb 23, 2022

309e87e2
Scheduled tests should only run on a daily basis · c475f3ce
Lysandre authored Feb 23, 2022

c475f3ce
Fix build_documentation CI (#15803) · 6336017c
Eliott C authored Feb 23, 2022

6336017c
[Test refactor 5/5] Build docker images (#15729) · a0e34806
Lysandre Debut authored Feb 23, 2022

a0e34806
[Test refactor 4/5] Improve the scheduled tests (#15728) · 4c737f0e
Lysandre Debut authored Feb 23, 2022

4c737f0e

[Test refactor 3/5] Notification service improvement (#15727) · d3ae2bd3

Lysandre Debut authored Feb 23, 2022



* Per-folder tests reorganization

* Review comments
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>

d3ae2bd3

[Test refactor 2/5] Tests fetcher (#15726) · 0400b226

Lysandre Debut authored Feb 23, 2022



* Tests fetcher

* Review comments
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Review comments

0400b226

[Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41

Lysandre Debut authored Feb 23, 2022



* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>

29c10a41

🧼 NLP task guides (#15731) · fecb08c2

Steven Liu authored Feb 23, 2022

* clean commit of changes to NLP tasks

* 🖍 apply feedback

* 📝

 move tf data collator in multiple choice
Co-authored-by: Steven <stevhliu@gmail.com>

fecb08c2

Fix indent in doc-builder CI (#15798) · 86636f52
Eliott C authored Feb 23, 2022

86636f52

HTML dev docs (#15678) · a1efc823

Eliott C authored Feb 23, 2022


Co-authored-by: Pierric Cistac <Pierrci@users.noreply.github.com>

a1efc823

Align documentation with code defaults (#15468) · 3f76bf54
lsb authored Feb 23, 2022
```
In the code, `do_normalize` defaults to True
```
3f76bf54

[doc] custom_models: mention security features of the Hub (#15768) · 32f5de10

Julien Chaumond authored Feb 23, 2022



* custom_models: tiny doc addition

* mention security feature earlier in the section
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

32f5de10

Enable `image-segmentation` on `AutoModelForSemanticSegmentation` (#15647) · 9e71d464

Nicolas Patry authored Feb 23, 2022

* Enabling Beit SegFormer to `image-segmentation`.

* Fixing the score.

* Fix import ?

* Missing in type hint.

* Multiple test fixes:

- Add `raw_image` support. It should be the default IMHO since in Python
  world it doesn't make any sense to base64 encode the image (Sorry
  @mishig, didn't catch that in my review). I really think we should
  consider breaking BC here.
- Add support for Segformer tiny test (needed
  `SegformerModelTester.get_config` to enable TinyConfig
  @NielsRogge)
- Add the check that `batch_size` works correctly on that pipeline.
  Uncovered that it doesn't for Detr, which IMO is OK since images
  after `feature_extractor` don't have the same size. Comment should
  explain.

* Type hint as a string.

* Make fixup + update black.

* torch+vision protections.

* Don't use torchvision, use F.interpolate instead (no new dep).

* Last fixes for Segformer.

* Update test to reflect new image (which was broken)

* Update tests.

* Major BC modification:

- Removed the string compressed PNG string, that's a job for users
`transformers` stays in python land.
- Removed the `score` for semantic segmentation. It has hardly a meaning
  on its own in this context.
- Don't include the grayscale with logits for now (which could enable
  users to get a sense of confidence). Might be done later.
- Don't include the surface of the mask (could be used for sorting by
  users, to filter out small masks). It's already calculable, and
  it's easier to add later, than to add now and break later if we need.

* `make fixup`.

* Small changes.

* Rebase + doc fixup.

9e71d464

[ViLT] Fix checkpoint url in config (#15790) · 1b239797

Suraj Patil authored Feb 23, 2022



* [ViLT] Fix checkpoint url in config

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

1b239797

[CLIP] fix grad ckpt (#15789) · de737866
Suraj Patil authored Feb 23, 2022

de737866
Supporting Merges.txt files than contain an endline. (#15782) · a3e607d1
Nicolas Patry authored Feb 23, 2022
```
(`hf-internal-testing/tiny-clip` for instance)
```
a3e607d1
[M2M100, XGLM] fix create_position_ids_from_inputs_embeds (#15751) · 24588c67
Suraj Patil authored Feb 23, 2022

24588c67

Adding ZeroShotImageClassificationPipeline (#12119) · f9582c20

Nicolas Patry authored Feb 23, 2022



* [Proposal] Adding ZeroShotImageClassificationPipeline

- Based on CLIP

* WIP, Resurection in progress.

* Resurrection... achieved.

* Reword handling different `padding_value` for `feature_extractor` and
`tokenizer`.

* Thanks doc-builder !

* Adding docs + global namespace `ZeroShotImageClassificationPipeline`.

* Fixing templates.

* Make the test pass and be robust to floating error.

* Adressing suraj's comments on docs mostly.

* Tf support start.

* TF support.

* Update src/transformers/pipelines/zero_shot_image_classification.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

f9582c20

22 Feb, 2022 8 commits

Fix `HfArgumentParser` when passing a generator (#15758) · 05a12a09

Santiago Castro authored Feb 22, 2022

* Fix `HfArgumentParser` when passing a generator

* Add missing import

* Always convert `dataclass_types` into a list

05a12a09

Cleanup transformers-cli (#15767) · db57bb2b
Julien Chaumond authored Feb 22, 2022

db57bb2b
Fix typo on examples/pytorch/question-answering (#15644) · 3db2e8f9
Yongrae Jo authored Feb 23, 2022
```
cna -> can
```
3db2e8f9
fixed pipeline code (#15607) · 2cdb6dbe
Boumadane Abdelmoumene authored Feb 22, 2022
```
Co-authored-by: Boumadane Abdelmoumene <moumene.boumadane@gmail.com>
```
2cdb6dbe

Time stamps for CTC models (#15687) · c44d3675

Patrick von Platen authored Feb 22, 2022



* [Wav2Vec2 Time Stamps]

* Add first version

* add word time stamps

* Fix

* save intermediate space

* improve

* [Finish CTC Tokenizer]

* remove @

* remove @

* push

* continue with phonemes

* up

* finish PR

* up

* add example

* rename

* finish

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* correct split

* finalize
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c44d3675

Gelu10 (#15676) · 32295b15

Funtowicz Morgan authored Feb 22, 2022

* Add GeLU10 (clipped version of GeLU) to transformers to improve quantization performances.

* Add unittests.

* Import tensorflow after `is_tf_available` check.

* Fix tensorflow wrong function `tf.tensor` to `tf.constant`

* style.

* use `tf.math.max`

* Fix tf tests.

* style.

* style style style style style style

* style style style style style style

* Address @sgugger comments.

* Fix wrong operator for raising ValueError for ClippedGELUActivation.

32295b15

TF train_step docstring (#15755) · 2c3fcc64
Joao Gante authored Feb 22, 2022
```
* TF train_step docstring
```
2c3fcc64
added link to our writing-doc document (#15756) · 38bed912
Francesco Saverio Zuppichini authored Feb 22, 2022

38bed912

21 Feb, 2022 6 commits

revert temporary addition to test next version of CLIPTokenizerFast (#15717) · 0187c6f0
SaulLu authored Feb 21, 2022

0187c6f0
TF text classification examples (#15704) · 3956b133
Joao Gante authored Feb 21, 2022
```
* Working example with to_tf_dataset

* updated text_classification

* more comments
```
3956b133
Add layer_idx to CrossAttention of GPT2 model (#15730) · 142b69f2
Kevin Ko authored Feb 22, 2022
```
* Add layer_idx to CrossAttention

* Add layer_idx to crossattention of ImageGPT model
```
142b69f2

add VisionTextDualEncoder and CLIP fine-tuning script (#15701) · 86119c11

Suraj Patil authored Feb 21, 2022



* begin script

* update script

* fix features and data args

* main

* add requirements

* add column name args

* fix captions

* don't jit transforms

* fix caption

* fix labels, handle attention mask

* convert pixel values to numpy

* labels => input_ids

* transform images on the fly

* use AutoModel class, create the hybird model outside of the script

* fix version message

* add readme

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* adderss review comments

* add more comments

* allow freezing vision and text models
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

86119c11

Fix minor comment typos (#15740) · 5444687f
Ivan Agarský authored Feb 21, 2022

5444687f
Remove input and target reset after preprocessing (#15741) · a63bd367
Simon Sardorf authored Feb 21, 2022
```
Remove input and target reset after preprocessing
```
a63bd367

18 Feb, 2022 6 commits

Add missing PLBart entry in README (#15721) · 2c2a31ff

Gunjan Chhablani authored Feb 19, 2022

* Add missing PLBart entry in index

* Fix README

* Fix README

* Fix style

* Change to master model doc

2c2a31ff

fix bug in PT speech-encoder-decoder (#15699) · 60ba4820

Sanchit Gandhi authored Feb 18, 2022



* fix bug in PT speech-encoder-decoder

* add pt test for `inputs is not None`

* fix test

* new pt test

* Update tests/test_modeling_speech_encoder_decoder.py

* make fixup
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

60ba4820

fix: hfdeepspeed config argument (#15711) · 3de12906

Jake Tae authored Feb 18, 2022

`HfDeepSpeedConfig` accepts a dictionary or path to `.json` file containing DS configurations, not `TrainingArguments`.

3de12906

Fix auto (#15706) · 83f45cd6
Lysandre Debut authored Feb 18, 2022

83f45cd6
style_doc handles decorators in examples (#15719) · d5083c33
Sylvain Gugger authored Feb 18, 2022

d5083c33

Add PLBart (#13269) · ae1f8350

Gunjan Chhablani authored Feb 18, 2022

* Init PLBART

* Add missing configuration file

* Add conversion script and configurationf ile

* Fix style

* Update modeling and conversion scripts

* Fix scale embedding in config

* Add comment

* Fix conversion script

* Add classification option to conversion script

* Fix vocab size in config doc

* Add tokenizer files from MBart50

* Allow no lang code in regular tokenizer

* Add PLBart Tokenizer Converters

* Remove mask from multi tokenizer

* Remove mask from multi tokenizer

* Change from MBart-50 to MBart tokenizer

* Fix names and modify src/tgt behavior

* Fix imports for tokenizer

* Remove <mask> from multi tokenizer

* Fix style

* Change tokenizer_class to processor_class

* Add attribute map to config class

* Update modeling file to modified MBart code

* Update configuration file to MBart style configuration

* Fix tokenizer

* Separate tokenizers

* Fix error in tokenization auto

* Copy MBart tests

* Replace with MBart tokenization tests

* Fix style

* Fix language code in multi tokenizer

* Fix configuration docs

* Add entry for plbart_multi in transformers init

* Add dummy objects and fix imports

* Fix modeling tests

* Add TODO in config

* Fix copyright year

* Fix modeling docs and test

* Fix some tokenization tests and style

* Add changes from review

* Fix copies

* Fix docs

* Fix docs

* Fix style

* Fix year

* Add changes from review

* Remove extra changes

* Fix base tokenizer and doc

* Fix style

* Fix modeling and slow tokenizer tests

* Remove Multi-tokenizer Converter and Tests

* Delete QA model and Multi Tokenizer dummy objects

* Fix repo consistency and code quality issues

* Fix example documentation

* Fix style

* Remove PLBartTokenizer from type checking in init

* Fix consistency issue

* Add changes from review

* Fix style

* Remove PLBartTokenizerFast

* Remove FastTokenizer converter

* Fix AutoTokenzier mapping

* Add plbart to toctree and fix consistency issues

* Add language codes tokenizer test

* Fix styling and doc issues

* Add fixes for failing tests

* Fix copies

* Fix failing modeling test

* Change assert to assertTrue in modeling tests

ae1f8350