Commits · 985c7e3ac9136253f250219a607dce881f0bb97b · chenpangpang / transformers

28 Jul, 2022 3 commits
- Updated _toctree.yml (#18337) · 985c7e3a
  Nicola Procopio authored Jul 28, 2022
  
  985c7e3a
- updated translation (#18333) · a8e27957
  Edoardo Federici authored Jul 28, 2022
```
Left the term fine-tuning since there is no correct translation into Italian and the English term is generally used. The same was done with some terms like "learning rate"
```
  a8e27957
- fixed typo (#18331) · 1e380c7d
  Edoardo Federici authored Jul 28, 2022
  
  1e380c7d
27 Jul, 2022 19 commits

Update feature extractor docs (#18324) · 96be1b7f

Steven Liu authored Jul 27, 2022

As pointed out by @NielsRogge, a feature extractor is used to prepare inputs for a model with a single modality rather than multimodal models.

96be1b7f

start from 1.12, torch_ccl is renamed as oneccl_bindings_for_pytorch … (#18229) · 2b81f72b

Wang, Yi authored Jul 27, 2022



* start from 1.12, torch_ccl is renamed as oneccl_bindings_for_pytorch and should import it before use
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* add doc for perf_train_cpu_many
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* update doc
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

2b81f72b

Add swin transformer v2 (#17469) · e87ac9d1

Ritik Nandwal authored Jul 27, 2022

* Add files generated using transformer-cli add-new-model-like command

* Add changes for swinv2 attention and forward method

* Add fixes

* Add modifications for weight conversion and remaining args in swin model

* Add changes for patchmerging

* Add changes for SwinV2selfattention

* Update conversion script

* Add final fixes for the swin_v2 model

* Add changes for conversion script for pretrained window size case

* Add pretrained window size value from config in SwinV2Encoder class

* Make fixup

* Add swinv2 to models_not_in_readme to utils/check_copies.py

* Modify Swinv2v2 to Swin Transformer V2

* Remove copied from, to run make fixup command

* Add updates to swinv2tf from main branch

* Add pretrained_window_size to config, to make tests pass

* Add modified weights from nandwalritik profile for swinv2

* Update model weights from swinv2 from nandwalritik profile

* Add fix for build_pr_documentation CI fix

* Add fixes f...

e87ac9d1

Dev version · c89a592e
Lysandre authored Jul 27, 2022

c89a592e

[Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c

Sanchit Gandhi authored Jul 27, 2022

* [Flax] Fix incomplete batches in example scripts

* fix dataloader batching

* convert jnp batch idxs to np array

* add missing `pad_shard_unpad` to final prediction generate step

* only `pad_shard_unpad` at inference time

* merge conflicts

* remove incomplete batch step from eval

* fix run_qa.py

* add `pad_shard_unpad` to run_flax_ner.py

* add `pad_shard_unpad` to run_flax_glue.py

* add `pad_shard_unpad` to run_image_classification.py

* make style

* fix mlm flax eval batches

* remove redundant imports

7490a97c

Owlvit test fixes (#18303) · 9caf68a6

Alara Dirik authored Jul 27, 2022

* fix owlvit test assertion errors

* fix gpu test error

* remove redundant lines

* fix styling

9caf68a6

Fix sacremoses sof dependency for Transformers XL (#18321) · 0077360d
Sylvain Gugger authored Jul 27, 2022
```
* Fix sacremoses sof dependency for Transofmers XL

* Add function to the submodule init
```
0077360d
sentencepiece shouldn't be required for the fast LayoutXLM tokenizer (#18320) · 5c5676cd
Lysandre Debut authored Jul 27, 2022

5c5676cd
Remove all uses of six (#18318) · cf32b2ee
Sylvain Gugger authored Jul 27, 2022
```
* Remove all uses of six

* fix quality
```
cf32b2ee
Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
Duong A. Nguyen authored Jul 27, 2022
```
* generalize decay_mask_fn to find all layernorm params

* fixup

* generalising decay_mask_fn
```
170fcaa6
fix loading from pretrained for sharded model with `torch_dtype="auto" (#18061) · 83d2d745
Nouamane Tazi authored Jul 27, 2022

83d2d745
fix module order (#18312) · 7996ef74
Younes Belkada authored Jul 27, 2022
```
- put gelu before 4h to h
```
7996ef74

Fixes torch jit tracing for LayoutLMv2 model (re-open) (#18313) · 70e7d1d6

Mikkel Denker authored Jul 27, 2022

* Fixes torch jit tracing for LayoutLMv2 model.
Pytorch seems to reuse memory for input_shape which caused a mismatch in shapes later in the forward pass.

* Fixed code quality

* avoid unneeded allocation of vector for shape

70e7d1d6

Update CodeParrot readme to include training in Megatron (#17798) · 1d71ad89

Loubna Ben Allal authored Jul 27, 2022



* add info about megatron training

* upload models and datasets from CodeParrot organization

* upload models and datasets from CodeParrot organization

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* fix typo and add comment about codeparrot vs megatron
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

1d71ad89

[XLA] Improve t5 model performance (#18288) · d5610b53
Yanming Wang authored Jul 27, 2022

d5610b53
Apply type correction to `TFSwinModelOutput` (#18295) · e318cda9
Seunghwan Hong authored Jul 27, 2022
```
Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr>
```
e318cda9

[EncoderDecoder] Improve docs (#18271) · ccd4180f

NielsRogge authored Jul 27, 2022



* Improve docs

* Improve docs of speech one as well

* Apply suggestions from code review
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

ccd4180f

Remove duplicated line (#18310) · 5dfec704

Manuel R. Ciosici authored Jul 27, 2022

Removes a duplicated instantiation of device. I removed the second instance of the line to maintain code alignment with the GPT-J implementation of forward.

5dfec704

[DETR] Improve code examples (#18262) · 47c2af09

NielsRogge authored Jul 27, 2022



* Improve doc test

* Improve code example of segmentation model

* Apply suggestion

* Update src/transformers/models/detr/modeling_detr.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

47c2af09

26 Jul, 2022 18 commits

patch for smddp import (#18244) · ee67e7ad
Carolyn Wang authored Jul 26, 2022
```
* add import

* format
```
ee67e7ad

Fix Sylvain's nits on the original KerasMetricCallback PR (#18300) · 68097dcc

Matt authored Jul 26, 2022



* Fix Sylvain's nits on the original PR

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Re-add "optional" to docstring
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

68097dcc

Add PYTEST_TIMEOUT for CircleCI test jobs (#18251) · 66491331
Yih-Dar authored Jul 26, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
66491331

Add Spanish translation of custom_models.mdx (#17807) · a5d50483

Ian Castillo authored Jul 26, 2022

* Update index

* Translate to Spanish two sections from custom_models

* Translate to Spanish custom models documentation

* Fixing typos and grammatical errors

* Add requested changes from reviewer

a5d50483

Add Italian translation of sharing_custom_models.mdx (#17631) · 7ea7eba3

Federico Panero authored Jul 26, 2022



* work in progress: custom_models

* Update custom_models.mdx

* Update custom_models.mdx

* Update _toctree.yml

* Update _toctree.yml

* Update custom_models.mdx

* Update custom_models.mdx

* Update _toctree.yml

* Update _toctree.yml
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

7ea7eba3

Add PyTorch 1.11 to past CI (#18302) · c4c6b4db
Yih-Dar authored Jul 26, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
c4c6b4db

Add Italian translation of converting_tensorflow_models.mdx (#18283) · bbc28106

Federico Panero authored Jul 26, 2022



* Add Italian translation of converting_tensorflow_models.mdx

* Update _toctree.yml

* Update converting_tensorflow_models.mdx

* Update docs/source/it/_toctree.yml
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

bbc28106

Raise a TF-specific error when importing Torch classes (#18280) · a649de55

Matt authored Jul 26, 2022



* Raise a TF-specific error when importing Torch classes

* Update src/transformers/utils/import_utils.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Add an inverse error for PyTorch users
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

a649de55

[ create_a_model.mdx ] translate to pt (#18098) · 5e0ffd91

Fellip Silva Alves authored Jul 26, 2022



* [ fast_tokenizers.mdx ] - Added translation to portuguese to tutorial

* Delete docs/source/pt-br directory

* [ fast_tokenizers.mdx ] - Continuing work on file

* [ fast_tokenizers.mdx ] - Continuing work on file

* Add fast tokenizers to _toctree.yml

* Eliminated config and toctree.yml

* Nits in fast_tokenizers.mdx

* Finishing create_a_model

* [ create_a_model.mdx ] finishing create a model in pt-br

* [ Changing _toctree.yml ] adding create a model in pt
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

5e0ffd91

Update translation.mdx (#18169) · f58b9c05
Gorkem Ozkaya authored Jul 26, 2022
```
* Update translation.mdx

* update translation.mdx by running make style
```
f58b9c05
Add TFAutoModelForImageClassification to pipelines.py (#18292) · b5169527
Yih-Dar authored Jul 26, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
b5169527
Adding type hints of TF:OpenAIGPT (#18263) · f374d391
Tom Mathews authored Jul 26, 2022

f374d391
Adding type hints of TF:CTRL (#18264) · 5bb211be
Tom Mathews authored Jul 26, 2022

5bb211be
Replace false parameter by a buffer (#18259) · c8ed1b8b
Sylvain Gugger authored Jul 26, 2022

c8ed1b8b

Fix ORTTrainer failure on gpt2 fp16 training (#18017) · 2844c5de

Jingya HUANG authored Jul 26, 2022



* Ensure value and attn weights have the same dtype

* Remove prints

* Modify decision transformers copied from gpt2

* Nit device
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Fix style
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2844c5de

Add ViltForTokenClassification e.g. for Named-Entity-Recognition (NER) (#17924) · 2b096508

gilad19 authored Jul 26, 2022



* Add ViltForTokenClassification e.g. for Named-Entity-Recognition (NER)

* Add ViltForTokenClassification e.g. for Named-Entity-Recognition (NER)

* provide classifier only text hidden states

* add test_for_token_classification

* Update src/transformers/models/vilt/modeling_vilt.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/vilt/modeling_vilt.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/vilt/modeling_vilt.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/vilt/modeling_vilt.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* add test_for_token_classification
Co-authored-by: gfuchs <gfuchs@ebay.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

2b096508

Owlvit docs test (#18257) · 002915aa

Alara Dirik authored Jul 26, 2022

* fix docs and add owlvit docs test

* fix minor bug in post_process, add to processor

* improve owlvit code examples

* fix hardcoded image size

002915aa

Good difficult issue override for the stalebot (#18094) · d32558cc
Lysandre Debut authored Jul 26, 2022

d32558cc