Commits · 7490a97cac20cef6858f32e5f39a61f31ad64552 · chenpangpang / transformers

27 Jul, 2022 15 commits

[Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c

Sanchit Gandhi authored Jul 27, 2022

* [Flax] Fix incomplete batches in example scripts

* fix dataloader batching

* convert jnp batch idxs to np array

* add missing `pad_shard_unpad` to final prediction generate step

* only `pad_shard_unpad` at inference time

* merge conflicts

* remove incomplete batch step from eval

* fix run_qa.py

* add `pad_shard_unpad` to run_flax_ner.py

* add `pad_shard_unpad` to run_flax_glue.py

* add `pad_shard_unpad` to run_image_classification.py

* make style

* fix mlm flax eval batches

* remove redundant imports

7490a97c

Owlvit test fixes (#18303) · 9caf68a6

Alara Dirik authored Jul 27, 2022

* fix owlvit test assertion errors

* fix gpu test error

* remove redundant lines

* fix styling

9caf68a6

Fix sacremoses sof dependency for Transformers XL (#18321) · 0077360d
Sylvain Gugger authored Jul 27, 2022
```
* Fix sacremoses sof dependency for Transofmers XL

* Add function to the submodule init
```
0077360d
sentencepiece shouldn't be required for the fast LayoutXLM tokenizer (#18320) · 5c5676cd
Lysandre Debut authored Jul 27, 2022

5c5676cd
Remove all uses of six (#18318) · cf32b2ee
Sylvain Gugger authored Jul 27, 2022
```
* Remove all uses of six

* fix quality
```
cf32b2ee
Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
Duong A. Nguyen authored Jul 27, 2022
```
* generalize decay_mask_fn to find all layernorm params

* fixup

* generalising decay_mask_fn
```
170fcaa6
fix loading from pretrained for sharded model with `torch_dtype="auto" (#18061) · 83d2d745
Nouamane Tazi authored Jul 27, 2022

83d2d745
fix module order (#18312) · 7996ef74
Younes Belkada authored Jul 27, 2022
```
- put gelu before 4h to h
```
7996ef74

Fixes torch jit tracing for LayoutLMv2 model (re-open) (#18313) · 70e7d1d6

Mikkel Denker authored Jul 27, 2022

* Fixes torch jit tracing for LayoutLMv2 model.
Pytorch seems to reuse memory for input_shape which caused a mismatch in shapes later in the forward pass.

* Fixed code quality

* avoid unneeded allocation of vector for shape

70e7d1d6

Update CodeParrot readme to include training in Megatron (#17798) · 1d71ad89

Loubna Ben Allal authored Jul 27, 2022



* add info about megatron training

* upload models and datasets from CodeParrot organization

* upload models and datasets from CodeParrot organization

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* fix typo and add comment about codeparrot vs megatron
Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

1d71ad89

[XLA] Improve t5 model performance (#18288) · d5610b53
Yanming Wang authored Jul 27, 2022

d5610b53
Apply type correction to `TFSwinModelOutput` (#18295) · e318cda9
Seunghwan Hong authored Jul 27, 2022
```
Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr>
```
e318cda9

[EncoderDecoder] Improve docs (#18271) · ccd4180f

NielsRogge authored Jul 27, 2022



* Improve docs

* Improve docs of speech one as well

* Apply suggestions from code review
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

ccd4180f

Remove duplicated line (#18310) · 5dfec704

Manuel R. Ciosici authored Jul 27, 2022

Removes a duplicated instantiation of device. I removed the second instance of the line to maintain code alignment with the GPT-J implementation of forward.

5dfec704

[DETR] Improve code examples (#18262) · 47c2af09

NielsRogge authored Jul 27, 2022



* Improve doc test

* Improve code example of segmentation model

* Apply suggestion

* Update src/transformers/models/detr/modeling_detr.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

47c2af09

26 Jul, 2022 20 commits

patch for smddp import (#18244) · ee67e7ad
Carolyn Wang authored Jul 26, 2022
```
* add import

* format
```
ee67e7ad

Fix Sylvain's nits on the original KerasMetricCallback PR (#18300) · 68097dcc

Matt authored Jul 26, 2022



* Fix Sylvain's nits on the original PR

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Re-add "optional" to docstring
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

68097dcc

Add PYTEST_TIMEOUT for CircleCI test jobs (#18251) · 66491331
Yih-Dar authored Jul 26, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
66491331

Add Spanish translation of custom_models.mdx (#17807) · a5d50483

Ian Castillo authored Jul 26, 2022

* Update index

* Translate to Spanish two sections from custom_models

* Translate to Spanish custom models documentation

* Fixing typos and grammatical errors

* Add requested changes from reviewer

a5d50483

Add Italian translation of sharing_custom_models.mdx (#17631) · 7ea7eba3

Federico Panero authored Jul 26, 2022



* work in progress: custom_models

* Update custom_models.mdx

* Update custom_models.mdx

* Update _toctree.yml

* Update _toctree.yml

* Update custom_models.mdx

* Update custom_models.mdx

* Update _toctree.yml

* Update _toctree.yml
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

7ea7eba3

Add PyTorch 1.11 to past CI (#18302) · c4c6b4db
Yih-Dar authored Jul 26, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
c4c6b4db

Add Italian translation of converting_tensorflow_models.mdx (#18283) · bbc28106

Federico Panero authored Jul 26, 2022



* Add Italian translation of converting_tensorflow_models.mdx

* Update _toctree.yml

* Update converting_tensorflow_models.mdx

* Update docs/source/it/_toctree.yml
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

bbc28106

Raise a TF-specific error when importing Torch classes (#18280) · a649de55

Matt authored Jul 26, 2022



* Raise a TF-specific error when importing Torch classes

* Update src/transformers/utils/import_utils.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Add an inverse error for PyTorch users
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

a649de55

[ create_a_model.mdx ] translate to pt (#18098) · 5e0ffd91

Fellip Silva Alves authored Jul 26, 2022



* [ fast_tokenizers.mdx ] - Added translation to portuguese to tutorial

* Delete docs/source/pt-br directory

* [ fast_tokenizers.mdx ] - Continuing work on file

* [ fast_tokenizers.mdx ] - Continuing work on file

* Add fast tokenizers to _toctree.yml

* Eliminated config and toctree.yml

* Nits in fast_tokenizers.mdx

* Finishing create_a_model

* [ create_a_model.mdx ] finishing create a model in pt-br

* [ Changing _toctree.yml ] adding create a model in pt
Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

5e0ffd91

Update translation.mdx (#18169) · f58b9c05
Gorkem Ozkaya authored Jul 26, 2022
```
* Update translation.mdx

* update translation.mdx by running make style
```
f58b9c05
Add TFAutoModelForImageClassification to pipelines.py (#18292) · b5169527
Yih-Dar authored Jul 26, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
b5169527
Adding type hints of TF:OpenAIGPT (#18263) · f374d391
Tom Mathews authored Jul 26, 2022

f374d391
Adding type hints of TF:CTRL (#18264) · 5bb211be
Tom Mathews authored Jul 26, 2022

5bb211be
Replace false parameter by a buffer (#18259) · c8ed1b8b
Sylvain Gugger authored Jul 26, 2022

c8ed1b8b

Fix ORTTrainer failure on gpt2 fp16 training (#18017) · 2844c5de

Jingya HUANG authored Jul 26, 2022



* Ensure value and attn weights have the same dtype

* Remove prints

* Modify decision transformers copied from gpt2

* Nit device
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Fix style
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2844c5de

Add ViltForTokenClassification e.g. for Named-Entity-Recognition (NER) (#17924) · 2b096508

gilad19 authored Jul 26, 2022



* Add ViltForTokenClassification e.g. for Named-Entity-Recognition (NER)

* Add ViltForTokenClassification e.g. for Named-Entity-Recognition (NER)

* provide classifier only text hidden states

* add test_for_token_classification

* Update src/transformers/models/vilt/modeling_vilt.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/vilt/modeling_vilt.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/vilt/modeling_vilt.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/vilt/modeling_vilt.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* add test_for_token_classification
Co-authored-by: gfuchs <gfuchs@ebay.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

2b096508

Owlvit docs test (#18257) · 002915aa

Alara Dirik authored Jul 26, 2022

* fix docs and add owlvit docs test

* fix minor bug in post_process, add to processor

* improve owlvit code examples

* fix hardcoded image size

002915aa

Good difficult issue override for the stalebot (#18094) · d32558cc
Lysandre Debut authored Jul 26, 2022

d32558cc
Fix dtype of input_features in docstring (#18258) · f65307e4
Yih-Dar authored Jul 26, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f65307e4
Fix command of doc tests for local testing (#18236) · bd87480d
Raghavan authored Jul 26, 2022
```
* Fix command of doc tests for local testing

* Fix command for after running doc tests locally
```
bd87480d

25 Jul, 2022 3 commits
- Fix TF bad words filter with XLA (#18286) · 45a14754
  Matt authored Jul 25, 2022
```
* Fix bad words filter in XLA generation

* Remove my cool debug breakpoints (again)
```
  45a14754
- Allows `KerasMetricCallback` to use XLA generation (#18265) · f4e17271
  Matt authored Jul 25, 2022
```
* Allows `KerasMetricCallback` to use XLA generation

* make fixup

* Slightly reword docstring
```
  f4e17271
- Skip passes report for `--make-reports` (#18250) · bbb62f29
  Yih-Dar authored Jul 25, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  bbb62f29
23 Jul, 2022 1 commit
- Generate: deprecate default `max_length` (#18018) · 7e44226f
  Joao Gante authored Jul 23, 2022
  
  7e44226f
22 Jul, 2022 1 commit

Update serving code to enable `saved_model=True` (#18153) · 8e838466

amyeroberts authored Jul 22, 2022



* Add serving_output and serving methods to some vision models

* Add serving outputs for DeiT

* Don't convert hidden states - differing shapes

* Make saveable

* Fix up

* Make swin saveable

* Add in tests

* Fix funnel tests (can't convert to tensor)

* Fix numpy call

* Tidy up a bit

* Add in hidden states - resnet

* Remove numpy

* Fix failing tests - tensor shape and skipping tests

* Remove duplicated function

* PR comments - formatting and var names

* PR comments
Add suggestions made by Joao Gante:
* Use tf.shape instead of shape_list
* Use @tooslow decorator on tests
* Simplify some of the logic

* PR comments
Address Yih-Dar Sheih comments - making tensor names consistent and make types float

* Types consistent with docs; disable test on swin (slow)

* CI trigger

* Change input_features to float32

* Add serving_output for segformer

* Fixup
Co-authored-by: Amy Roberts <amyeroberts@users.noreply.github.com>

8e838466