- 05 Aug, 2022 10 commits
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Move cache folder to just huggingface * Thank you VsCode for this needless import * Move to hub * Forgot one
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Draft new cached_file * Initial draft for config and model * Small fixes * Fix first batch of tests * Look in cache when internet is down * Fix last tests * Bad black, not fixing all quality errors * Make diff less * Implement change for TF and Flax models * Add tokenizer and feature extractor * For compatibility with main * Add utils to move the cache and auto-do it at first use. * Quality * Deal with empty commit shas * Deal with empty etag * Address review comments
-
Sylvain Gugger authored
* Fix pipeline tests * Make sure all pipelines tests run with init changes
-
Sylvain Gugger authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Seunghwan Hong authored
* Refactor `TFSwinLayer` to increase serving compatibility Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr> * Fix missed parameters while refactoring Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr> * Fix window_reverse to calculate batch size Signed-off-by: Seunghwan Hong <harrydrippin@gmail.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Seunghwan Hong authored
Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr>
-
Nicolas Patry authored
* Adding a better error message when the model is improperly configured within transformers. * Update src/transformers/pipelines/__init__.py * Black version. * Overriding task aliases so that tokenizer+feature_extractor values are correct. * Fixing task aliases by overriding their names early * X. * Fixing feature-extraction. * black again. * Normalizing `translation` too. * Fixing last few corner cases. translation needs to use its non-normalized name (translation_XX_to_YY, so that the task_specific_params are correctly overloaded). This can be removed and cleaned up in a later PR. `speech-encode-decoder` actually REQUIRES a `tokenizer` to be passed manually, so the error needs to be discarded when the `tokenizer` is already there. * doc-builder fix. * Fixing the real issue. * Removing dead code. * Do not import the actual config classes.
-
- 04 Aug, 2022 10 commits
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Kian Sierra McGettigan authored
* swag_no_trainer updated to use gather_for_metrics * Removed unused variable samples_seen * Updated examples with gather_for_metrics
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
* First draft * Add VideoMAEForVideoClassification * Improve conversion script * Add VideoMAEForPreTraining * Add VideoMAEFeatureExtractor * Improve VideoMAEFeatureExtractor * Improve docs * Add first draft of model tests * Improve VideoMAEForPreTraining * Fix base_model_prefix * Make model take pixel_values of shape (B, T, C, H, W) * Add loss computation of VideoMAEForPreTraining * Improve tests * Improve model tests * Make all tests pass * Add VideoMAE to main README * Add tests for VideoMAEFeatureExtractor * Add integration test * Improve conversion script * Rename patch embedding class * Remove VideoMAELayer from init * Update design of patch embeddings * Improve comments * Improve conversion script * Improve conversion script * Add conversion of pretrained model * Add loss verification of pretrained model * Add loss verification of unnormalized targets * Add integration test for pretraining model * Apply suggestions from code review * Fix bug to make feature extractor resize only shorter edge * Address more comments * Improve normalization of videos * Add doc examples * Move constants to dedicated script * Remove scripts * Transfer checkpoints, fix docs * Update script * Update image mean and std * Fix doc tests * Set return_tensors to NumPy by default * Revert the previous change Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
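A minimal usage sketch of the (B, T, C, H, W) input contract described above; the checkpoint name is an assumption, not part of this changelog:

```python
import torch
from transformers import VideoMAEForVideoClassification

# Assumed checkpoint; any VideoMAE video-classification checkpoint works the same way.
model = VideoMAEForVideoClassification.from_pretrained(
    "MCG-NJU/videomae-base-finetuned-kinetics"
)
# One video: batch=1, 16 frames, 3 channels, 224x224 pixels -> (B, T, C, H, W)
pixel_values = torch.randn(1, 16, 3, 224, 224)
with torch.no_grad():
    logits = model(pixel_values=pixel_values).logits
print(model.config.id2label[int(logits.argmax(-1))])
```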
-
Thomas Wang authored
-
Sylvain Gugger authored
-
Kian Sierra McGettigan authored
* swag_no_trainer updated to use gather_for_metrics * Removed unused variable samples_seen
-
Michael Benayoun authored
* Enable HFTracer to trace with custom dummy inputs instead of pre-computed ones * Add HFTracer.trace docstring, and make it possible to handle callable and torch.nn.Module in general * Remove pdb comment * Apply suggestions
-
nlpcat authored
* change shape to support dynamic batch input in tf.generate * add tests Co-authored-by: nlpcatcode <nlpcodecat@gmail.com>
-
Thomas Wang authored
* Cleanup some code * Improve signatures * Try to reduce the number of reshape/copies * I don't think we actually need the layer_num scaling trick * No need for duplication * Try to fix beam_search * Fix beam search * Removing layer num normalization seems to be breaking * Not sure self.layer_number normalization actually matters * Try and be backward compatible * Try to fix beam_search * Revert attempt to be backward compatible * Improve documentation on past_key_values format * Optimize the device allocation in case of hidden_states in multiple devices * No need to manually cast the values to a specific device * Rename with long version of variables * Improve type hinting * Add comment that explains that some methods return views * Actually I think the attention casting only makes sense when we use torch.float16 * We don't actually need layer_number to be passed anymore * Fix FX test * Bypass torch.baddbmm * Apply suggestions from code review * Add comment about support for TorchScript v1.11 * fix ONNX support for bloom (#18456) Co-authored-by: Niklas Muennighoff <n.muennighoff@gmail.com> Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>
-
- 03 Aug, 2022 10 commits
-
LSinev authored
Comparisons like `version.parse(torch.__version__) > version.parse("1.6")` are True for torch==1.6.0+cu101 or torch==1.6.0+cpu. `version.parse(version.parse(torch.__version__).base_version)` is preferred (and available in pytorch_utils.py).
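A minimal sketch of the difference, using `packaging.version` (the same `version.parse` referenced above):

```python
# "+cu101"-style local version segments sort after the plain release, so naive
# comparisons against "1.6" succeed for any 1.6.0 build; base_version strips them.
from packaging import version

v = version.parse("1.6.0+cu101")
print(v > version.parse("1.6"))                               # True
print(v.base_version)                                         # 1.6.0
print(version.parse(v.base_version) > version.parse("1.6"))   # False
```
-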
Sayak Paul authored
* fix: keras fit tests for segformer tf and minor refactors. * refactor: test_keras_fit to make it simpler using the existing one. * fix: styling issues.
-
Alara Dirik authored
-
Daniel Suess authored
* Fix failing test_xla_generate_slow tests * Fix failing speech-to-text xla_generate tests
-
Omar Sanseviero authored
* Update pinned huggingface_hub version * Make style
-
Ritik Nandwal authored
* Update no_trainer script for image-classification * Update no_trainer scripts for language-modeling examples * Remove unused variable * Removing truncation from losses array for language modeling examples
-
Ian Castillo authored
* Add file in spanish docs to be translated * Translate first two sections to Spanish * Translate four additional sections to Spanish * Finish translation to Spanish * Improve writing style in Spanish * Add suggested changes from reviewer
-
Gary Miguel authored
* support ONNX export of XDropout in deberta{,_v2} * black * copy to sew_d * add test * isort * use pytest.mark.filterwarnings * review comments
-
Steven Liu authored
This PR moves GroupViT and LXMert to their correct sections. As pointed out by @NielsRogge and @LysandreJik, GroupViT and LXMert are both multimodal models.
-
Sourab Mangrulkar authored
-
- 02 Aug, 2022 10 commits
-
Christopher Akiki authored
The current wording makes it sound as if the programming languages are part of the 46 natural languages.
-
David authored
* Update pipeline word heuristic to work with whitespace in token offsets This change checks for whitespace in the input string at either the character preceding the token or in the first character of the token. This works with tokenizers that return offsets excluding whitespace between words or with offsets including whitespace. fixes #18111 starting * Use smaller model, ensure expected tokenization * Re-run CI (please squash)
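A minimal sketch of that heuristic (the helper name is an assumption, not the pipeline's actual internals):

```python
def token_starts_word(text: str, start: int) -> bool:
    """True if the character preceding the token's offset, or the token's first
    character, is whitespace; this handles tokenizers whose offsets exclude the
    whitespace between words as well as those whose offsets include it."""
    if start == 0:
        return True
    return text[start - 1].isspace() or text[start].isspace()

print(token_starts_word("hello world", 6))  # True: preceded by a space
print(token_starts_word("hello world", 5))  # True: token starts with the space
print(token_starts_word("hello world", 3))  # False: mid-word
```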
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
João Lages authored
* improve generate docstring * Remove 'defaults to None' comment
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Alara Dirik authored
* update maskformer docs * fix typo
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Piotr Dabkowski authored
`torch.Tensor` creates an uninitialized tensor (as via `torch.empty`); this leads to nondeterministic behavior, poor initialization, and NaNs with an unlucky init. The paper does not specify the initialization for bias terms, so zero seems like a good choice: no bias initially. In practice the uninitialized memory is usually zero-filled, so this fix will be close to the intended behavior: ``` >>> torch.Tensor(100, 100).sum() tensor(0.) >>> torch.Tensor(100, 100).sum() tensor(nan) >>> torch.Tensor(100, 100).sum() tensor(0.) ```
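A minimal sketch of the fix under that reasoning (the parameter name is illustrative, not the actual model code):

```python
import torch
from torch import nn

hidden_size = 100

# Before: nn.Parameter(torch.Tensor(hidden_size)) left the bias uninitialized
# (occasionally NaN). After: explicit zeros give a deterministic "no bias
# initially" start, matching the zero-bias choice described above.
bias = nn.Parameter(torch.zeros(hidden_size))
assert bias.sum().item() == 0.0
```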
-