Commits · 131e258411179d84358f4b28c3ed90fd65f8ba4a · chenpangpang / transformers

07 Feb, 2022 9 commits

Fix TF T5/LED missing cross attn in retrun values (#15511) · 131e2584

Yih-Dar authored Feb 07, 2022



* add cross attn to outputs

* add cross attn to outputs for TFLED

* add undo padding

* remove unused import

* fix style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

131e2584

Remove Longformers from ONNX-supported models (#15273) · 6775b211
lewtun authored Feb 07, 2022

6775b211

Wav2Vec2 models must either throw or deal with add_apater (#15409) · 7a1412e1

François REMY authored Feb 07, 2022



* Wav2Vec2 models must either throw or deal with add_apater
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Add pre-add_adapter backwards compatibility

* Add pre-add_adapter backwards compatibility

* Fix issue in tests/test_modeling_wav2vec2.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

7a1412e1

Add ASR CTC streaming example (#15309) · a459f7f9

Anton Lozhkov authored Feb 07, 2022



* Single-epoch run

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Infinite dataset

* Trainer fix + distributed benchmark

* Benchmark fix

* unused import

* interleaved splits

* interleaved splits

* has_length util

* Move to research projects

* Leftover Sized checks

* Bump min version

* Unused import

* Revert trainer changes
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a459f7f9

[Trainer] Deeper length checks for IterableDatasetShard (#15539) · 75b13f82

Anton Lozhkov authored Feb 07, 2022



* Unused import

* Make `has_length()` torch-independent to use in callbacks

* Update src/transformers/trainer_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

75b13f82

Add ConvNeXT (#15277) · 84eec9e6

NielsRogge authored Feb 07, 2022



* First draft

* Add conversion script

* Improve conversion script

* Improve docs and implement tests

* Define model output class

* Fix tests

* Fix more tests

* Add model to README

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply more suggestions from code review

* Apply suggestions from code review

* Rename dims to hidden_sizes

* Fix equivalence test

* Rename gamma to gamma_parameter

* Clean up conversion script

* Add ConvNextFeatureExtractor

* Add corresponding tests

* Implement feature extractor correctly

* Make implementation cleaner

* Add ConvNextStem class

* Improve design

* Update design to also include encoder

* Fix gamma parameter

* Use sample docstrings

* Finish conversion, add center cropping

* Replace nielsr by facebook, make feature extractor tests smaller

* Fix integration test

Co-authored-by: Sylvain Gugger <35901082+...

84eec9e6

[torch_int_div] Correct true division in generation (#15498) · c47d2592
Patrick von Platen authored Feb 07, 2022
```
* [torch_int_div] Correct true division in generation

* up

* up
```
c47d2592
[ASR pipeline] correct asr pipeline for seq2seq models (#15541) · 5f1918a4
Patrick von Platen authored Feb 07, 2022

5f1918a4
Revert "Handle PyTorch to Flax conversion of 1D convolutions (#15519)" (#15540) · e02bdce7
Patrick von Platen authored Feb 07, 2022
```
This reverts commit 854a0d52.
```
e02bdce7

04 Feb, 2022 6 commits

[deepspeed docs] DeepSpeed ZeRO Inference (#15486) · 8ce13306

Stas Bekman authored Feb 04, 2022



* [deepspeed docs] DeepSpeed ZeRO Inference

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* tweak

* deal with black

* extra cleanup, better comments
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8ce13306

Standardize semantic segmentation models outputs (#15469) · ac6aa10f

Sylvain Gugger authored Feb 04, 2022



* Standardize instance segmentation models outputs

* Rename output

* Update src/transformers/modeling_outputs.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add legacy argument to the config and model forward

* Update src/transformers/models/beit/modeling_beit.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Copy fix in Segformer
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

ac6aa10f

[deepspeed docs] Megatron-Deepspeed info (#15488) · 31be2f45
Stas Bekman authored Feb 04, 2022

31be2f45

Fix TFRemBertEncoder all_hidden_states (#15510) · bbe9c698

Yih-Dar authored Feb 04, 2022



* fix

* fix test

* remove expected_num_hidden_layers
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

bbe9c698

Handle PyTorch to Flax conversion of 1D convolutions (#15519) · 854a0d52
Sanchit Gandhi authored Feb 04, 2022

854a0d52
use kwargs (#15509) · 486260c6
Yih-Dar authored Feb 04, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
486260c6

03 Feb, 2022 8 commits
- Remove loss from some flax models docs & examples (#15492) · 525dbbf8
  Yih-Dar authored Feb 03, 2022
```
* Remove return_loss from Flax models

* fix more

* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  525dbbf8
- [deepspeed docs] memory requirements (#15506) · 21dcaec5
  Stas Bekman authored Feb 03, 2022
  
  21dcaec5
- [WIP] Add preprocess_logits_for_metrics Trainer param (#15473) · f1a4c4ea
  davidleonfdez authored Feb 03, 2022
```
* Add preprocess_logits_for_metrics Trainer param

* Compute accuracy in LM examples

* Improve comments
```
  f1a4c4ea
- [deepspeed] fix a bug in a test (#15493) · 4f5faaf0
  Stas Bekman authored Feb 03, 2022
```
* [deepspeed] fix a bug in a test

* consistency
```
  4f5faaf0
- Add general vision docstrings (#15501) · 90166121
  NielsRogge authored Feb 03, 2022
```
* Add general docstrings

* Remove legacy docstrings

* Add BEiT

* Add DEiT

* Add SegFormer

* Fix beit output class

* Fix missing return_dict
```
  90166121
- [Flax tests] Disable scheduled GPU tests (#15503) · e2b6e73f
  Patrick von Platen authored Feb 03, 2022
  
  e2b6e73f
- fix load_weight_prefix (#15101) · f5d98da2
  Yih-Dar authored Feb 03, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  f5d98da2
- fix (#15494) · 71dccd07
  Yih-Dar authored Feb 03, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  71dccd07
02 Feb, 2022 12 commits

Correct eos_token_id settings in generate (#15403) · 5ec368d7

CHI LIU authored Feb 03, 2022

* Correct eos_token_id set in generate

* Set eos_token_id in test

* Correct eos_token_id set in generate

* Set eos_token_id in test

5ec368d7

fix set truncation attribute in `__init__` of `PreTrainedTokenizerBase` (#15456) · 39b5d1a6

SaulLu authored Feb 02, 2022



* change truncation_side in init of `PreTrainedTokenizerBase`
Co-authored-by: LSinev <LSinev@users.noreply.github.com>

* add test

* Revert "replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`"

This reverts commit 7a98b87962d2635c7e4d4f00db3948b694624843.

* fix kwargs

* Revert "fix kwargs"

This reverts commit 67b0a5270e8cf1dbf70e6b0232e94c0452b6946f.

* Update tests/test_tokenization_common.py
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* delete truncation_side variable

* reorganize test

* format

* complete doc

* Revert "Revert "replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`""

This reverts commit d5a10a7e2680539e5d9e98ae5d896c893d224b80.

* fix typo

* fix typos to render documentation

* Revert "Revert "Revert "replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`"""

This reverts commit 16cf58811943a08f43409a7c83eaa330686591d0.

* format
Co-authored-by: LSinev <LSinev@users.noreply.github.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

39b5d1a6

Fix labels stored in model config for token classification examples (#15482) · 45cac3fa
Sylvain Gugger authored Feb 02, 2022
```
* Playing

* Properly set labels in model config for token classification example

* Port to run_ner_no_trainer

* Quality
```
45cac3fa

Add W&B backend for hyperparameter sweep (#14582) · c74f3d4c

Ayush Chaurasia authored Feb 03, 2022

# Add support for W&B hyperparameter sweep
This PR:
* allows using wandb for running hyperparameter search.
* The runs are visualized on W&B sweeps dashboard
* This supports runnning sweeps on parallel devices, all reporting to the same central dashboard.

### Usage
**To run new a hyperparameter search:**
```
trainer.hyperparameter_search(
    backend="wandb", 
    project="transformers_sweep", # name of the project
    n_trials=5,
    metric="eval/loss", # metric to be optimized, default 'eval/loss'. A warning is raised if the passed metric is not found
)
```
This outputs a sweep id. Eg. `my_project/sweep_id`

**To run sweeps on parallel devices:**
Just pass sweep id which you want to run parallel
```
trainer.hyperparameter_search(
    backend="wandb", 
    sweep_id = "my_project/sweep_id"
)
```

c74f3d4c

Fic docstring of ASR pipeline (#15481) · 13297ac7
Sylvain Gugger authored Feb 02, 2022

13297ac7

fix error posted in issue #15448 (#15480) · dd360d58

bugface authored Feb 02, 2022



* fix error posted in issue #15448
Signed-off-by: bugface <alexgre@ufl.edu>

* clean up - remove commented line
Signed-off-by: bugface <alexgre@ufl.edu>

dd360d58

Save code of registered custom models (#15379) · 44b21f11

Sylvain Gugger authored Feb 02, 2022



* Allow dynamic modules to use relative imports

* Work for configs

* Fix last merge conflict

* Save code of registered custom objects

* Map strings to strings

* Fix test

* Add tokenizer

* Rework tests

* Tests

* Ignore fixtures py files for tests

* Tokenizer test + fix collection

* With full path

* Rework integration

* Fix typo

* Remove changes in conftest

* Test for tokenizers

* Add documentation

* Update docs/source/custom_models.mdx
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Add file structure and file content

* Add more doc

* Style

* Update docs/source/custom_models.mdx
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Address review comments
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

44b21f11

Adding support for `microphone` streaming within pipeline. (#15046) · 623d8cb4

Nicolas Patry authored Feb 02, 2022



* Adding support for `microphone` streaming within pipeline.

- Uses `ffmpeg` to get microphone data.
- Makes sure alignment is made to `size_of_sample`.
- Works by sending `{"raw": ..data.., "stride": (n, left, right),
"partial": bool}`
directly to the pipeline enabling to stream partial results and still
get inference.
- Let's `partial` information flow through the pipeline to enable caller
  to get it back and choose to display text or not.

- The striding reconstitution is bound to have errors since CTC does not
keep previous state. Currently most of the errors are we don't know if
there's a space or not between two chunks.
Since we have some left striding info, we could use that during decoding
to choose what to do with those spaces and even extra letters maybe (if
the stride is long enough, it's bound to cover at least a few symbols)

Fixing tests.

Protecting with `require_torch`.

`raw_ctc` support for nicer demo.

Post rebase fixes.

Revamp to split raw_mic_data from it's live chunking.

- Requires a refactor to make everything a bit cleaner.

Automatic resampling.

Small fix.

Small fix.

* Post rebase fix (need to let super handle more logic, reorder args.)

* Update docstrings

* Docstring format.

* Remove print.

* Prevent flow of `input_values`.

* Fixing `stride` too.

* Fixing the PR by removing `raw_ctc`.

* Better docstrings.

* Fixing init.

* Update src/transformers/pipelines/audio_utils.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update tests/test_pipelines_automatic_speech_recognition.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Quality.
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

623d8cb4

[Wav2Vec2ProcessorWithLM] add alpha & beta to batch decode & decode (#15465) · d718c0c3
Patrick von Platen authored Feb 02, 2022

d718c0c3

Add option to resize like torchvision's Resize (#15419) · 1d94d575

NielsRogge authored Feb 02, 2022

* Add torchvision's resize

* Rename torch_resize to default_to_square

* Apply suggestions from code review

* Add support for default_to_square and tuple of length 1

1d94d575

Update tutorial docs (#15165) · b9418a1d

Steven Liu authored Feb 01, 2022

* first draft of pipeline, autoclass, preprocess tutorials

* apply review feedback

* 🖍 apply feedback from patrick/niels

* 📝add output image to preprocessed image

* 🖍 apply feedback from patrick

b9418a1d

Update fine-tune docs (#15259) · c157c7e3

Steven Liu authored Feb 01, 2022

* add fine-tune tutorial

* make edits, fix style

* 📝 make edits

* 🖍 fix code format links to external libraries

* 🔄revert code formatting

* 🖍 use DefaultDataCollator instead of DataCollatorWithPadding

c157c7e3

01 Feb, 2022 5 commits

Harder check for IndexErrors in QA scripts (#15438) · d0b5ed11
Sylvain Gugger authored Feb 01, 2022
```
* Harder check for IndexErrors in QA scripts

* Make test stronger
```
d0b5ed11
`Trainer.push_to_hub` always tries to push to the Hub (#15463) · 8e5d4e49
Sylvain Gugger authored Feb 01, 2022

8e5d4e49
[BartTokenizer] remove inheritance on RobertaTokenizer (#15461) · 37800f13
Suraj Patil authored Feb 01, 2022
```
* refactor bart tokenizers

* doc

* replace assert with ValueError
```
37800f13

use mean instead of elementwise_mean in XLMPredLayer (#15436) · f427e750

Yih-Dar authored Feb 01, 2022



* use mean instead of elementwise_mean

* make style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f427e750

fix the `tokenizer_config.json` file for the slow tokenizer when a fast... · 7b8bdd86

SaulLu authored Feb 01, 2022

fix the `tokenizer_config.json` file for the slow tokenizer when a fast version is available (#15319)

* add new test

* update test

* remove `tokenizer_file` from `additional_files_names` in `tokenization_utils_base.py`

* add `tokenizer_file` for the fast only tokenizer

* change global variables layoutxml

* remove `"tokenizer_file"` from DPR tokenizer's Global variables

* remove `tokenizer_file` from herbert slow tokenizer init

* `"tokenizer_file"` from LED tokenizer's Global variables

* remove `tokenizer_file` from mbart slow tokenizer init

* remove `tokenizer_file` from slow tokenizer template

* adapt to versioning

* adapt the `test_tokenizer_mismatch_warning` test

* clean test

* clarify `VOCAB_FILES_NAMES` in tokenization_utils_fast.py

* Revert "remove `tokenizer_file` from mbart slow tokenizer init"

This reverts commit 0dbb723fa9c7599d4640fe30b3647a74eb4a64e1.

* Revert "`"tokenizer_file"` from LED tokenizer's Global variables"

This reverts commit 5a3f879bdd651233f3d74a3d1146c34cde82b0c2.

* Revert "remove `tokenizer_file` from herbert slow tokenizer init"

This reverts commit f5e10007b7b0ec5345e015b9de7ffec72c5407fd.

* Revert "remove `"tokenizer_file"` from DPR tokenizer's Global variables"

This reverts commit da0895330bedfafc81ae3073470a9348c669f032.

* set `tokenizer_file` in super `__init__` of mbart

7b8bdd86