Commits · ac6aa10f23967373142d7a23d84a45ffd494d64b · chenpangpang / transformers

04 Feb, 2022 5 commits

Standardize semantic segmentation models outputs (#15469) · ac6aa10f

Sylvain Gugger authored Feb 04, 2022



* Standardize instance segmentation models outputs

* Rename output

* Update src/transformers/modeling_outputs.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add legacy argument to the config and model forward

* Update src/transformers/models/beit/modeling_beit.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Copy fix in Segformer
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

ac6aa10f

[deepspeed docs] Megatron-Deepspeed info (#15488) · 31be2f45
Stas Bekman authored Feb 04, 2022

31be2f45

Fix TFRemBertEncoder all_hidden_states (#15510) · bbe9c698

Yih-Dar authored Feb 04, 2022



* fix

* fix test

* remove expected_num_hidden_layers
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

bbe9c698

Handle PyTorch to Flax conversion of 1D convolutions (#15519) · 854a0d52
Sanchit Gandhi authored Feb 04, 2022

854a0d52
use kwargs (#15509) · 486260c6
Yih-Dar authored Feb 04, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
486260c6

03 Feb, 2022 8 commits
- Remove loss from some flax models docs & examples (#15492) · 525dbbf8
  Yih-Dar authored Feb 03, 2022
```
* Remove return_loss from Flax models

* fix more

* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  525dbbf8
- [deepspeed docs] memory requirements (#15506) · 21dcaec5
  Stas Bekman authored Feb 03, 2022
  
  21dcaec5
- [WIP] Add preprocess_logits_for_metrics Trainer param (#15473) · f1a4c4ea
  davidleonfdez authored Feb 03, 2022
```
* Add preprocess_logits_for_metrics Trainer param

* Compute accuracy in LM examples

* Improve comments
```
  f1a4c4ea
- [deepspeed] fix a bug in a test (#15493) · 4f5faaf0
  Stas Bekman authored Feb 03, 2022
```
* [deepspeed] fix a bug in a test

* consistency
```
  4f5faaf0
- Add general vision docstrings (#15501) · 90166121
  NielsRogge authored Feb 03, 2022
```
* Add general docstrings

* Remove legacy docstrings

* Add BEiT

* Add DEiT

* Add SegFormer

* Fix beit output class

* Fix missing return_dict
```
  90166121
- [Flax tests] Disable scheduled GPU tests (#15503) · e2b6e73f
  Patrick von Platen authored Feb 03, 2022
  
  e2b6e73f
- fix load_weight_prefix (#15101) · f5d98da2
  Yih-Dar authored Feb 03, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  f5d98da2
- fix (#15494) · 71dccd07
  Yih-Dar authored Feb 03, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  71dccd07
02 Feb, 2022 12 commits

Correct eos_token_id settings in generate (#15403) · 5ec368d7

CHI LIU authored Feb 03, 2022

* Correct eos_token_id set in generate

* Set eos_token_id in test

* Correct eos_token_id set in generate

* Set eos_token_id in test

5ec368d7

fix set truncation attribute in `__init__` of `PreTrainedTokenizerBase` (#15456) · 39b5d1a6

SaulLu authored Feb 02, 2022



* change truncation_side in init of `PreTrainedTokenizerBase`
Co-authored-by: LSinev <LSinev@users.noreply.github.com>

* add test

* Revert "replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`"

This reverts commit 7a98b87962d2635c7e4d4f00db3948b694624843.

* fix kwargs

* Revert "fix kwargs"

This reverts commit 67b0a5270e8cf1dbf70e6b0232e94c0452b6946f.

* Update tests/test_tokenization_common.py
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* delete truncation_side variable

* reorganize test

* format

* complete doc

* Revert "Revert "replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`""

This reverts commit d5a10a7e2680539e5d9e98ae5d896c893d224b80.

* fix typo

* fix typos to render documentation

* Revert "Revert "Revert "replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`"""

This reverts commit 16cf58811943a08f43409a7c83eaa330686591d0.

* format
Co-authored-by: LSinev <LSinev@users.noreply.github.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

39b5d1a6

Fix labels stored in model config for token classification examples (#15482) · 45cac3fa
Sylvain Gugger authored Feb 02, 2022
```
* Playing

* Properly set labels in model config for token classification example

* Port to run_ner_no_trainer

* Quality
```
45cac3fa

Add W&B backend for hyperparameter sweep (#14582) · c74f3d4c

Ayush Chaurasia authored Feb 03, 2022

# Add support for W&B hyperparameter sweep
This PR:
* allows using wandb for running hyperparameter search.
* The runs are visualized on W&B sweeps dashboard
* This supports runnning sweeps on parallel devices, all reporting to the same central dashboard.

### Usage
**To run new a hyperparameter search:**
```
trainer.hyperparameter_search(
    backend="wandb", 
    project="transformers_sweep", # name of the project
    n_trials=5,
    metric="eval/loss", # metric to be optimized, default 'eval/loss'. A warning is raised if the passed metric is not found
)
```
This outputs a sweep id. Eg. `my_project/sweep_id`

**To run sweeps on parallel devices:**
Just pass sweep id which you want to run parallel
```
trainer.hyperparameter_search(
    backend="wandb", 
    sweep_id = "my_project/sweep_id"
)
```

c74f3d4c

Fic docstring of ASR pipeline (#15481) · 13297ac7
Sylvain Gugger authored Feb 02, 2022

13297ac7

fix error posted in issue #15448 (#15480) · dd360d58

bugface authored Feb 02, 2022



* fix error posted in issue #15448
Signed-off-by: bugface <alexgre@ufl.edu>

* clean up - remove commented line
Signed-off-by: bugface <alexgre@ufl.edu>

dd360d58

Save code of registered custom models (#15379) · 44b21f11

Sylvain Gugger authored Feb 02, 2022



* Allow dynamic modules to use relative imports

* Work for configs

* Fix last merge conflict

* Save code of registered custom objects

* Map strings to strings

* Fix test

* Add tokenizer

* Rework tests

* Tests

* Ignore fixtures py files for tests

* Tokenizer test + fix collection

* With full path

* Rework integration

* Fix typo

* Remove changes in conftest

* Test for tokenizers

* Add documentation

* Update docs/source/custom_models.mdx
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Add file structure and file content

* Add more doc

* Style

* Update docs/source/custom_models.mdx
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Address review comments
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

44b21f11

Adding support for `microphone` streaming within pipeline. (#15046) · 623d8cb4

Nicolas Patry authored Feb 02, 2022



* Adding support for `microphone` streaming within pipeline.

- Uses `ffmpeg` to get microphone data.
- Makes sure alignment is made to `size_of_sample`.
- Works by sending `{"raw": ..data.., "stride": (n, left, right),
"partial": bool}`
directly to the pipeline enabling to stream partial results and still
get inference.
- Let's `partial` information flow through the pipeline to enable caller
  to get it back and choose to display text or not.

- The striding reconstitution is bound to have errors since CTC does not
keep previous state. Currently most of the errors are we don't know if
there's a space or not between two chunks.
Since we have some left striding info, we could use that during decoding
to choose what to do with those spaces and even extra letters maybe (if
the stride is long enough, it's bound to cover at least a few symbols)

Fixing tests.

Protecting with `require_torch`.

`raw_ctc` support for nicer demo.

Post rebase fixes.

Revamp to split raw_mic_data from it's live chunking.

- Requires a refactor to make everything a bit cleaner.

Automatic resampling.

Small fix.

Small fix.

* Post rebase fix (need to let super handle more logic, reorder args.)

* Update docstrings

* Docstring format.

* Remove print.

* Prevent flow of `input_values`.

* Fixing `stride` too.

* Fixing the PR by removing `raw_ctc`.

* Better docstrings.

* Fixing init.

* Update src/transformers/pipelines/audio_utils.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update tests/test_pipelines_automatic_speech_recognition.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Quality.
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

623d8cb4

[Wav2Vec2ProcessorWithLM] add alpha & beta to batch decode & decode (#15465) · d718c0c3
Patrick von Platen authored Feb 02, 2022

d718c0c3

Add option to resize like torchvision's Resize (#15419) · 1d94d575

NielsRogge authored Feb 02, 2022

* Add torchvision's resize

* Rename torch_resize to default_to_square

* Apply suggestions from code review

* Add support for default_to_square and tuple of length 1

1d94d575

Update tutorial docs (#15165) · b9418a1d

Steven Liu authored Feb 01, 2022

* first draft of pipeline, autoclass, preprocess tutorials

* apply review feedback

* 🖍 apply feedback from patrick/niels

* 📝add output image to preprocessed image

* 🖍 apply feedback from patrick

b9418a1d

Update fine-tune docs (#15259) · c157c7e3

Steven Liu authored Feb 01, 2022

* add fine-tune tutorial

* make edits, fix style

* 📝 make edits

* 🖍 fix code format links to external libraries

* 🔄revert code formatting

* 🖍 use DefaultDataCollator instead of DataCollatorWithPadding

c157c7e3

01 Feb, 2022 11 commits

Harder check for IndexErrors in QA scripts (#15438) · d0b5ed11
Sylvain Gugger authored Feb 01, 2022
```
* Harder check for IndexErrors in QA scripts

* Make test stronger
```
d0b5ed11
`Trainer.push_to_hub` always tries to push to the Hub (#15463) · 8e5d4e49
Sylvain Gugger authored Feb 01, 2022

8e5d4e49
[BartTokenizer] remove inheritance on RobertaTokenizer (#15461) · 37800f13
Suraj Patil authored Feb 01, 2022
```
* refactor bart tokenizers

* doc

* replace assert with ValueError
```
37800f13

use mean instead of elementwise_mean in XLMPredLayer (#15436) · f427e750

Yih-Dar authored Feb 01, 2022



* use mean instead of elementwise_mean

* make style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f427e750

fix the `tokenizer_config.json` file for the slow tokenizer when a fast... · 7b8bdd86

SaulLu authored Feb 01, 2022

fix the `tokenizer_config.json` file for the slow tokenizer when a fast version is available (#15319)

* add new test

* update test

* remove `tokenizer_file` from `additional_files_names` in `tokenization_utils_base.py`

* add `tokenizer_file` for the fast only tokenizer

* change global variables layoutxml

* remove `"tokenizer_file"` from DPR tokenizer's Global variables

* remove `tokenizer_file` from herbert slow tokenizer init

* `"tokenizer_file"` from LED tokenizer's Global variables

* remove `tokenizer_file` from mbart slow tokenizer init

* remove `tokenizer_file` from slow tokenizer template

* adapt to versioning

* adapt the `test_tokenizer_mismatch_warning` test

* clean test

* clarify `VOCAB_FILES_NAMES` in tokenization_utils_fast.py

* Revert "remove `tokenizer_file` from mbart slow tokenizer init"

This reverts commit 0dbb723fa9c7599d4640fe30b3647a74eb4a64e1.

* Revert "`"tokenizer_file"` from LED tokenizer's Global variables"

This reverts commit 5a3f879bdd651233f3d74a3d1146c34cde82b0c2.

* Revert "remove `tokenizer_file` from herbert slow tokenizer init"

This reverts commit f5e10007b7b0ec5345e015b9de7ffec72c5407fd.

* Revert "remove `"tokenizer_file"` from DPR tokenizer's Global variables"

This reverts commit da0895330bedfafc81ae3073470a9348c669f032.

* set `tokenizer_file` in super `__init__` of mbart

7b8bdd86

replace assert with exception for padding_side arg in `PreTrainedTokenizerBase` `__init__` (#15454) · 6d585fe0

SaulLu authored Feb 01, 2022

* replace assert with exception for `padding_side` arg in `PreTrainedTokenizerBase` `__init__`

* add test

* fix kwargs

* reformat test

* format

* format

* fix typo to render the documentation

6d585fe0

Update README.md (#15462) · d2749cf7
Kamal Raj authored Feb 01, 2022
```
fix typo
```
d2749cf7
[M2M100, XGLM] fix positional emb resize (#15444) · 1c9648c4
Suraj Patil authored Feb 01, 2022

1c9648c4
fix from_vision_text_pretrained doc example (#15453) · 2ca62683
Yih-Dar authored Feb 01, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
2ca62683

Fix TF Causal LM models' returned logits (#15256) · dc05dd53

Yih-Dar authored Feb 01, 2022



* Fix TF Causal LM models' returned logits

* Fix expected shape in the tests
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

dc05dd53

remove "inputs" in tf common test script (no longer required) (#15262) · af5c3329
Yih-Dar authored Feb 01, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
af5c3329

31 Jan, 2022 4 commits
- [generate] fix synced_gpus default (#15446) · d12ae816
  Stas Bekman authored Jan 31, 2022
  
  d12ae816
- skip test for XGLM (#15445) · d4f201b8
  Suraj Patil authored Jan 31, 2022
  
  d4f201b8
- Error when group_by_length is used with an IterableDataset (#15437) · 0c17e766
  Sylvain Gugger authored Jan 31, 2022
  
  0c17e766
- Update modeling_wav2vec2.py (#15423) · 125a2882
  peregilk authored Jan 31, 2022
```
* Update modeling_wav2vec2.py

With very tiny sound files (less than 0.1 seconds) the num_masked_span can be too long. The issue is described in issue #15366 and discussed with @patrickvonplaten.

* correct errors with mask time indices

* remove bogus file

* make fix-copies
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  125a2882