- 29 Oct, 2021 10 commits
-
Lysandre authored
-
Lysandre Debut authored
* Torch 1.10 * torch scatter for 1.10 * style * Skip tests ok
-
Haram Lee authored
-
Nicolas Patry authored
* Fixing image segmentation for inference mode.
* Update src/transformers/pipelines/base.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Sylvain Gugger authored
* Generalize problem_type to all classification models * Missing import * Deberta BC and fix tests * Fix template * Missing imports * Revert change to reformer test * Fix style
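A minimal sketch of the `problem_type` setting this commit generalizes; the checkpoint and label count below are placeholders, not from the PR:

```python
from transformers import AutoConfig, AutoModelForSequenceClassification

# Hypothetical setup: any sequence-classification checkpoint and label count work here.
config = AutoConfig.from_pretrained(
    "bert-base-uncased",
    num_labels=5,
    problem_type="multi_label_classification",  # or "regression" / "single_label_classification"
)
model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", config=config)
# With this problem_type set, the forward pass picks BCEWithLogitsLoss when float labels are passed.
```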
-
Sylvain Gugger authored
* Fix pipeline tests env and fetch * Fix quality
-
Nicolas Patry authored
* Adding `handle_long_generation` parameter for the `text-generation` pipeline. * More error handling * Fixing tests by dropping tf support on this functionality, it needs `max_new_tokens` to make it possible to understand the user's intent. Otherwise, `max_length` == `tokenizer.model_max_length` < input_ids.shape[0]. * Fixing doc ? * Doc ? * Remove link from doc. * Caught an issue on roberta. * Damn doc. * Non BC proposal ? * Cleaning the fix ? * Finally using only a test override. * Don't need to modify this. * Bad print.
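A hedged sketch of how the new parameter might be used; the model and prompt are placeholders, and `"hole"` (truncate the prompt from the left) is the strategy described in the PR:

```python
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")
very_long_prompt = "some very long prompt " * 400  # well beyond GPT-2's 1024-token context

# handle_long_generation="hole" drops the oldest prompt tokens so that
# prompt + max_new_tokens still fits in the model's context window.
out = generator(very_long_prompt, handle_long_generation="hole", max_new_tokens=20)
print(out[0]["generated_text"][-200:])
```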
-
Daniel Stancl authored
* Add support for the fast (Rust) implementation of BlenderbotTokenizer * Fix a converter and a typo in a doc * Apply patil-suraj's suggestion * (Nitpick) Fast tokenization -> Fast Tokenization in doc * Apply SaulLu's suggestion * Apply Narsil's suggestion to fix test pipelines * Add encoder_no_repeat_ngram_size according to Narsil's suggestion * Revert the last (unnecessary) commit * Override pipeline config for Blenderbot to allow for larger pos. emb. * make fix-copies
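A minimal sketch of the new fast tokenizer; the checkpoint name is an assumption (any Blenderbot checkpoint should behave the same):

```python
from transformers import BlenderbotTokenizer, BlenderbotTokenizerFast

checkpoint = "facebook/blenderbot-400M-distill"  # assumed checkpoint
slow = BlenderbotTokenizer.from_pretrained(checkpoint)
fast = BlenderbotTokenizerFast.from_pretrained(checkpoint)

text = "Hello, how are you?"
# Both implementations are expected to produce the same ids; the fast (Rust) one is quicker.
print(slow(text)["input_ids"])
print(fast(text)["input_ids"])
```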
-
Thomas Wang authored
* Remove n_ctx from configs * Fix GPTJ and OpenAIGPT, both are acceptable breaking changes as there are no configs such that it breaks * Remove unnecessary n_positions from TFOpenAIGPT
-
Nicolas Patry authored
* Tentative enabling of `batch_size` for pipelines.
* Add systematic test for pipeline batching.
* Enabling batch_size on almost all pipelines
  - Not `zero-shot` (it's already passing stuff as batched so trickier)
  - Not `QA` (preprocess uses squad features, we need to switch to real tensors at this boundary).
* Adding `min_length_for_response` for conversational.
* Making CTC, speech mappings available regardless of framework.
* Attempt at fixing automatic tests (ffmpeg not enabled for fast tests).
* Removing ffmpeg dependency in tests.
* Small fixes.
* Slight cleanup.
* Adding docs and addressing comments.
* Quality.
* Update docs/source/main_classes/pipelines.rst
* Update src/transformers/pipelines/question_answering.py
* Update src/transformers/pipelines/zero_shot_classification.py
* Improving docs.
* Update docs/source/main_classes/pipelines.rst
* N -> observed_batch_size softmax trick.
* Follow `padding_side`.
* Supporting image pipeline batching (and padding).
* Rename `unbatch` -> `loader_batch`.
* unbatch_size forgot.
* Custom padding for offset mappings.
* Attempt to remove librosa.
* Adding require_audio.
* torchaudio.
* Back to using datasets librosa.
* Adding help to set a pad_token on the tokenizer.
* Update src/transformers/pipelines/base.py
* Update src/transformers/pipelines/base.py
* Update src/transformers/pipelines/base.py
* Quality.

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
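A minimal sketch of the batching API this commit enables; the model and inputs are placeholders:

```python
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",  # assumed checkpoint
)

texts = ["I love this.", "This is terrible.", "Not sure how I feel."] * 10
# batch_size only controls how inputs are grouped internally; results still come back per input.
for result in classifier(texts, batch_size=8):
    print(result["label"], round(result["score"], 3))
```

As the last bullets note, tokenizers without a pad token (e.g. GPT-2) need one set before padded batching can work.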
-
- 28 Oct, 2021 11 commits
-
David del Río Medina authored
-
Patrick von Platen authored
-
Lysandre authored
-
Lysandre authored
-
NielsRogge authored
* Fix docs * Apply suggestions from review + fix bug
-
NielsRogge authored
* First draft * Make tuple output more readable * Replace assertions by value errors * Make it possible to predict_with_generate for vision and speech models * Adapt Seq2SeqTrainer to work with VisionEncoderDecoder/SpeechEncoderDecoder * Add deprecation warning * Add copied from statements to vision and speech encoder decoders * Fix failing test * Apply @patrickvonplaten's suggestion * Use reshape instead of view for consistency
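A hedged sketch of the flag this commit makes usable with vision/speech encoder-decoders; only the arguments are shown, the model and datasets are assumed to be prepared elsewhere:

```python
from transformers import Seq2SeqTrainingArguments

args = Seq2SeqTrainingArguments(
    output_dir="out",
    predict_with_generate=True,   # evaluation/prediction call model.generate()
    per_device_eval_batch_size=4,
)
# Seq2SeqTrainer(model=..., args=args, eval_dataset=...).evaluate() can then compute
# generation-based metrics for e.g. a VisionEncoderDecoderModel or SpeechEncoderDecoderModel.
```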
-
Anton Lozhkov authored
* Fix SEW-D * Update tests * isort
-
Anton Lozhkov authored
-
NielsRogge authored
* First draft
* Make style & quality
* Improve conversion script
* Add print statement to see actual slice
* Make absolute tolerance smaller
* Fix image classification models
* Add post_process_semantic method
* Disable padding
* Improve conversion script
* Rename to ForSemanticSegmentation, add integration test, remove post_process methods
* Improve docs
* Fix code quality
* Fix feature extractor tests
* Fix tests for image classification model
* Delete file
* Add is_torch_available to feature extractor
* Improve documentation of feature extractor methods
* Apply suggestions from @sgugger's code review
* Apply some more suggestions of code review
* Rebase with master
* Fix rebase issues
* Make sure model only outputs hidden states when the user wants to
* Apply suggestions from code review
* Add pad method
* Support padding of 2d images
* Add print statement
* Add print statement
* Move padding method to SegformerFeatureExtractor
* Fix issue
* Add casting of segmentation maps
* Add test for padding
* Add small note about padding

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
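A hedged sketch of the new semantic-segmentation head; the checkpoint name and dummy image are assumptions:

```python
import torch
from PIL import Image
from transformers import SegformerFeatureExtractor, SegformerForSemanticSegmentation

checkpoint = "nvidia/segformer-b0-finetuned-ade-512-512"  # assumed checkpoint
feature_extractor = SegformerFeatureExtractor.from_pretrained(checkpoint)
model = SegformerForSemanticSegmentation.from_pretrained(checkpoint)

image = Image.new("RGB", (512, 512))  # stand-in for a real image
inputs = feature_extractor(images=image, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
print(logits.shape)  # (batch, num_labels, height/4, width/4) for SegFormer
```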
-
Stas Bekman authored
* respect dtype in _get_resized_lm_head
* Update src/transformers/modeling_utils.py
* consistency

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
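A small sketch of the behaviour this fixes; the checkpoint is a placeholder and the printed dtypes are what one would expect after the fix:

```python
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2", torch_dtype=torch.float16)
model.resize_token_embeddings(model.config.vocab_size + 10)

# After the fix, resized embedding / LM-head weights keep the model's dtype
# instead of silently falling back to float32.
print(model.get_input_embeddings().weight.dtype)   # expected: torch.float16
print(model.get_output_embeddings().weight.dtype)  # expected: torch.float16
```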
-
Patrick von Platen authored
-
- 27 Oct, 2021 8 commits
-
Patrick von Platen authored
-
Patrick von Platen authored
* up * up * fix * up * Update examples/pytorch/test_xla_examples.py * correct labels * up * up * up * up * up * up
-
Anton Lozhkov authored
* Add conversion * Rename * Add an integration test and remove layer_norm * Remove layer_norm from the converter * wording * Fix imports
-
Lahfa Samy authored
* Replace asserts in data_collator.py with ValueError
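The general pattern being applied, shown as a hedged sketch rather than the exact lines of data_collator.py:

```python
def check_pad_token(tokenizer):
    # Before: `assert tokenizer.pad_token is not None, "..."`, which disappears under
    # `python -O` and raises a bare AssertionError.
    # After: an explicit exception that always fires and carries an actionable message.
    if tokenizer.pad_token is None:
        raise ValueError(
            "This collator requires a tokenizer with a pad token; set `tokenizer.pad_token` "
            "(e.g. to `tokenizer.eos_token`) before using it."
        )
```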
-
Anton Lozhkov authored
-
Patrick von Platen authored
* up * up * finish * up * final changes
-
Anton Lozhkov authored
* Add SEW CTC models * Update paths * Update paths
-
Lysandre Debut authored
-
- 26 Oct, 2021 11 commits
-
Kamal Raj authored
* Switch to inference_mode from no_grad for faster inference * Added switch to support older versions of PyTorch
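A sketch of the pattern described above: prefer `torch.inference_mode` when available and fall back to `torch.no_grad` on older PyTorch versions:

```python
import torch

# inference_mode exists from PyTorch 1.9; older versions fall back to no_grad.
inference_context = torch.inference_mode if hasattr(torch, "inference_mode") else torch.no_grad

with inference_context():
    x = torch.randn(2, 3)
    y = (x @ x.T).softmax(dim=-1)
print(y.requires_grad)  # False under either context
```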
-
Emanuel Huber authored
Updated masked-language modeling examples in PyTorch with the convention defined by #12789
-
Matthew Goldey authored
* specify the text column name in the error message * pluralize the word fields
-
Jangwon Park authored
-
Lysandre authored
-
Jay Zhang authored
* Add symbolic function for XSoftmax op for exporting to ONNX. * Fix format issues. * Fix a CI issue related to copies.
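A rough sketch of the general mechanism (a `symbolic` staticmethod on a `torch.autograd.Function`, which ONNX export calls instead of tracing `forward`); the op below is an illustrative stand-in, not the actual DeBERTa XSoftmax code:

```python
import torch

class MaskedSoftmax(torch.autograd.Function):
    """Hypothetical masked softmax used only to illustrate the symbolic-function hook."""

    @staticmethod
    def forward(ctx, input, mask, dim):
        # Eager behaviour: drop masked positions, then softmax.
        filled = input.masked_fill(~mask.bool(), torch.finfo(input.dtype).min)
        return torch.softmax(filled, dim)

    @staticmethod
    def symbolic(g, input, mask, dim):
        # torch.onnx.export uses this to emit plain ONNX nodes (Where + Softmax)
        # instead of failing on an unsupported custom op.
        neg_inf = g.op("Constant", value_t=torch.tensor(torch.finfo(torch.float32).min))
        bool_mask = g.op("Cast", mask, to_i=9)  # 9 == onnx.TensorProto.BOOL
        filled = g.op("Where", bool_mask, input, neg_inf)
        return g.op("Softmax", filled, axis_i=dim)
```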
-
Patrick von Platen authored
* unispeech * add copy from * remove hubert copy from * finish for today * add unispeech-sat * adapt more * up * up * up * up * add modeling * add tests * up * up * finish * up * Apply suggestions from code review * up * up * Apply suggestions from code review * up * up

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
-
Stas Bekman authored
* [megatron_gpt2] dynamic gelu, add tokenizer, save config * cleanup * Update src/transformers/models/megatron_gpt2/convert_megatron_gpt2_checkpoint.py * apply suggestions

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sergio Valcarcel Macua authored
* Include KerasTensor in allowed types - This allows propagating symbolic tensors through TFBert models and layers' call(), which allows converting the subclass models to functional models. * Style pass

Co-authored-by: Sergio Valcarcel Macua <sergiov@graphcore.ai>
Co-authored-by: matt <rocketknight1@gmail.com>
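A hedged sketch of what the change enables: a symbolic `tf.keras.Input` (a KerasTensor) can flow through the subclassed model's `call()`, so a functional model can be built around it. Shapes and checkpoint are assumptions:

```python
import tensorflow as tf
from transformers import TFBertModel

bert = TFBertModel.from_pretrained("bert-base-uncased")  # assumed checkpoint

input_ids = tf.keras.Input(shape=(128,), dtype=tf.int32, name="input_ids")
attention_mask = tf.keras.Input(shape=(128,), dtype=tf.int32, name="attention_mask")

# With KerasTensor accepted by call(), the outputs are symbolic too and can be
# wired into a plain functional Keras model.
outputs = bert(input_ids=input_ids, attention_mask=attention_mask)
functional = tf.keras.Model(inputs=[input_ids, attention_mask], outputs=outputs.last_hidden_state)
functional.summary()
```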
-
Patrick von Platen authored
[Speech Recognition] - Distributed training: Make sure vocab file removal and creation don't interfere (#14161) * up * better
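One common way to serialize vocab-file creation across ranks, sketched here as an assumption rather than the PR's exact code:

```python
import torch

def write_vocab_file(path, vocab, local_rank):
    # Only the main process writes the file; in distributed runs every rank then
    # meets at a barrier, so no rank reads or deletes the file while it is being created.
    if local_rank in (-1, 0):
        with open(path, "w", encoding="utf-8") as f:
            f.write("\n".join(vocab))
    if local_rank != -1:
        torch.distributed.barrier()
```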
-