1. 12 Nov, 2021 3 commits
    • [Wav2Vec2 Example] Improve fine-tuning script (#14373) · 55f49c5f
      Patrick von Platen authored
      * improve some stuff
      
      * finish
      
      * correct last
    • fix docs (#14377) · 21546e59
      Suraj Patil authored
    • Adding support for raw python `generator` in addition to `Dataset` for pipelines (#14352) · ed5d1551
      Nicolas Patry authored
      * Adding support for raw python `generator` in addition to `Dataset`
      
      The main goal is to ease the creation of streaming data for the pipeline.
      
      `Dataset` is more involved and PyTorch-specific.
      
      This PR provides a way to use a plain Python iterator too.
      This enables #14250, but can stand as a standalone PR.
      
      ```python
      from transformers import pipeline
      
      def read_data(filename):
          with open(filename, 'r') as f:
              for line in f:
                  yield line
      
      pipe = pipeline("text-classification")
      for classified in pipe(read_data("large_file.txt")):
          print("Success ! ", classified)
      ```
      
      The main caveat is the interaction with `DataLoader` when
      `num_workers>1`. With multiple workers, each receives a copy
      of the generator (as with `IterableDataset`), so the naive iterator
      fails: every worker iterates over all items of the generator.
      
      There are ways to do clever "skipping", but they can still be
      costly: every worker still has to pass through all items of the
      generator (each just ignores the items it does not handle), which,
      depending on the case, can be expensive.
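      The worker-copy problem above can be sketched in plain Python (a hypothetical simulation, not the actual `DataLoader` machinery; `stream` stands in for the user's generator):
      
      ```python
      # Pure-Python sketch (no torch needed) of the worker-copy problem.
      # `stream` stands in for the user's generator, e.g. lines of a large file.
      def stream():
          yield from range(6)
      
      num_workers = 2
      
      # Naive: each worker iterates its own copy of the generator,
      # so every item is processed once per worker.
      naive = [item for _ in range(num_workers) for item in stream()]
      assert len(naive) == 12  # 6 items * 2 workers: duplicated work
      
      # "Clever skipping": worker w keeps only items with index % num_workers == w.
      # The output is correct, but each worker still scans the whole stream.
      skipped = sorted(
          item
          for w in range(num_workers)
          for idx, item in enumerate(stream())
          if idx % num_workers == w
      )
      assert skipped == [0, 1, 2, 3, 4, 5]
      ```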
      
      Using `num_workers=1` is the simplest fix, and if the cost of loading
      your data is small enough, it should work well. In the example above,
      for instance, trying smart tricks to skip some lines is unlikely to
      be a net positive.
      
      If there are better ways to do "jumps" into the data, then using a
      `Dataset` is advised (since different workers can then jump directly
      to their own items).
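      By contrast, an indexable dataset lets each worker jump straight to its own shard; a minimal sketch, using a plain list as a stand-in for a `Dataset`:
      
      ```python
      data = list(range(10))  # stand-in for an indexable Dataset (random access)
      
      num_workers = 2
      
      def worker_shard(worker_id):
          # With random access, a worker jumps directly to its own items
          # instead of scanning the whole stream and skipping.
          return data[worker_id::num_workers]
      
      shards = [worker_shard(w) for w in range(num_workers)]
      assert shards[0] == [0, 2, 4, 6, 8]
      assert shards[1] == [1, 3, 5, 7, 9]
      assert sorted(shards[0] + shards[1]) == data  # full coverage, no duplicates
      ```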
      
      * Adding iterator support for `tf` too.
  2. 11 Nov, 2021 7 commits
  3. 10 Nov, 2021 6 commits
  4. 09 Nov, 2021 10 commits
  5. 08 Nov, 2021 8 commits
  6. 06 Nov, 2021 4 commits
  7. 05 Nov, 2021 2 commits