Commits · e92190c0f81bb8740ae784962f6d81ce753483aa · chenpangpang / transformers

11 Nov, 2021 2 commits

Fix Flax params dtype (#13098) · e92190c0

Suraj Patil authored Nov 11, 2021



* fix inits

* fix embed dtype

* fix embed dtype

* add test to check default dtype

* quality

* add type conversion methods for flax models

* more robust casting

* cast sinusoidal positions

* update pegasus

* update albert

* update test

* make sure dtype is passed to every module

* style

* fix electra dense

* fix t5

* quality

* add more tests

* better name

* use the dtype for lm head computation

* fix albert

* style

* fix albert embed dtype

* more tests

* fix vision enc-dec

* cleanup

* fix embed dtype pegasus

* fix default param test

* doc

* update template

* fix final_logits_bias dtype

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix doc

* fix doc

* add detailed docstring for dtype parameter

* remove unnecessary import
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

e92190c0

solve the port conflict (#14362) · 1c76a516
Stas Bekman authored Nov 10, 2021

1c76a516

10 Nov, 2021 6 commits

Fix list index out of range when padding nested empty lists (#13876) · 9e37c5cd
Li-Huai (Allan) Lin authored Nov 11, 2021
```
* Fix index out of range when padding

* Apply suggestions from code review

* Style
```
9e37c5cd
enhance rewrite state_dict missing _metadata (#14348) · bec02ff2
Chang Wang authored Nov 10, 2021

bec02ff2

Add notebook INC quantization for text classification tasks (#14293) · 2b0d9389

Ella Charlaix authored Nov 10, 2021

* Add notebook applying Intel Neural Compressor quantization for text classification tasks

* Add Optimum notebooks section

2b0d9389

Fix fast tokenization problems (#13930) · ea163d09

Li-Huai (Allan) Lin authored Nov 10, 2021

* Fix albert mask token tokenization.

* Ensure special tokens are sanitized.

* Style

* Fix

* Apply suggestions from code review

ea163d09

Adding some quality of life for `pipeline` function. (#14322) · 5c153079

Nicolas Patry authored Nov 10, 2021



* Adding some quality of life for `pipeline` function.

* Update docs/source/main_classes/pipelines.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/pipelines/__init__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Improve the tests.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

5c153079

`BatchFeature`: Convert `List[np.ndarray]` to `np.ndarray` before converting... · 321eb562

Elad Segal authored Nov 10, 2021


`BatchFeature`: Convert `List[np.ndarray]` to `np.ndarray` before converting to pytorch tensors (#14306)

* update

* style fix

* retrigger checks

* check first element

* fix syntax error

* Update src/transformers/feature_extraction_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove import
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

321eb562

09 Nov, 2021 10 commits

Support for TF >= 2.7 (#14345) · 46d0cdae
Sylvain Gugger authored Nov 09, 2021

46d0cdae
[Bert2Bert] allow bert2bert + relative embeddings (#14324) · e81d8d7f
Patrick von Platen authored Nov 09, 2021
```
* [Bert2Bert] allow bert2bert + relative embeddings

* up

* Update README_ko.md

* up

* up
```
e81d8d7f

Rewrite guides for fine-tuning with Datasets (#13923) · e4d8f517

Steven Liu authored Nov 09, 2021

* rewrite guides for fine-tuning with datasets

* simple qa code example

* use anonymous rST links

* style

e4d8f517

bump flax version (#14343) · 85a4bda4
Suraj Patil authored Nov 09, 2021

85a4bda4
remove test_model_various_embeddings (#14341) · babd0b9a
Yih-Dar authored Nov 09, 2021
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
babd0b9a

Update Seq2Seq QA example script to use SQuAD metric. (#14335) · 4f24058c

karthikrangasai authored Nov 09, 2021

* Update postprocessing accordingly to use SQuAD metric.

* Update assets accordingly based on SQuAD metrics.

* Fix function naming error.

4f24058c

Add TFViTModel (#13778) · be4a6c64

Yih-Dar authored Nov 09, 2021



* Start the work for TFViTModel

* Convert to TF code - need to check in the follow up commits

* Clean up model code

* Expose TFViTModel

* make style

* make quality

* Add test

* make style & quality

* Fix some imports

* fix wrong usage - *kwargs => **kwargs

* Fix Conv2D weight loading (PT->TF) issue

* Add tests for images with different sizes + fix model

* Fix some common tests for TFViTModel

* Use inputs instead of input_ids in test_compile_tf_model

* Add a comment about transpose and Conv2D in convert_tf_weight_name_to_pt_weight_name

* Avoid transpose in TFViT call

* Fix Conv2D issue in load_tf2_weights_in_pytorch_model

* Use tf.keras.layers.Conv2D instead of tf.nn.conv2d

* Using simpler heuristic to detect Conv2D layer

* Change convert_tf_weight_name_to_pt_weight_name to return TransposeType

* Check tf_weight_shape is not None before using it

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix missing comma

* fix input dtype
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

be4a6c64
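The `*kwargs => **kwargs` item in the entry above names a common Python slip. The log does not show the actual call site, so the following is only a minimal sketch with hypothetical function names (`layer`, `forward_keywords`, `forward_positionally`), none of which come from `transformers`. It shows why unpacking a keyword dict with a single `*` fails where `**` works.

```python
def layer(name: str, **options) -> dict:
    # Stand-in for any callable that accepts keyword options.
    return {"name": name, **options}


def forward_keywords(name: str, **kwargs) -> dict:
    # Correct: ** re-expands the dict into keyword arguments.
    return layer(name, **kwargs)


def forward_positionally(name: str, **kwargs) -> dict:
    # Wrong: * iterates over the dict and passes its keys as extra
    # positional arguments, which `layer` does not accept.
    return layer(name, *kwargs)


ok = forward_keywords("conv", stride=2)

try:
    forward_positionally("conv", stride=2)
    failed = False
except TypeError:
    failed = True
```

With a single `*`, only the keys are forwarded and the values are dropped, so even a callee that accepts extra positional arguments would receive the wrong data.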

Correct order of overflowing tokens for LayoutLmV2 tokenizer (#13495) · 6326aa4b

Apoorv Garg authored Nov 09, 2021



* correct order of overflowing tokens for LayoutLmV2 tokenizer

* test to check order of overflowing_tokens for a seq of input_ids

* fix up quality

* added suggested changes

* check that tests the bbox sequence

* pair_input test added

* pass quality test

* check bbox sequence added

* unittest method

* comments added

* add overflowing bbox test

* improved "seq_1"
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

* improve code quality
Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

6326aa4b

Add FlaxVisionEncoderDecoderModel (#13359) · 95b3ec3b

Yih-Dar authored Nov 09, 2021



* Start the work on FlaxVisionEncoderDecoderModel

* Add FlaxVisionEncoderDecoderModel

* Add VisionEncoderDecoderConfig

* Make FlaxVisionEncoderDecoderModel visible to transformers

* Add test

* Fix wrong getattr usage

* Fix tests

* Add FlaxAutoModelForVision2Seq

* Expose FLAX_MODEL_FOR_VISION_2_SEQ_MAPPING

* clean-up

* add integration test

* update expected logits

* update expected scores

* Add ViT2GPT2ModelIntegrationTest + some cleaning

* Add projection layer + PT/Flax equivalence tests

* Fix import

* minor changes

* make test slow again

* Apply suggestions

* Add modeling_flax_vision_encoder_decoder to _ignore_modules in get_model_modules()

* fix copies

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* split long strings in multiple lines

* decoder_input_ids can't be None

* Add back test_configuration_tie

* Remove attention_mask parameter

* fix test - encoder_last_hidden_state should be encoder_outputs.last_hidden_state instead of the projected vector

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Remove more encoder_attention_mask

* remove encoder_attention_mask when calling self.decode (in FlaxVisionEncoderDecoderModule)

* Fix style + pass 1s instead of None as encoder_attention_mask

* fix init_weights

* pass None for encoder_attention_mask

* pass 1s instead of None as encoder_attention_mask

* Fix doc style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

95b3ec3b

Small change to Wav2Vec2 model to support Tensor-Parallelism with DeepSpeed (#14298) · a5030122

Reza Yazdani authored Nov 08, 2021

* minor modification to the wav2vec2 modeling file to support tensor-parallelism with DeepSpeed on this HuggingFace model

* refine the comments

* synch changes

* fix comments

* refine comments

* fix format

a5030122

08 Nov, 2021 8 commits
- [deepspeed] Enable multiple test runs on single box, defer to DS_TEST_PORT if set (#14331) · d0e96c6d
  Jeff Rasley authored Nov 08, 2021
```
* defer to DS_TEST_PORT if set

* style
Co-authored-by: Stas Bekman <stas@stason.org>
```
  d0e96c6d
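The idea in the entry above, deferring to `DS_TEST_PORT` when it is set, can be sketched as follows. This is an illustration under assumptions, not the test suite's code: the helper name `resolve_test_port` and the fallback value are hypothetical, and only the environment variable name comes from the commit message.

```python
import os

# Hypothetical fallback; the real default used by the tests is not shown in this log.
FALLBACK_PORT = 10999


def resolve_test_port(fallback: int = FALLBACK_PORT) -> int:
    # Prefer DS_TEST_PORT when it is set, otherwise use the fallback.
    return int(os.environ.get("DS_TEST_PORT", fallback))
```

Two test runs on the same machine could then export different `DS_TEST_PORT` values so that they do not compete for the same port.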
- Expand dynamic supported objects to configs and tokenizers (#14296) · dfb00bf6
  Sylvain Gugger authored Nov 08, 2021
```
* Dynamic configs

* Add config test

* Better tests

* Add tokenizer and test

* Add to from_config

* With save
```
  dfb00bf6
- Changed relative imports to absolute to allow convert_graph_to_onnx.py to run as a script. (#14325) · de635af3
  nbertagnolli authored Nov 08, 2021
```
* Changed relative imports to absolute to allow convert_graph_to_onnx.py to be run as a script

* isorted code
```
  de635af3
- Fixing mutable default argument in `pipeline`. (#14316) · a3ded170
  Nicolas Patry authored Nov 08, 2021
```
* Fixing mutable default argument.

* XX.

* Revert "XX."

This reverts commit 61d4bb333f6d39a7fbe31d161b8bd14787ceec2e.
```
  a3ded170
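The pitfall named in the entry above can be illustrated with a minimal sketch. The function names are hypothetical and are not taken from `pipeline`; the sketch only shows why a mutable default, which is evaluated once at definition time, leaks state between calls, and the usual `None`-sentinel workaround.

```python
from typing import Optional


def add_kwarg_shared(key: str, value: int, store: dict = {}) -> dict:
    # Pitfall: the default dict is created once, when the function is
    # defined, so every call that omits `store` mutates the same object.
    store[key] = value
    return store


def add_kwarg_fresh(key: str, value: int, store: Optional[dict] = None) -> dict:
    # Workaround: use None as the sentinel and build a new dict per call.
    if store is None:
        store = {}
    store[key] = value
    return store


first = add_kwarg_shared("a", 1)
second = add_kwarg_shared("b", 2)  # same object as `first`: state leaked

third = add_kwarg_fresh("a", 1)
fourth = add_kwarg_fresh("b", 2)  # independent of `third`
```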
- Fixing tests on master. (#14317) · 9b78b070
  Nicolas Patry authored Nov 08, 2021
```
* Fixing tests on master.

* Better fix.

* Lxmert doesn't have feature extractor but is bimodal.
```
  9b78b070
- [TFWav2Vec2Model] Fix input shapes in TFWav2Vec2WeightNormConv1D (#14319) · df1f94eb
  Anton Lozhkov authored Nov 08, 2021
```
* Add paddings to input shapes

* Add padding comment
```
  df1f94eb
- [Tests] Update audio classification tests to support torch 1.10 (#14318) · e30078b5
  Anton Lozhkov authored Nov 08, 2021
  
  e30078b5
- [Marian Conversion] Fix eos_token_id conversion in conversion script (#14320) · b48faae3
  Patrick von Platen authored Nov 08, 2021
  
  b48faae3
06 Nov, 2021 4 commits
- Fix execution PATH for PPLM Example (#14287) · c016dbdb
  Junbum Lee authored Nov 06, 2021
  
  c016dbdb
- Fix tests (#14289) · 34307bb3
  NielsRogge authored Nov 06, 2021
  
  34307bb3
- Handle long answer needs to be updated. (#14279) · 24b30d4d
  Nicolas Patry authored Nov 06, 2021
```
`start_` and `end_` tensors now contain a batch_size at this point.
```
  24b30d4d
- Update dpr.rst (#14300) · 843c326e
  Xing Han Lu authored Nov 06, 2021
  
  843c326e
05 Nov, 2021 3 commits
- Add new LFS prune API (#14294) · 08a5f575
  Sylvain Gugger authored Nov 05, 2021
  
  08a5f575
- [Hubert Docs] Make sure example uses a fine-tuned model (#14291) · 4be78c22
  Patrick von Platen authored Nov 05, 2021
  
  4be78c22
- Pin TF until tests are fixed (#14283) · a14d62b0
  Sylvain Gugger authored Nov 04, 2021
```
* Pin TF until tests are fixed

* Also pin TF CPU
```
  a14d62b0
04 Nov, 2021 4 commits
- Removing Keras version pinning (#14280) · b90a48f6
  Matt authored Nov 04, 2021
```
* Removing Keras version pinning

* make fixup
```
  b90a48f6
- improve rewrite state_dict missing _metadata (#14276) · fd8136fa
  Chang Wang authored Nov 04, 2021
  
  fd8136fa
- Fixing mishandling of `ignore_labels`. (#14274) · d29baf69
  Nicolas Patry authored Nov 04, 2021
```
Fixes #14272
```
  d29baf69
- Fixing slow pipeline tests (#14260) · 68427c9b
  Nicolas Patry authored Nov 04, 2021
```
* Fixing slow pipeline tests

* Remove the image-segmentation override.

* Fixing clamping only in training.

* Wav2vec2.

* Remove last mention of `no_grad`.

* Fixing copies.

* Rename.
```
  68427c9b
03 Nov, 2021 3 commits

Add more instructions to the release guide (#14263) · 1a674ce6

Sylvain Gugger authored Nov 03, 2021



* Add more instructions to the release guide

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comment
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

1a674ce6

Quality explain (#14264) · f0d6e952

Sylvain Gugger authored Nov 03, 2021



* Start PR doc

* Cleanup the quality checks and document them

* Add reference in the contributing guide

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename file as per review suggestion
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

f0d6e952

Pin Keras cause they messed their release (#14262) · a1c15ea8

Sylvain Gugger authored Nov 03, 2021

* Pin Keras cause they messed their release

* Put != instead of <

* Try this way

* Back to the beginning but more aggressive

a1c15ea8