Commits · 1988849bbf413aa78528c2582a43cddb0d35d074 · chenpangpang / transformers

23 Sep, 2021 4 commits

Handle `UnicodeDecodeError` (#13717) · 1988849b
Li-Huai (Allan) Lin authored Sep 24, 2021

1988849b

Add cpu distributed fine-tuning support for transformers Trainer API (#13574) · 8632a60d

kding1 authored Sep 23, 2021



* update trainer with cpu distributed fine-tuning support.
Signed-off-by: Ding, Ke <ke.ding@intel.com>

* Style.

* refinement on cpu dist training check.
Signed-off-by: Ding, Ke <ke.ding@intel.com>

* style.
Signed-off-by: Ding, Ke <ke.ding@intel.com>

* Test over private field not public one.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>
Co-authored-by: Funtowicz Morgan <mfuntowicz@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8632a60d

Add SigOpt HPO to transformers trainer api (#13572) · 6a3a197f

kding1 authored Sep 23, 2021



* add sigopt hpo to transformers.
Signed-off-by: Ding, Ke <ke.ding@intel.com>

* extend sigopt changes to test code and others..
Signed-off-by: Ding, Ke <ke.ding@intel.com>

* Style.

* fix style for sigopt integration.
Signed-off-by: Ding, Ke <ke.ding@intel.com>

* Add necessary information to run unittests on SigOpt.
Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>

6a3a197f

1x model size CPU memory usage for `from_pretrained` (#13466) · 62832c96

Stas Bekman authored Sep 22, 2021

* one possible solution

* low mem from_pretrained

* edge cases

* solve the persistent buffers

* style

* parametrize

* for later

* proper solution

* cleanup

* refactor; rework based on suggestions

* revert splitting into 2 parts, move checks into main func

62832c96

22 Sep, 2021 11 commits

Fix torchscript tests (#13701) · ca257a06
Lysandre Debut authored Sep 22, 2021

ca257a06

Add BlenderBot small tokenizer to the init (#13367) · 5b570754

Lysandre Debut authored Sep 22, 2021



* Add BlenderBot small tokenizer to the init

* Update src/transformers/__init__.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Style

* Bugfix
Co-authored-by: Suraj Patil <surajp815@gmail.com>

5b570754

Fix reference to tpu short seq length (#13686) · 9e0fd780
Gunjan Chhablani authored Sep 23, 2021

9e0fd780
add a note about tokenizer (#13696) · 6dc41d9f
Suraj Patil authored Sep 23, 2021

6dc41d9f

[GPT-J] Use the `float16` checkpoints in integration tests (#13676) · 7c7d2ec9

Anton Lozhkov authored Sep 22, 2021

* Use fp16 checkpoints

* Style

* Fix outputs and disable OOM tests

* Correct another output

* Use a random smaller model for generation tests

* repo quickfix

* fix gradient checkpointing

7c7d2ec9

Patch training arguments issue (#13700) · 0ecdf6de

Lysandre Debut authored Sep 22, 2021



* Patch training arguments issue

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

0ecdf6de

Allow only textual inputs to VisualBert (#13687) · 50c746ee
Gunjan Chhablani authored Sep 22, 2021

50c746ee
Fix non-negligible difference between GPT2 and TFGP2 (#13679) · 93624bfe
Yih-Dar authored Sep 22, 2021
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
93624bfe

Assertions to exceptions (#13692) · a0c08aa3

MocktaiLEngineer authored Sep 22, 2021



* Raise exceptions instead of using assertions for control flow #12789

* # coding=utf-8

* Raise exceptions instead of using assertions for control flow

* Raise exceptions instead of using assertions for control flow

* Update src/transformers/tokenization_utils.py

Raise exceptions instead of using assertions for control flow
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Update src/transformers/tokenization_utils.py

Raise exceptions instead of using assertions for control flow
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Raise exceptions instead of using assertions for control flow

* test

* Raise exceptions instead of using assertions for control flow
Co-authored-by: MocktaiLEngineer <kavinarasu22@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

a0c08aa3

Make gradient_checkpointing a training argument (#13657) · 27d46397

Sylvain Gugger authored Sep 22, 2021



* Make gradient_checkpointing a training argument

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/configuration_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Fix tests

* Style

* document Gradient Checkpointing as a performance feature

* Small rename

* PoC for not using the config

* Adapt BC to new PoC

* Forgot to save

* Rollout changes to all other models

* Fix typo
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

27d46397

[Wav2Vec2FeatureExtractor] Fix `extractor.pad()` dtype backwards compatibility (#13693) · 75f6641e
Anton Lozhkov authored Sep 22, 2021
```
* Force dtype, add tests

* Local torch imports

* Remove unused logic (always ndarray)
```
75f6641e

21 Sep, 2021 12 commits

[AutoTokenizer] Allow creation of tokenizers by tokenizer type (#13668) · 8e908c8c
Patrick von Platen authored Sep 22, 2021
```
* up

* up
```
8e908c8c
up (#13688) · 2608944d
Patrick von Platen authored Sep 22, 2021

2608944d

Update modeling_flax_wav2vec2.py (#13680) · 8565d38f

Kamal Raj authored Sep 22, 2021

conv kernel_size to Tuple,
Flax Version 0.3.5 breaking change, https://github.com/google/flax/releases/tag/v0.3.5

8565d38f

Skip FlaxWav2Vec2 test until fixed · d16bec95
Sylvain Gugger authored Sep 21, 2021

d16bec95

Layoutlm onnx support (Issue #13300) (#13562) · ddd4d02f

Nishant Prabhu authored Sep 22, 2021



* Add support for exporting PyTorch LayoutLM to ONNX

* Added tests for converting LayoutLM to ONNX

* Add support for exporting PyTorch LayoutLM to ONNX

* Added tests for converting LayoutLM to ONNX

* cleanup

* Removed regression/ folder

* Add support for exporting PyTorch LayoutLM to ONNX

* Added tests for converting LayoutLM to ONNX

* cleanup

* Fixed import error

* Remove unnecessary import statements

* Changed max_2d_positions from class variable to instance variable of the config class

* Add support for exporting PyTorch LayoutLM to ONNX

* Added tests for converting LayoutLM to ONNX

* cleanup

* Add support for exporting PyTorch LayoutLM to ONNX

* cleanup

* Fixed import error

* Changed max_2d_positions from class variable to instance variable of the config class

* Use super class generate_dummy_inputs method
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

* Add support for Masked LM, sequence classification and token classification
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

* Removed uncessary import and method

* Fixed code styling

* Raise error if PyTorch is not installed

* Remove unnecessary import statement
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

ddd4d02f

Add push_to_hub to no_trainer examples (#13659) · b7d264be

Sylvain Gugger authored Sep 21, 2021

* Add push_to_hub to no_trainer examples

* Quality

* Document integration

* Roll out to other examples

b7d264be

[SinusoidalPositionalEmbedding] incorrect dtype when make_weights in forward (#13665) · a722c301
Stas Bekman authored Sep 21, 2021

a722c301

[SequenceFeatureExtractor] Rewrite padding logic from pure python to numpy (#13650) · 1417978c

Anton Lozhkov authored Sep 21, 2021

* Test np padding

* Pass feature extraction tests

* Update type hints

* Fix flaky integration tests

* Try a more stable waveform

* Add to_numpy jax support

* int32 attention masks

* Refactor normalization tests

1417978c

Typo "UNKWOWN" -> "UNKNOWN" (#13675) · 8d533e6a
Kamal Raj authored Sep 21, 2021

8d533e6a

[FLAX] Question Answering Example (#13649) · 78807d86

Kamal Raj authored Sep 21, 2021

* flax qa example

* Updated README:  Added Large model

* added utils_qa.py FULL_COPIES

* Updates:
1. Copyright Year updated
2. added dtype arg
3. passing seed and dtype to load model
4. Check eval flag before running eval

* updated README

* updated code comment

78807d86

beit-flax (#13515) · a2dec768

Kamal Raj authored Sep 21, 2021

* beit-flax

* updated FLAX_BEIT_MLM_DOCSTRING

* removed bool_masked_pos from classification

* updated Copyright

* code refactoring: x -> embeddings

* updated test: rm from_pt

* Update docs/source/model_doc/beit.rst

* model code dtype updates and
other changes according to review

* relative_position_bias
revert back to pytorch design

a2dec768

Add Speech AutoModels (#13655) · 48fa42e5
Patrick von Platen authored Sep 21, 2021
```
* upload

* correct

* correct

* correct

* finish

* up

* up

* up again
```
48fa42e5

20 Sep, 2021 10 commits

Fix typo distilbert doc (#13643) · ea921365
flozi00 authored Sep 20, 2021

ea921365
fix research_projects/mlm_wwm readme.md examples (#13646) · 28d5700a
Lowin authored Sep 21, 2021
```
the variables of run example is not correct
```
28d5700a

Dynamically load model code from the Hub (#13467) · 002a078a

Sylvain Gugger authored Sep 20, 2021



* Dynamic model

* Use defensive flag

* Style

* Doc and arg rename

* Arg rename

* Add tests

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

002a078a

Change https:/ to https:// (#13644) · aeb2dac0
flozi00 authored Sep 20, 2021

aeb2dac0

[megatron_gpt2] checkpoint v3 (#13508) · 0af901e8

Stas Bekman authored Sep 20, 2021

* [megatron_gpt2] checkpoint v3

* bug fix

* fixes

* switch to default  from  - which is what the current megatron-lm uses

* cleanup

* back compat

0af901e8

Update modeling_tf_deberta.py (#13654) · 936b3fde
Kamal Raj authored Sep 20, 2021
```
Fixed expand_dims axis
```
936b3fde
Fix mT5 documentation (#13639) · 04976a32
Ayaka Mikazuki authored Sep 20, 2021
```
* Fix MT5 documentation

The abstract is incomplete

* MT5 -> mT5
```
04976a32
[Fix]Make sure the args tb_writer passed to the TensorBoardCallback works (#13636) · fe379f85
Chengjiang Li authored Sep 20, 2021

fe379f85

Add FNet (#13045) · d8049331

Gunjan Chhablani authored Sep 20, 2021



* Init FNet

* Update config

* Fix config

* Update model classes

* Update tokenizers to use sentencepiece

* Fix errors in model

* Fix defaults in config

* Remove position embedding type completely

* Fix typo and take only real numbers

* Fix type vocab size in configuration

* Add projection layer to embeddings

* Fix position ids bug in embeddings

* Add minor changes

* Add conversion script and remove CausalLM vestiges

* Fix conversion script

* Fix conversion script

* Remove CausalLM Test

* Update checkpoint names to dummy checkpoints

* Add tokenizer mapping

* Fix modeling file and corresponding tests

* Add tokenization test file

* Add PreTraining model test

* Make style and quality

* Make tokenization base tests work

* Update docs

* Add FastTokenizer tests

* Fix fast tokenizer special tokens

* Fix style and quality

* Remove load_tf_weights vestiges

* Add FNet to  main README

* Fix configuration example indentation

* Comment tokenization slow test

* Fix style

* Add changes from review

* Fix style

* Remove bos and eos tokens from tokenizers

* Add tokenizer slow test, TPU transforms, NSP

* Add scipy check

* Add scipy availabilty check to test

* Fix tokenizer and use correct inputs

* Remove remaining TODOs

* Fix tests

* Fix tests

* Comment Fourier Test

* Uncomment Fourier Test

* Change to google checkpoint

* Add changes from review

* Fix activation function

* Fix model integration test

* Add more integration tests

* Add comparison steps to MLM integration test

* Fix style

* Add masked tokenization fix

* Improve mask tokenization fix

* Fix index docs

* Add changes from review

* Fix issue

* Fix failing import in test

* some more fixes

* correct fast tokenizer

* finalize

* make style

* Remove additional tokenization logic

* Set do_lower_case to False

* Allow keeping accents

* Fix tokenization test

* Fix FNet Tokenizer Fast

* fix tests

* make style

* Add tips to FNet docs
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

d8049331

fix typo (#13647) · 87d5057d
Suraj Patil authored Sep 20, 2021

87d5057d

17 Sep, 2021 3 commits
- Fix GPT2Config parameters in GPT2ModelTester (#13630) · b518aaf1
  calpt authored Sep 17, 2021
  
  b518aaf1
- Updated tiny distilbert models (#13631) · 300ee0c7
  Lysandre Debut authored Sep 17, 2021
  
  300ee0c7
- fix some docstring in encoder-decoder models (#13611) · afb07a79
  Yih-Dar authored Sep 17, 2021
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  afb07a79