"vscode:/vscode.git/clone" did not exist on "986ac03e374a00a52cee98c8ac14fb1ba6b66610"
  1. 09 Jul, 2021 2 commits
    • This will reduce "Already borrowed error": (#12550) · cc12e1db
      Nicolas Patry authored
      * This will reduce "Already borrowed error":
      
      Original issue https://github.com/huggingface/tokenizers/issues/537
      
      
      
      The original issue is caused by transformers calling mutable functions
      on the Rust tokenizers many times.
      Rust needs to guarantee that only one agent has a mutable reference
      to memory at a given time (for many reasons which don't need explaining
      here). Usually, the Rust compiler can guarantee that this property
      holds at compile time.
      
      Unfortunately, Python cannot provide that guarantee, so PyO3, the
      bridge between Rust and Python used by `tokenizers`, replaces the
      compile-time guarantee with a dynamic one: if multiple agents try
      to take mutable borrows at the same time, the runtime raises
      "Already borrowed".
      
      The proposed fix here in transformers is simply to reduce the number
      of calls that actually need a mutable borrow. By reducing them,
      we reduce the risk of running into the "Already borrowed" error.
      The caveat is that we now add a call to read the current configuration of the
      `_tokenizer`, so in the worst case we make 2 calls instead of 1, and in the best case
      we make 1 call plus a Python comparison of a dict (which should be negligible).
      A minimal sketch of this pattern follows the commit entry below.
      
      * Adding a test.
      
      * trivial error :(.
      
      * Update tests/test_tokenization_fast.py
      Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
      
      * Adding reference to original issues in the tests.
      
      * Update the tests with fast tokenizer.
      Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
      cc12e1db
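
      To make the pattern above concrete, here is a minimal Python sketch. It is
      illustrative only, not the actual transformers code: the `truncation` getter and
      `enable_truncation` setter are stand-ins for whatever read/write accessors the
      fast tokenizer exposes.

      ```python
      # Minimal sketch of "read before you mutate" (illustrative, not the real code).
      def apply_truncation(fast_tokenizer, desired: dict) -> None:
          """Assumed interface: `fast_tokenizer.truncation` returns the current
          truncation settings as a dict (immutable borrow on the Rust side) and
          `enable_truncation(**kwargs)` changes them (mutable borrow)."""
          current = fast_tokenizer.truncation   # cheap read, no mutable borrow
          if current != desired:                # plain Python dict comparison
              # Only reached when something actually changes, so the mutable
              # borrow (and the chance of "Already borrowed") happens less often.
              fast_tokenizer.enable_truncation(**desired)
      ```

      Worst case this is one read plus one write instead of an unconditional write;
      best case it is a single read plus a dict comparison.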
    • Omar Sanseviero · 8fe836af
  2. 08 Jul, 2021 9 commits
    • [doc] fix broken ref (#12597) · ce111fee
      Stas Bekman authored
      ce111fee
    • [model.from_pretrained] raise exception early on failed load (#12574) · f0dde601
      Stas Bekman authored
      
      
      
      * [model.from_pretrained] raise exception early on failed load
      
      Currently, if loading pretrained weights fails in `from_pretrained`, we first print a whole bunch of success messages and then fail; this PR raises the exception first so the failure is not preceded by misleading messages (a sketch of the pattern follows this entry).
      
      * style
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
      f0dde601
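
      A generic illustration of the reordering described above (assumed names and
      messages, not the actual `from_pretrained` implementation): collect the loading
      errors first and raise before any informational output is emitted.

      ```python
      import logging

      logger = logging.getLogger(__name__)

      # Sketch only: fail fast on load errors *before* printing the usual
      # missing/unexpected-keys messages, so a failed load is not preceded by
      # output that suggests everything went fine.
      def report_load_result(model_name, error_msgs, missing_keys, unexpected_keys):
          if error_msgs:
              raise RuntimeError(
                  f"Error(s) in loading state_dict for {model_name}:\n\t"
                  + "\n\t".join(error_msgs)
              )
          if missing_keys:
              logger.info(f"Some weights of {model_name} were newly initialized: {missing_keys}")
          if unexpected_keys:
              logger.info(f"Some checkpoint weights were not used by {model_name}: {unexpected_keys}")
      ```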
    • Fix MT5 init (#12591) · 75e63dbf
      Sylvain Gugger authored
      75e63dbf
    • Fixing the pipeline optimization by reindexing targets (V2) (#12330) · 4da568c1
      Nicolas Patry authored
      
      
      * Fixing the pipeline optimization by rescaling the logits first.
      
      * Add test for target equivalence
      Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
      4da568c1
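
      Reading the commit message, "rescaling the logits first" appears to mean
      normalizing over the full vocabulary before indexing the requested `targets`, so
      the scores returned for a restricted target set match the unrestricted ones. A
      small NumPy sketch of that idea (toy numbers, not the pipeline's actual code):

      ```python
      import numpy as np

      def softmax(x):
          e = np.exp(x - x.max())
          return e / e.sum()

      logits = np.array([2.0, 0.5, -1.0, 3.0])  # toy logits over a 4-token vocabulary
      target_ids = np.array([0, 3])             # ids of the requested `targets`

      # Normalize over the full vocabulary first, then index the targets:
      # these scores are identical to the unrestricted pipeline's scores.
      full_then_index = softmax(logits)[target_ids]

      # Indexing first and then normalizing re-scales over just the targets,
      # which gives different (inflated) values.
      index_then_softmax = softmax(logits[target_ids])

      print(full_then_index)     # approx. [0.25, 0.68] -- consistent with the full distribution
      print(index_then_softmax)  # sums to 1 over only the two targets
      ```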
    • [RFC] Laying down building stone for more flexible ONNX export capabilities (#11786) · 2aa3cd93
      Funtowicz Morgan authored
      
      
      * Laying down building stone for more flexible ONNX export capabilities
      
      * Ability to provide a map of config key to override before exporting.
      
      * Makes it possible to export BART with/without past keys.
      
      * Supports simple mathematical syntax for OnnxVariable.repeated
      
      * Effectively apply value override from onnx config for model
      
      * Supports export with additional features such as with-past for seq2seq
      
      * Store the output path directly in the args for uniform usage across.
      
      * Make BART_ONNX_CONFIG_* constants and fix imports.
      
      * Support BERT model.
      
      * Use tokenizer for more flexibility in defining the inputs of a model.
      
      * Add TODO as reminder to provide the batch/sequence_length as CLI args
      
      * Enable optimizations to be done on the model.
      
      * Enable GPT2 + past
      
      * Improve model validation with outputs containing nested structures
      
      * Enable Roberta
      
      * Enable Albert
      
      * Albert requires opset >= 12
      
      * BERT-like models require opset >= 12
      
      * Remove double printing.
      
      * Enable XLM-Roberta
      
      * Enable DistilBERT
      
      * Disable optimization by default
      
      * Fix missing setattr when applying optimizer_features
      
      * Add value field to OnnxVariable to define constant input (not from tokenizers)
      
      * Add T5 support.
      
      * Simplify model type retrieval
      
      * Example exporting token_classification pipeline for DistilBERT.
      
      * Refactoring to package `transformers.onnx`
      
      * Solve circular dependency & __main__
      
      * Remove unnecessary imports in `__init__`
      
      * Licences
      
      * Use @Narsil's suggestion to forward the model's configuration to the ONNXConfig to avoid interpolation.
      
      * Onnx export v2 fixes (#12388)
      
      * Tiny fixes
      Remove `convert_pytorch` from onnxruntime-less runtimes
      Correct reference to model
      
      * Style
      
      * Fix Copied from
      
      * LongFormer ONNX config.
      
      * Removed optimizations
      
      * Remove bad merge relics.
      
      * Remove unused constants.
      
      * Remove some deleted constants from imports.
      
      * Fix unittest to remove usage of PyTorch model for onnx.utils.
      
      * Fix distilbert export
      
      * Enable ONNX export test for supported model.
      
      * Style.
      
      * Fix lint.
      
      * Enable all supported default models.
      
      * GPT2 only has one output
      
      * Fix bad property name when overriding config.
      
      * Added unittests and docstrings.
      
      * Disable with_past tests for now.
      
      * Enable outputs validation for default export.
      
      * Remove graph opt lvls.
      
      * Last commit with on-going past commented.
      
      * Style.
      
      * Disabled `with_past` for now
      
      * Remove unused imports.
      
      * Remove framework argument
      
      * Remove TFPreTrainedModel reference
      
      * Add documentation
      
      * Add onnxruntime tests to CircleCI
      
      * Add test
      
      * Rename `convert_pytorch` to `export`
      
      * Use OrderedDict for dummy inputs
      
      * WIP Wav2Vec2
      
      * Revert "WIP Wav2Vec2"
      
      This reverts commit f665efb04c92525c3530e589029f0ae7afdf603e.
      
      * Style
      
      * Use OrderedDict for I/O
      
      * Style.
      
      * Specify OrderedDict documentation.
      
      * Style :)
      Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      2aa3cd93
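
      Since this commit introduces the `transformers.onnx` export path and mentions
      validating exported outputs, here is a hedged sketch of that kind of check,
      comparing ONNX Runtime results against the PyTorch model. The checkpoint name,
      the "model.onnx" path, and the tolerance are illustrative assumptions rather than
      values from the PR, and the export itself is assumed to have been run beforehand
      (the PR documents a `python -m transformers.onnx` entry point; check the shipped
      docs for the exact flags).

      ```python
      import numpy as np
      import onnxruntime
      import torch
      from transformers import AutoModel, AutoTokenizer

      # Illustrative validation of an exported graph against the source model.
      checkpoint = "distilbert-base-uncased"   # assumed checkpoint
      tokenizer = AutoTokenizer.from_pretrained(checkpoint)
      model = AutoModel.from_pretrained(checkpoint)

      inputs = tokenizer("Validating the ONNX export", return_tensors="pt")

      with torch.no_grad():
          reference = model(**inputs).last_hidden_state.numpy()

      session = onnxruntime.InferenceSession("model.onnx")  # assumed export location
      onnx_inputs = {name: tensor.numpy() for name, tensor in inputs.items()}
      onnx_output = session.run(None, onnx_inputs)[0]

      # Exported and original outputs will not match bit-for-bit; compare with a tolerance.
      assert np.allclose(reference, onnx_output, atol=1e-4), "ONNX and PyTorch outputs diverge"
      ```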
    • Sylvain Gugger · 6f1adc43
    • Init pickle (#12567) · 0a6b9048
      Sylvain Gugger authored
      * Try to pickle transformers
      
      * Deal with special objs better
      
      * Make picklable
      0a6b9048
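
      As a minimal illustration of the property this commit is after, a simple pickle
      round-trip on an object imported through the package's init; the specific class
      and values are arbitrary choices for the example, not the PR's test cases.

      ```python
      import pickle

      from transformers import BertConfig

      # Round-trip an object reachable through transformers' (lazy) __init__.
      config = BertConfig(hidden_size=128, num_attention_heads=4, num_hidden_layers=2)
      restored = pickle.loads(pickle.dumps(config))

      assert type(restored) is BertConfig
      assert restored.hidden_size == config.hidden_size
      ```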
    • raise exception when arguments to pipeline are incomplete (#12548) · b29c3945
      Hwijeen Ahn authored
      * raise exception when arguments are incomplete
      
      * change exception to runtime error
      b29c3945
  3. 07 Jul, 2021 12 commits
  4. 06 Jul, 2021 9 commits
  5. 05 Jul, 2021 8 commits