- 05 Aug, 2021 1 commit
-
-
Sasha Luccioni authored
Updating the import for load_dataset
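For context, a minimal sketch of the import this refers to, assuming the `datasets` library; the dataset name is only a placeholder:

```python
from datasets import load_dataset

# Placeholder dataset: any Hub dataset loads the same way.
dataset = load_dataset("glue", "sst2", split="train")
print(dataset[0])
```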
-
- 04 Aug, 2021 10 commits
-
-
NielsRogge authored
* First pass
* Make conversion script work
* Improve conversion script
* Fix bug, conversion script working
* Improve conversion script, implement BEiTFeatureExtractor
* Make conversion script work based on URL
* Improve conversion script
* Add tests, add documentation
* Fix bug in conversion script
* Fix another bug
* Add support for converting masked image modeling model
* Add support for converting masked image modeling
* Fix bug
* Add print statement for debugging
* Fix another bug
* Make conversion script finally work for masked image modeling models
* Move id2label for datasets to JSON files on the hub
* Make sure id's are read in as integers
* Add integration tests
* Make style & quality
* Fix test, add BEiT to README
* Apply suggestions from @sgugger's review
* Apply suggestions from code review
* Make quality
* Replace nielsr by microsoft in tests, add docs
* Rename BEiT to Beit
* Minor fix
* Fix docs of BeitForMaskedImageModeling

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Lysandre Debut authored
-
Arman Cohan authored
-
Patrick von Platen authored
* fix_torch_device_generate_test
* remove @
* fix flax docs
* correct more docs in flax
* another correction
* fix flax docs
* Apply suggestions from code review
-
Patrick von Platen authored
* finish PR
* finish mt5
* push
* up
* Update tests/test_modeling_flax_mt5.py

Co-authored-by: Suraj Patil <surajp815@gmail.com>
-
Patrick von Platen authored
* [Flax] Align device name in docs * make style * fix import error
-
Aktsvigun authored
* pad_to_multiple_of added to DataCollatorForWholeWordMask
* pad_to_multiple_of added to DataCollatorForWholeWordMask

Co-authored-by: Цвигун Аким Олегович <AOTsvigun@sberbank.ru>
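A minimal usage sketch of the new argument, assuming a standard BERT checkpoint; padding to a multiple of 8 is just a common choice, not something mandated by the change:

```python
from transformers import BertTokenizerFast, DataCollatorForWholeWordMask

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")

# Pad every batch up to a multiple of 8 tokens, which tends to help
# tensor-core kernels on recent GPUs/TPUs.
collator = DataCollatorForWholeWordMask(
    tokenizer=tokenizer,
    mlm_probability=0.15,
    pad_to_multiple_of=8,
)
```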
-
Lysandre Debut authored
* Return raw outputs in TextClassificationPipeline
* Style
* Support for problem type
* Update src/transformers/pipelines/text_classification.py
* Apply Nicolas' comments

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
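A rough sketch of what this enables; the `function_to_apply` parameter and the `"none"` option are assumptions about the resulting API, and the checkpoint name is only illustrative:

```python
from transformers import pipeline

clf = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)

# Assumed option: "none" skips the softmax/sigmoid and returns raw scores.
print(clf("I really enjoyed this movie!", function_to_apply="none"))
```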
-
Sylvain Gugger authored
* Fix from_pretrained with corrupted state_dict
* Adapt test
* Use better checkpoint
* Style
* Clean up
-
NielsRogge authored
-
- 03 Aug, 2021 3 commits
-
-
Michal Szutenberg authored
This change enables tf.keras.mixed_precision with bf16
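A minimal sketch of what this enables in a TF2 Keras setup; the checkpoint name is only illustrative:

```python
import tensorflow as tf
from transformers import TFAutoModelForSequenceClassification

# Run compute in bfloat16 while keeping variables in float32.
tf.keras.mixed_precision.set_global_policy("mixed_bfloat16")

model = TFAutoModelForSequenceClassification.from_pretrained("bert-base-uncased")
```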
-
Philip May authored
* fix #12970
* Update tests/test_trainer.py
* Update tests/test_trainer.py
* Update tests/test_trainer.py
* remove unnecessary issue link
* fix test formatting

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
-
- 02 Aug, 2021 3 commits
-
-
Chungman Lee authored
* fix typo in example/text-classification README * add space to align the table
-
Sylvain Gugger authored
-
Tadej Svetina authored
-
- 01 Aug, 2021 1 commit
-
-
Alex Hedges authored
-
- 30 Jul, 2021 6 commits
-
-
Stefan Schweter authored
-
Sylvain Gugger authored
-
Kevin Canwen Xu authored
* Add multilingual documentation support
* Add multilingual documentation support
* make style
* make style
* revert
-
wulu473 authored
Co-authored-by: Lukas Wutschitz <lukas.wutschitz@microsoft.com>
-
harshithapv authored
* minor change to log azureml only for rank 0 * fix typo
-
21jun authored
The help text for `ModelArguments.gradient_checkpointing` should read "If True, use gradient checkpointing to save memory at the expense of slower backward pass." rather than "Whether to freeze the feature extractor layers of the model.", which was copied from the `freeze_feature_extractor` argument.
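A minimal sketch of the corrected argument definition, assuming the usual HfArgumentParser dataclass pattern used in the example scripts; the defaults here are placeholders:

```python
from dataclasses import dataclass, field

@dataclass
class ModelArguments:
    gradient_checkpointing: bool = field(
        default=False,
        metadata={
            "help": "If True, use gradient checkpointing to save memory at the "
            "expense of slower backward pass."
        },
    )
    freeze_feature_extractor: bool = field(
        default=True,
        metadata={"help": "Whether to freeze the feature extractor layers of the model."},
    )
```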
-
- 29 Jul, 2021 3 commits
-
-
Kevin Canwen Xu authored
* Add CpmTokenizerFast * Fix isort * Overwrite _batch_encode_plus
-
Nicolas Patry authored
* Update feature extraction pipeline.
* Leaving 1 small model for actual values check.
* Fixes tests:
  - Better support for tokenizer with no pad token
  - Increasing PegasusModelTesterConfig for pipelines
  - Tests of feature extraction are more permissive + don't test multimodal models + encoder-decoder.
* Fixing model loading with incorrect shape (+ model with HEAD).
* Update tests/test_pipelines_common.py
* Revert modeling_utils modification.
* Some corrections.
* Update tests/test_pipelines_common.py
* Update tests/test_pipelines_feature_extraction.py
* Syntax.
* Fixing text-classification tests.
* Don't modify this file.

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Funtowicz Morgan authored
* Raise an issue if the pytorch version is < 1.8.0
* Attempt to add a test to ensure it correctly raises.
* Missing docstring.
* Second attempt, patch with string absolute import.
* Let's do the call before checking it was called ...
* use the correct function ... 🤦
* Raise ImportError and AssertionError respectively when unable to find torch and torch version is not sufficient.
* Correct path mock patching
* relax constraint for torch_onnx_dict_inputs to ge instead of eq.
* Style.
* Split each version requirements for torch.
* Let's compare version directly.
* Import torch_version after checking pytorch is installed.
* @require_torch
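A rough sketch of the version gate described above; the helper name, constant, and message wording are illustrative rather than the exact library code:

```python
from packaging import version

TORCH_ONNX_MINIMUM_VERSION = version.parse("1.8.0")

def check_torch_for_onnx_export():
    # Raise ImportError if torch is missing, AssertionError if it is too old.
    try:
        import torch
    except ImportError:
        raise ImportError("ONNX export requires PyTorch to be installed.")

    if version.parse(torch.__version__) < TORCH_ONNX_MINIMUM_VERSION:
        raise AssertionError(
            f"ONNX export requires torch >= {TORCH_ONNX_MINIMUM_VERSION}, "
            f"but torch {torch.__version__} is installed."
        )
```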
-
- 28 Jul, 2021 12 commits
-
-
Will Frey authored
Change `PreTrainedConfig` -> `PretrainedConfig` in the docstring for `AutoTokenizer.from_pretrained(...)`.
-
Will Frey authored
Fix `config.decoder.__class` -> `config.decoder.__class__`
-
Will Frey authored
Change `torch.Tensor` -> `torch.FloatTensor` in `TemperatureLogitsWarper` to be consistent with the `LogitsWarper` ABC signature annotation.
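A stripped-down sketch of the aligned signature; it mirrors what a temperature warper does but is not the library's exact implementation:

```python
import torch

class TemperatureLogitsWarper:
    def __init__(self, temperature: float):
        if not isinstance(temperature, float) or temperature <= 0:
            raise ValueError(f"`temperature` has to be a strictly positive float, but is {temperature}")
        self.temperature = temperature

    # scores is annotated as torch.FloatTensor, matching the LogitsWarper ABC.
    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
        return scores / self.temperature
```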
-
Will Frey authored
While `Iterable[Iterable[int]]` is a nicer annotation (it's covariant!), the defensive statements parsing out `bad_words_ids` in `__init__(...)` force the caller to pass in `List[List[int]]`. I've changed the annotation to make that clear.
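A hedged sketch of why the concrete type is the honest annotation: the defensive checks below, modelled on the processor's `__init__`, only accept real lists, so arbitrary iterables are rejected at runtime:

```python
from typing import List

class NoBadWordsLogitsProcessor:
    def __init__(self, bad_words_ids: List[List[int]], eos_token_id: int):
        # These isinstance checks are what rule out generators and other iterables.
        if not isinstance(bad_words_ids, list) or len(bad_words_ids) == 0:
            raise ValueError(f"`bad_words_ids` has to be a non-empty list, but is {bad_words_ids}.")
        if any(not isinstance(bad_word_ids, list) for bad_word_ids in bad_words_ids):
            raise ValueError(f"`bad_words_ids` has to be a list of lists, but is {bad_words_ids}.")
        self.bad_words_ids = bad_words_ids
        self.eos_token_id = eos_token_id
```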
-
chutaklee authored
* fix distiller * fix style
-
Will Frey authored
`_BaseAutoModelClass` was missing `classmethod` decorators on the `from_config(...)` and `from_pretrained(...)` methods.
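A minimal sketch of the fix, assuming the usual auto-class shape; the point is that both factory methods are invoked on the class itself rather than on an instance:

```python
class _BaseAutoModelClass:
    _model_mapping = None

    @classmethod
    def from_config(cls, config, **kwargs):
        # Look up and instantiate the concrete model class for this config type (sketched).
        ...

    @classmethod
    def from_pretrained(cls, pretrained_model_name_or_path, *model_args, **kwargs):
        # e.g. AutoModel.from_pretrained("bert-base-uncased") works without an instance.
        ...
```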
-
Will Frey authored
Change `score` -> `scores` because the argument is not positional-only, so you need consistently named parameters for the subclasses. The subclasses appear to favor `scores` over `score`.
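An illustration, with hypothetical class names, of why the parameter name must match across the hierarchy: keyword calls resolve by name, so a subclass that renames `scores` to `score` breaks them:

```python
import torch

class StoppingCriterion:
    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> bool:
        raise NotImplementedError

class MaxLengthCriterion(StoppingCriterion):
    # Keeps the base class's parameter name `scores`.
    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> bool:
        return input_ids.shape[-1] >= 10

criterion = MaxLengthCriterion()
# This keyword-style call only works if every subclass keeps the name `scores`.
done = criterion(input_ids=torch.ones(1, 12, dtype=torch.long), scores=torch.zeros(1, 100))
```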
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Buddhi Chathuranga Senarathna authored
-
Elysium1436 authored
* Fixed train_test_split test_size argument
* `Seq2SeqTrainer` set max_length and num_beams only when non None (#12899)
* set max_length and num_beams only when non None
* fix instance variables
* fix code style
* [FLAX] Minor fixes in CLM example (#12914)
* readme: fix retrieval of vocab size for flax clm example
* examples: fix flax clm example when using training/evaluation files
* Fix module path for symbolic_trace example

Co-authored-by: cchen-dialpad <47165889+cchen-dialpad@users.noreply.github.com>
Co-authored-by: Stefan Schweter <stefan@schweter.it>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
-
- 27 Jul, 2021 1 commit
-
-
Sylvain Gugger authored
-