Commits · 5008e08885b21a4cdc1df8efa4405a335db06128 · chenpangpang / transformers

09 Aug, 2021 4 commits

Lysandre Debut authored Aug 09, 2021



* Add to ONNX docs

* Add MBART example

* Update docs/source/serialization.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

5008e088

Add MBART to models exportable with ONNX (#13049) · 6f5ab9da
Lysandre Debut authored Aug 09, 2021
```
* Add MBART to models exportable with ONNX

* unittest mock

* Add tests

* Misc fixes
```
6f5ab9da

[Flax] Refactor gpt2 & bert example docs (#13024) · 13a9c9a3

Patrick von Platen authored Aug 09, 2021



* fix_torch_device_generate_test

* remove @

* improve docs for clm

* speed-ups

* correct t5 example as well

* push final touches

* Update examples/flax/language-modeling/README.md

* correct docs for mlm

* Update examples/flax/language-modeling/README.md
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

13a9c9a3

tfhub.de -> tfhub.dev (#12565) · 3ff2cde5
abhishek thakur authored Aug 09, 2021

3ff2cde5

08 Aug, 2021 2 commits
- Update README.md · 24cbf6bc
  Patrick von Platen authored Aug 08, 2021
  
  24cbf6bc
- Use min version for huggingface-hub dependency (#12961) · 7390d9de
  lewtun authored Aug 08, 2021
```
* Use min version for huggingface-hub dependency

* Update dependency version table
```
  7390d9de
06 Aug, 2021 6 commits

Tpu tie weights (#13030) · 7fcee113

Sylvain Gugger authored Aug 06, 2021

* Fix tied weights on TPU

* Manually tie weights in no trainer examples

* Fix for test

* One last missing

* Gettning owned by my scripts

* Address review comments

* Fix test

* Fix tests

* Fix reformer tests

7fcee113

Put smaller ALBERT model (#13028) · 1bf38611
Lysandre Debut authored Aug 06, 2021

1bf38611

T5 with past ONNX export (#13014) · dc420b0e

Michael Benayoun authored Aug 06, 2021



T5 with past ONNX export, and more explicit past_key_values inputs and outputs names for ONNX model
Authored-by: Michael Benayoun <michael@huggingface.co>

dc420b0e

FX submodule naming fix (#13016) · ee112246

Michael Benayoun authored Aug 06, 2021



Changed the way dynamically inserted submodules are named and the method used to insert them
Authored-by: Michael Benayoun <michael@huggingface.co>

ee112246

[WIP] Disentangle auto modules from other modeling files (#13023) · 9870093f

Sylvain Gugger authored Aug 06, 2021

* Initial work

* All auto models

* All tf auto models

* All flax auto models

* Tokenizers

* Add feature extractors

* Fix typos

* Fix other typo

* Use the right config

* Remove old mapping names and update logic in AutoTokenizer

* Update check_table

* Fix copies and check_repo script

* Fix last test

* Add back name

* clean up

* Update template

* Update template

* Forgot a )

* Use alternative to fixup

* Fix TF model template

* Address review comments

* Address review comments

* Style

9870093f

[Flax T5] Speed up t5 training (#13012) · 2e408236

Patrick von Platen authored Aug 06, 2021



* fix_torch_device_generate_test

* remove @

* update

* up

* fix

* remove f-stings

* correct readme

* up
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2e408236

05 Aug, 2021 4 commits
- [Flax] Correct pt to flax conversion if from base to head (#13006) · 60e448c8
  Patrick von Platen authored Aug 05, 2021
```
* finish PR

* add tests

* correct tests

* finish

* correct other flax tests

* better naming

* correct naming

* finish

* apply sylvains suggestions
```
  60e448c8
- Replace // operator with / operator + long() (#13013) · 33929448
  Nils Reimers authored Aug 05, 2021
  
  33929448
- GPT-Neo ONNX export (#12911) · a6d62aab
  Michael Benayoun authored Aug 05, 2021
```
GPT-Neo ONNX export and task / feature refactoring
Authored-by: Michael Benayoun <michael@huggingface.co>
```
  a6d62aab
- Create perplexity.rst (#13004) · 8aa01d2a
  Sasha Luccioni authored Aug 05, 2021
```
Updating the import for load_dataset
```
  8aa01d2a
04 Aug, 2021 10 commits

Add BEiT (#12994) · 83e5a106

NielsRogge authored Aug 04, 2021



* First pass

* Make conversion script work

* Improve conversion script

* Fix bug, conversion script working

* Improve conversion script, implement BEiTFeatureExtractor

* Make conversion script work based on URL

* Improve conversion script

* Add tests, add documentation

* Fix bug in conversion script

* Fix another bug

* Add support for converting masked image modeling model

* Add support for converting masked image modeling

* Fix bug

* Add print statement for debugging

* Fix another bug

* Make conversion script finally work for masked image modeling models

* Move id2label for datasets to JSON files on the hub

* Make sure id's are read in as integers

* Add integration tests

* Make style & quality

* Fix test, add BEiT to README

* Apply suggestions from @sgugger's review

* Apply suggestions from code review

* Make quality

* Replace nielsr by microsoft in tests, add docs

* Rename BEiT to Beit

* Minor fix

* Fix docs of BeitForMaskedImageModeling
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

83e5a106

Skip ProphetNet test (#12462) · 0dd1152c
Lysandre Debut authored Aug 04, 2021

0dd1152c
create tensors on device (#12846) · f8265387
Arman Cohan authored Aug 04, 2021

f8265387

[Flax] Correct flax docs (#12782) · fbf468b0

Patrick von Platen authored Aug 04, 2021

* fix_torch_device_generate_test

* remove @

* fix flax docs

* correct more docs in flax

* another correction

* fix flax docs

* Apply suggestions from code review

fbf468b0

[Flax] Correctly Add MT5 (#12988) · a317e6c3

Patrick von Platen authored Aug 04, 2021



* finish PR

* finish mt5

* push

* up

* Update tests/test_modeling_flax_mt5.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

a317e6c3

[Flax] Align jax flax device name (#12987) · da9754a3
Patrick von Platen authored Aug 04, 2021
```
* [Flax] Align device name in docs

* make style

* fix import error
```
da9754a3

pad_to_multiple_of added to DataCollatorForWholeWordMask (#12999) · 07df5578

Aktsvigun authored Aug 04, 2021



* pad_to_multiple_of added to DataCollatorForWholeWordMask

* pad_to_multiple_of added to DataCollatorForWholeWordMask
Co-authored-by: Цвигун Аким Олегович <AOTsvigun@sberbank.ru>

07df5578

Return raw outputs in TextClassificationPipeline (#8328) · 3f44a66c

Lysandre Debut authored Aug 04, 2021



* Return raw outputs in TextClassificationPipeline

* Style

* Support for problem type

* Update src/transformers/pipelines/text_classification.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply Nicolas' comments
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

3f44a66c

Fix from_pretrained with corrupted state_dict (#12939) · d4c834d2

Sylvain Gugger authored Aug 04, 2021

* Fix from_pretrained with corrupted state_dict

* Adapt test

* Use better checkpoint

* Style

* Clean up

d4c834d2

Replace nielsr by google namespace in tests (#12453) · a28da4c4
NielsRogge authored Aug 04, 2021

a28da4c4

03 Aug, 2021 3 commits

Cast logits to fp32 at the end of TF_T5 (#12332) · f064e0a4
Michal Szutenberg authored Aug 03, 2021
```
This change enables tf.keras.mixed_precision with bf16
```
f064e0a4

fix `Trainer.train(resume_from_checkpoint=False)` is causing an exception (#12981) · b7439675

Philip May authored Aug 03, 2021



* fix #12970

* Update tests/test_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/test_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/test_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove unnecessary issue link

* fix test formatting
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b7439675

Fix template for inputs docstrings (#12976) · 790f1c95
Sylvain Gugger authored Aug 03, 2021

790f1c95

02 Aug, 2021 3 commits
- fix typo in example/text-classification README (#12974) · 75b8990d
  Chungman Lee authored Aug 02, 2021
```
* fix typo in example/text-classification README

* add space to align the table
```
  75b8990d
- Place BigBirdTokenizer in sentencepiece-only objects (#12975) · c1a65385
  Sylvain Gugger authored Aug 02, 2021
  
  c1a65385
- Fix typo in example of DPRReader (#12954) · b5995bad
  Tadej Svetina authored Aug 02, 2021
  
  b5995bad
01 Aug, 2021 1 commit
- Set tb_writer to None in TensorBoardCallback.on_train_end() (#12963) · a4340d3b
  Alex Hedges authored Aug 01, 2021
  
  a4340d3b
30 Jul, 2021 6 commits
- examples: use correct way to get vocab size in flax lm readme (#12947) · 3d4b3bc3
  Stefan Schweter authored Jul 30, 2021
  
  3d4b3bc3
- Fix division by zero in NotebookProgressPar (#12953) · 23d6761f
  Sylvain Gugger authored Jul 30, 2021
  
  23d6761f
- Add multilingual documentation support (#12952) · 8ff619d9
  Kevin Canwen Xu authored Jul 30, 2021
```
* Add multilingual documentation support

* Add multilingual documentation support

* make style

* make style

* revert
```
  8ff619d9
- Add substep callbacks (#12951) · fe6ff4a9
  wulu473 authored Jul 30, 2021
```
Co-authored-by: Lukas Wutschitz <lukas.wutschitz@microsoft.com>
```
  fe6ff4a9
- Log Azure ML metrics only for rank 0 (#12766) · f84226b7
  harshithapv authored Jul 30, 2021
```
* minor change to log azureml only for rank 0

* fix typo
```
  f84226b7
- fix typo in gradient_checkpointing arg (#12855) · 5c673efa
  21jun authored Jul 30, 2021
```
help for `ModelArguments.gradient_checkpointing` should be
"If True, use gradient checkpointing to save memory
at the expense of slower backward pass."
not "Whether to freeze the feature extractor layers of the model."
(which is duplicated from `freeze_feature_extractor` arg)
```
  5c673efa
29 Jul, 2021 1 commit
- Add CpmTokenizerFast (#12938) · fd0255b4
  Kevin Canwen Xu authored Jul 30, 2021
```
* Add CpmTokenizerFast

* Fix isort

* Overwrite _batch_encode_plus
```
  fd0255b4