Commits · 42f63e3871ea10bb9dfc9345642bceb36458ac62 · chenpangpang / transformers

13 Nov, 2020 4 commits
- Merge remote-tracking branch 'origin/master' · 42f63e38
  Sylvain Gugger authored Nov 13, 2020
  
  42f63e38
- Update doc for v3.5.1 · bb03a14e
  Sylvain Gugger authored Nov 13, 2020
  
  bb03a14e
- Update deepset/roberta-base-squad2 model card (#8522) · 4df6b593
  Branden Chan authored Nov 13, 2020
```
* Update README.md

* Update README.md
```
  4df6b593
- Remove typo · 0c9bae09
  Sylvain Gugger authored Nov 12, 2020
  
  0c9bae09
12 Nov, 2020 9 commits
- Add pretraining loss computation for TF Bert pretraining (#8470) · 5d805394
  Julien Plu authored Nov 12, 2020
```
* Add pretraining loss computation for TF Bert pretraining

* Fix labels creation

* Fix T5 model

* restore T5 kwargs

* try a generic fix for pretraining models

* Apply style

* Override the prepare method for the BERT tests
```
  5d805394
- Use LF instead of os.linesep (#8491) · 91a67b75
  Julien Plu authored Nov 12, 2020
  
  91a67b75
- Try to understand and apply Sylvain's comments (#8458) · 27b3ff31
  Julien Plu authored Nov 12, 2020
  
  27b3ff31
- fix SqueezeBertForMaskedLM (#8479) · 0fa03498
  Forrest Iandola authored Nov 12, 2020
  
  0fa03498
- Model sharing doc (#8498) · 79330546
  Sylvain Gugger authored Nov 12, 2020
```
* Model sharing doc

* Style
```
  79330546
- Fix doc bug (#8500) · d65e0bfe
  Chengxi Guo authored Nov 13, 2020
```
* fix doc bug
Signed-off-by: mymusise <mymusise1@gmail.com>

* fix example bug
Signed-off-by: mymusise <mymusise1@gmail.com>
```
  d65e0bfe
- quick fix on concatenating text to support more datasets (#8474) · 924c624a
  zeyuyun1 authored Nov 12, 2020
  
  924c624a
- Fix typo in roberta-base-squad2-v2 model card (#8489) · 17b1fd80
  Antonio Lanza authored Nov 12, 2020
  
  17b1fd80
- [model_cards] other chars than [\w\-_] not allowed anymore in model names · c6c08ebf
  Julien Chaumond authored Nov 12, 2020
```
cc @Pierrci
```
  c6c08ebf
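The character restriction in the commit title above can be illustrated with a short check. This is a minimal sketch, assuming the rule is applied to the model name as a whole; the helper name `is_valid_model_name` is hypothetical, and the actual validation code is not shown in this log.

```python
import re

# Assumption: a name is accepted only if every character matches [\w\-_],
# the character class quoted in the commit title. The real check may differ.
VALID_MODEL_NAME = re.compile(r"[\w\-_]+")


def is_valid_model_name(name: str) -> bool:
    """Return True if `name` contains only word characters, '-' or '_'."""
    return VALID_MODEL_NAME.fullmatch(name) is not None


# Hyphens and digits are fine; spaces and dots fall outside the class.
for candidate in ["roberta-base-squad2", "roberta base", "bert.base"]:
    print(candidate, is_valid_model_name(candidate))
```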
11 Nov, 2020 11 commits

Update deploy-docs dependencies on CI to enable Flax (#8475) · 121c24ef

Funtowicz Morgan authored Nov 12, 2020



* Update deploy-docs dependencies on CI to enable Flax
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Added pair of ""
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

121c24ef

[s2s] distill t5-large -> t5-small (#8376) · 81ebd706
Sumithra Bhakthavatsalam authored Nov 11, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
81ebd706

Flax/Jax documentation (#8331) · a5b68232

Funtowicz Morgan authored Nov 11, 2020



* First addition of Flax/Jax documentation
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* make style

* Ensure input order match between Bert & Roberta
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Install dependencies "all" when building doc
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* wraps build_doc deps with ""
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Addressing @sgugger comments.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Use list to highlight JAX features.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Make style.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Let's not look too much into the future for now.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Style
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

a5b68232

Skip test until investigation · c7b6bbec
Lysandre authored Nov 11, 2020

c7b6bbec
Replaced some iadd operations on lists with proper list methods. (#8433) · aa2a2c65
Beomsoo Kim authored Nov 12, 2020

aa2a2c65

Add TFDPR (#8203) · 026a2ff2

Ratthachat (Jung) authored Nov 12, 2020

* Create modeling_tf_dpr.py

* Add TFDPR

* Add back TFPegasus, TFMarian, TFMBart, TFBlenderBot

The last commit accidentally deleted these 4 lines, so I am restoring them

* Add TFDPR

* Add TFDPR

* clean up some comments, add TF input-style doc string

* Add TFDPR

* Make return_dict=False as default

* Fix return_dict bug (in .from_pretrained)

* Add get_input_embeddings()

* Create test_modeling_tf_dpr.py

The current version already passes all 27 tests!
Please see the test run at:
https://colab.research.google.com/drive/1czS_m9zy5k-iSJbzA_DP1k1xAAC_sdkf?usp=sharing



* fix quality

* delete init weights

* run fix copies

* fix repo consis

* del config_class, load_tf_weights

They should be 'pytorch only'

* add config_class back

After removing it, the tests failed, so in total only "use_tf_weights = None" is removed, on Lysandre's suggestion

* newline after .. note::

* import tf, np (Necessary for ModelIntegrationTest)

* slow_test from_pretrained with from_pt=True

At the moment we don't have TF weights (since we don't have an official TF model)
Previously, I did not run the slow tests, so I missed this bug

* Add simple TFDPRModelIntegrationTest

Note that this is just a test that TF and PyTorch give approximately the same output.
However, I could not test against the official DPR repo's output yet

* upload correct tf model

* remove position_ids as missing keys
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: patrickvonplaten <patrick@huggingface.co>

026a2ff2

Example NER script predicts on tokenized dataset (#8468) · a38d1c7c

sarnoult authored Nov 11, 2020

The new run_ner.py script tries to run prediction on the input
test set `datasets["test"]`, but it should be the tokenized set
`tokenized_datasets["test"]`

a38d1c7c
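The difference between the two dataset objects can be sketched with toy stand-ins. Everything here is an assumption for illustration: the dictionaries, the `tokenize` function and the `predict` function are hypothetical and only mimic the shape of the problem, not the actual `run_ner.py` code.

```python
# Toy stand-in for the raw dataset: it only holds word-level tokens.
datasets = {"test": [{"tokens": ["Hugging", "Face"]}]}


def tokenize(example):
    # Hypothetical tokenizer: uses each word's length as a fake input id.
    return {**example, "input_ids": [len(tok) for tok in example["tokens"]]}


# The tokenized dataset carries the model inputs the raw one lacks.
tokenized_datasets = {
    split: [tokenize(ex) for ex in rows] for split, rows in datasets.items()
}


def predict(rows):
    # Hypothetical model: it needs "input_ids", like a real model would.
    return [[i % 2 for i in row["input_ids"]] for row in rows]


# Works: the tokenized split has the "input_ids" column.
predictions = predict(tokenized_datasets["test"])

# Passing datasets["test"] instead raises KeyError: 'input_ids'.
```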

Fix next sentence output (#8466) · 069b6384
Julien Plu authored Nov 11, 2020

069b6384

Add next sentence prediction loss computation (#8462) · da842e4e

Julien Plu authored Nov 11, 2020

* Add next sentence prediction loss computation

* Apply style

* Fix tests

* Add forgotten import

* Add forgotten import

* Use a new parameter

* Remove kwargs and use positional arguments

da842e4e

Fix TF Longformer (#8460) · 23290836
Julien Plu authored Nov 11, 2020

23290836
[model_cards] harmonization · 8dda9167
Julien Chaumond authored Nov 11, 2020

8dda9167

10 Nov, 2020 16 commits

Bug fix for modeling utilities function: apply_chunking_to_forward, chunking... · eb3bd73c

Pedro authored Nov 10, 2020


Bug fix for the modeling utilities function apply_chunking_to_forward: inputs only need to match along the chunking dimension, but an exception was raised whenever their complete shapes differed (#8391)
Co-authored-by: pedro <pe25171@mit.edu>

eb3bd73c
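The corrected check described above can be sketched on plain shape tuples. This is a simplified illustration, not the library's code: the helper names `check_chunk_dim` and `chunk_bounds` are hypothetical, and the requirement that the chunked size be a multiple of the chunk size is an assumption of the sketch.

```python
def check_chunk_dim(shapes, chunk_dim):
    """Validate that all inputs agree along `chunk_dim` only.

    Other dimensions are free to differ, which is the behavior the
    fix describes. Returns the shared size along `chunk_dim`.
    """
    size = shapes[0][chunk_dim]
    for shape in shapes:
        if shape[chunk_dim] != size:
            raise ValueError(
                f"Inputs must have the same size in dimension {chunk_dim}: "
                f"got {shape[chunk_dim]} and {size}"
            )
    return size


def chunk_bounds(size, chunk_size):
    """Return (start, end) index pairs splitting `size` into equal chunks."""
    # Assumption of this sketch: the size must divide evenly.
    if size % chunk_size != 0:
        raise ValueError(f"{size} is not a multiple of chunk size {chunk_size}")
    return [(start, start + chunk_size) for start in range(0, size, chunk_size)]


# Hidden sizes differ (16 vs 32) but the chunking dimension (1) matches.
shapes = [(2, 8, 16), (2, 8, 32)]
size = check_chunk_dim(shapes, chunk_dim=1)
bounds = chunk_bounds(size, chunk_size=4)
```

A check that compared complete shapes would reject the two inputs above even though they can be chunked together along dimension 1.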

fix t5 token type ids (#8437) · 70708cca
Patrick von Platen authored Nov 10, 2020

70708cca

[No merge] TF integration testing (#7621) · 9fd1f562

Lysandre Debut authored Nov 10, 2020

* stash

* TF Integration testing for ELECTRA, BERT, Longformer

* Trigger slow tests

* Apply suggestions from code review

9fd1f562

Add missing tasks to `pipeline` docstring (#8428) · 8fe6629b
Santiago Castro authored Nov 10, 2020

8fe6629b
using multi_gpu consistently (#8446) · 02bdfc02
Stas Bekman authored Nov 10, 2020
```
* s|multiple_gpu|multi_gpu|g; s|multigpu|multi_gpu|g

* doc
```
02bdfc02
fix t5 special tokens (#8435) · b9356945
Patrick von Platen authored Nov 10, 2020

b9356945
Add missing import (#8444) · cace39af
Julien Plu authored Nov 10, 2020
```
* Add missing import

* Fix dummy objects
```
cace39af

[testing utils] get_auto_remove_tmp_dir more intuitive behavior (#8401) · e21340da

Stas Bekman authored Nov 10, 2020



* [testing utils] get_auto_remove_tmp_dir default change

Now that I have been using `get_auto_remove_tmp_dir` for a while, I realized that the defaults aren't optimal.

99% of the time we want the tmp dir to be empty at the beginning of the test - so changing the default to `before=True` - this shouldn't impact any tests since this feature is used only during debug.

* simplify things

* update docs

* fix doc layout

* style

* Update src/transformers/testing_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* better 3-state doc

* style

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* s/tmp/temporary/ + style

* correct the statement
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e21340da
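The effect of defaulting to `before=True` can be sketched with the standard library. This is only an illustration of the idea; `prepare_tmp_dir` is a hypothetical helper and not the signature or implementation of `get_auto_remove_tmp_dir` in `testing_utils.py`.

```python
import shutil
from pathlib import Path


def prepare_tmp_dir(path, before=True):
    """Return `path` as an existing directory.

    With before=True (the default in this sketch), any leftover content
    from a previous run is removed first, so the test starts empty.
    """
    path = Path(path)
    if before and path.exists():
        shutil.rmtree(path)
    path.mkdir(parents=True, exist_ok=True)
    return path
```

With `before=False` the directory is reused as-is, which can be handy when inspecting the output of a previous debug run.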

Windows dev section in the contributing file (#8436) · e7e15498

Julien Plu authored Nov 10, 2020

* Add a Windows dev section in the contributing file.

* Forgotten link

* Trigger CI

* Rework description

* Trigger CI

e7e15498

Add auto next sentence prediction (#8432) · 8551a992

Julien Plu authored Nov 10, 2020

* Add auto next sentence prediction

* Fix style

* Add mobilebert next sentence prediction

8551a992

[docs] improve bart/marian/mBART/pegasus docs (#8421) · c314b1fd
Sam Shleifer authored Nov 10, 2020

c314b1fd
Question template (#8440) · 3213d3bf
Sylvain Gugger authored Nov 10, 2020
```
* Remove SO from question template

* Styling
```
3213d3bf
[examples] better PL version check (#8429) · 5d4972e6
Stas Bekman authored Nov 10, 2020

5d4972e6
[s2s/distill] hparams.tokenizer_name = hparams.teacher (#8382) · ae1cb4ec
Shichao Sun authored Nov 10, 2020

ae1cb4ec
v3.5.0 documentation · aec51e56
Lysandre authored Nov 10, 2020

aec51e56
Release: v3.5.0 · 818878dc
Lysandre authored Nov 10, 2020

818878dc