1. 16 Mar, 2021 1 commit
  2. 15 Mar, 2021 16 commits
  3. 12 Mar, 2021 7 commits
      AdamW is now supported by default (#9624) · 4c32f9f2
      Stas Bekman authored
      Pass encoder outputs into GenerationMixin (#10599) · fa35cda9
      ymfa authored
      * Pass encoder_outputs into generate()
      
      * Remove an if-statement
      
      * Reformat
      
      * Minimize changes to generate()
      
      * Comment on input_ids
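      A minimal sketch of the pattern this change enables, assuming a seq2seq checkpoint such as facebook/bart-base (the model and inputs are illustrative, not from the PR): run the encoder once and reuse its outputs across `generate()` calls instead of re-encoding the same source every time.

        # Hypothetical usage: precompute encoder outputs, pass them to generate().
        from transformers import BartForConditionalGeneration, BartTokenizer

        tokenizer = BartTokenizer.from_pretrained("facebook/bart-base")
        model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")

        inputs = tokenizer("A long source document ...", return_tensors="pt")

        # Encode once; generate() can then skip its own encoder pass.
        encoder_outputs = model.get_encoder()(**inputs)
        summary_ids = model.generate(
            input_ids=inputs["input_ids"],
            encoder_outputs=encoder_outputs,
            num_beams=4,
        )
        print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))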
      fix: #10628 expanduser path in TrainingArguments (#10660) · 00cad2e5
      PaulLerner authored
      * fix: #10628 expanduser path in TrainingArguments
      
      * docs: explain why we expand paths in TrainingArguments
      
      * Style
      Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
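      A short sketch of the behavior this fix guarantees (paths are illustrative): a `~` in `output_dir` and the other path-like arguments is now expanded with `os.path.expanduser` instead of being treated as a literal directory name.

        import os
        from transformers import TrainingArguments

        # Before the fix, "~/checkpoints" created a literal "./~" folder;
        # now it resolves under the user's home directory.
        args = TrainingArguments(output_dir="~/checkpoints/run1")
        assert args.output_dir == os.path.expanduser("~/checkpoints/run1")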
      Add auto_wrap option in fairscale integration (#10673) · e8246f78
      Sylvain Gugger authored
      * Add auto_wrap option in fairscale integration
      
      * Style
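      A sketch of how the new option is enabled, assuming fairscale is installed: `auto_wrap` is appended to the `sharded_ddp` training argument alongside a ZeRO mode, so submodules are wrapped for sharded training automatically.

        from transformers import TrainingArguments

        # Illustrative configuration: combine ZeRO stage 3 sharding with
        # fairscale's automatic module wrapping.
        args = TrainingArguments(
            output_dir="out",
            sharded_ddp="zero_dp_3 auto_wrap",
        )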
      TensorFlow tests: having from_pt set to True requires torch to be installed. (#10664) · 184ef8ec
      Lysandre Debut authored
      * TF model exists for Blenderbot 400M
      
      * Marian
      
      * RAG
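      For context, a minimal illustration of why such tests need torch (the checkpoint name is illustrative): `from_pt=True` converts PyTorch weights on the fly, which only works with torch installed; the commits above switch tests to native TF checkpoints where they now exist.

        from transformers import TFMarianMTModel

        # Loading PyTorch weights into a TF model requires torch at runtime.
        model = TFMarianMTModel.from_pretrained(
            "Helsinki-NLP/opus-mt-en-de", from_pt=True
        )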
      Adding new parameter to `generate`: `max_time`. (#9846) · 543d0549
      Nicolas Patry authored
      * [WIP] Adding new parameter to `generate`: `max_time`.
      
      Generation capped by token count is sometimes clunky, because we don't
      know how many tokens are good enough or even how many tokens are in
      the payload (for pipelines users, for instance). This leads to
      hard-to-understand behavior.
      
      This PR proposes a new argument `max_time`, a float giving the number
      of seconds `generate` is allowed to run. Ideally, combinations such as
      `max_tokens=None`, `max_time=2` could be used to generate as many
      tokens as possible within the time budget.
      
      NB: Another possible approach consists of passing a callback to `generate`,
        putting the caller in charge of the actual decision of when to stop
        generating tokens. It opens the door to the question of which args to
        pass to this callback, and it's hard to imagine use cases for this
        early-stopping behavior other than time that aren't already covered
        by the existing parameters of `generate`.
      
      * Revamp with StoppingCriteria
      
      * Removing deprecated mentions.
      
      * Forgot arguments to stopping criteria.
      
      * Re-adding `max_length`; it's not just used as a stopping criterion.
      
      * Default value for `stopping_criteria`.
      
      * Address @patrickvonplaten comments.
      
      - More docstrings
      - Actual doc
      - Include in global namespace
      - Remove TF work.
      
      * Put back `max_length` (deprecation handled in a separate PR).
      
      * Doc quality.
      
      * Fixing old behavior without `stopping_criteria` but with `max_length`.
      
      Making sure we don't break that in the future.
      
      * Adding more tests for possible inconsistencies between `max_length`
        and `stopping_criteria`.
      
      * Fixing the torch imports.
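      A hedged sketch of both ways to use the feature described above, assuming a causal LM such as gpt2 (the model choice is illustrative): the plain `max_time` argument, and the same budget expressed as an explicit stopping criterion.

        from transformers import GPT2LMHeadModel, GPT2Tokenizer
        from transformers import MaxTimeCriteria, StoppingCriteriaList

        tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
        model = GPT2LMHeadModel.from_pretrained("gpt2")
        input_ids = tokenizer("The quick brown fox", return_tensors="pt").input_ids

        # 1) Simple form: stop after roughly two seconds of wall time.
        out = model.generate(input_ids, do_sample=True, max_length=512, max_time=2.0)

        # 2) Explicit form: the same budget as a stopping criterion.
        criteria = StoppingCriteriaList([MaxTimeCriteria(max_time=2.0)])
        out = model.generate(
            input_ids, do_sample=True, max_length=512, stopping_criteria=criteria
        )
        print(tokenizer.decode(out[0], skip_special_tokens=True))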
      Adjust loss difference (#10669) · ea46e3fa
      Lysandre Debut authored
  4. 11 Mar, 2021 16 commits