- 16 Mar, 2021 11 commits
- Lysandre Debut authored
  * Patches the full import failure and adds a test
  * Add comment
- Lysandre authored
- Lysandre authored
- Sylvain Gugger authored
- Sylvain Gugger authored
  * Add DistributedSamplerWithLoop
  * Fix typo
  * Test and small fix
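The `DistributedSamplerWithLoop` entry above is about making every process receive the same number of complete batches. As an illustration of that idea only — looping back to the start of the dataset instead of dropping or repeating the last sample — here is a stand-alone sketch; the function name and shape are hypothetical, not the actual transformers implementation:

```python
# Hypothetical sketch: shard dataset indices across processes, wrapping
# around to the beginning of the dataset so each process receives the
# same number of samples, rounded up to full batches.
def loop_sharded_indices(dataset_size, batch_size, num_processes, process_index):
    # Ceil-divide samples across processes, then round up to full batches.
    per_process = -(-dataset_size // num_processes)
    per_process = -(-per_process // batch_size) * batch_size
    total = per_process * num_processes
    # Modulo makes the "padding" samples come from the start of the
    # dataset rather than repeating the final element.
    indices = [i % dataset_size for i in range(total)]
    return indices[process_index * per_process:(process_index + 1) * per_process]
```

With 10 samples, batch size 4 and 2 processes, each process gets 8 indices and the extra 6 are wrapped from the front of the dataset.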
- Lysandre Debut authored
  * Fix DeBERTa-v2 variable assignment
  * Fix conversational pipeline test
- Suraj Patil authored
- Sylvain Gugger authored
- Lysandre Debut authored
- Sylvain Gugger authored
  * Examples version update
  * Refactor a bit
  * All version updates
  * Fixes
  * README cleanup
  * Post-release/patch
  * Fixes
  * More fixes
  * Tests
  * More fixes
  * Moar fixes
  * Make commands and update setup
  * Replace spaces with weird tabs
  * Fix test
  * Style
- Patrick von Platen authored
  * make flax tests pytorch independent
  * fix typo
  * finish
  * improve circle ci
  * fix return tensors
  * correct flax test
  * re-add sentencepiece
  * last tokenizer fixes
  * finish maybe now
- 15 Mar, 2021 16 commits
- Russell Klopfer authored
  * independent training / eval with local files
  * remove redundant assert
- Sylvain Gugger authored
  * Add minimum version check in examples
  * Style
  * No need for new line maybe?
  * Add helpful comment
- Joe Davison authored
- Lysandre Debut authored
  * Tests run on Docker
  * Comments from code review
  * Reply to itself
  * Dependencies
  Co-authored-by: Morgan <funtowiczmo@gmail.com>
- MikeG112 authored
  * Update super class reference
  * Update default value reference
  * Update src/transformers/models/wav2vec2/feature_extraction_wav2vec2.py
  * Fix format style
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
- Sylvain Gugger authored
- Patrick von Platen authored
- Sylvain Gugger authored
  * Handle save differently
  * Missing imports
  * Fix typo
  * Adapt to recent changes in save_pretrained
  * Forgotten brackets
  * Optimizer load
  * Fix world size
  * Deal with None
  * Remove needless self
- Adam Pocock authored
  * Adding required flags to non-default arguments
  * make style fix
  * Update src/transformers/hf_argparser.py
  Signed-off-by: Adam Pocock <adam.pocock@oracle.com>
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
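The hf_argparser change above maps dataclass fields without defaults to required command-line flags. A minimal sketch of that idea, assuming a simplified parser builder — the class, field names, and helper here are hypothetical, not the transformers code:

```python
import argparse
import dataclasses

# Illustrative dataclass: one field with no default (should be required
# on the command line) and one with a default (stays optional).
@dataclasses.dataclass
class TrainingConfig:
    model_name: str                  # no default -> required flag
    learning_rate: float = 5e-5      # has default -> optional flag

def build_parser(cls):
    parser = argparse.ArgumentParser()
    for field in dataclasses.fields(cls):
        has_default = field.default is not dataclasses.MISSING
        parser.add_argument(
            f"--{field.name}",
            type=field.type,
            default=field.default if has_default else None,
            # The fix sketched here: mark the flag required exactly when
            # the dataclass field has no default value.
            required=not has_default,
        )
    return parser
```

Parsing `--model_name bert-base-uncased` then succeeds with the default learning rate, while omitting `--model_name` makes argparse exit with an error.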
- Théo Matussière authored
  * split seq2seq script, update docs
  * needless diff
  * fix readme
  * remove test diff
  * s/summarization/translation
  * cr
  * fix arguments & better mbart/t5 refs
  * copyright
  * reword readme
  * s/summarization/translation
  * short script names
  * fix tests
  * fix isort, include mbart doc
  * delete old script, update tests
  * automate source prefix
  * automate source prefix for translation
  * s/translation/trans
  * fix script name (short version)
  * typos
  * exact parameter
  * remove superfluous source_prefix calls in docs
  * rename scripts & warn for source prefix
  * black
  * flake8
  Co-authored-by: theo <theo@matussie.re>
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  Co-authored-by: Suraj Patil <surajp815@gmail.com>
  Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
- Igor Shalyminov authored
  * GPT2DoubleHeadsModel made parallelizable
  * GPT2DoubleHeadsModel added as parallelizable onto the GPT2 test suite
- Sylvain Gugger authored
- Sylvain Gugger authored
- cronoik authored
  Documentation was referring to the slow tokenizer class when it should be the fast tokenizer.
- Suraj Patil authored
  * enable auto tokenizer for mbart50 tokenizers
  * fix imports
- Patrick von Platen authored
- 12 Mar, 2021 7 commits
- Stas Bekman authored
- ymfa authored
  * Pass encoder_outputs into generate()
  * Remove an if-statement
  * Reformat
  * Minimize changes to generate()
  * Comment on input_ids
- PaulLerner authored
  * fix: #10628 expanduser path in TrainingArguments
  * docs: explain why we expand paths in TrainingArguments
  * Style
  Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
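The expanduser fix above boils down to expanding a leading `~` in a user-supplied path before it is used, so `~/runs` does not become a literal `./~/runs` directory on disk. A minimal sketch (the helper name is hypothetical):

```python
import os

def normalize_output_dir(path: str) -> str:
    # Expand a leading "~" (or "~user") to the user's home directory;
    # paths without a tilde pass through unchanged.
    return os.path.expanduser(path)
```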
- Sylvain Gugger authored
  * Add auto_wrap option in fairscale integration
  * Style
- Lysandre Debut authored
  * TF model exists for Blenderbot 400M
  * Marian
  * RAG
- Nicolas Patry authored
  * [WIP] Adding new parameter to `generate`: `max_time`. Generation by token count is sometimes clunky because we don't know how many tokens are good enough, or even how many tokens are in the payload (for pipelines users, for instance). This leads to hard-to-understand behavior. This PR proposes a new argument, `max_time`, a float giving the number of seconds `generate` is allowed to run. Ideally, combinations such as `max_tokens=None`, `max_time=2` could be used to generate as many tokens as possible within the time budget. NB: another possible approach is passing a callback to `generate`, putting the caller in charge of deciding when to stop generating tokens; that opens the door to the question of which args to pass to the callback. It's hard to imagine use-cases for this early-stopping behavior other than time (that are not already covered by parameters of `generate`).
  * Revamp with StoppingCriteria
  * Removing deprecated mentions
  * Forgot arguments to stopping criteria
  * Re-adding max_length: it's not just used as a stopping criterion
  * Default value for `stopping_criteria`
  * Address @patrickvonplaten comments: more docstrings, actual doc, include in global namespace, remove TF work
  * Put back `max_length` (deprecation in a different PR)
  * Doc quality
  * Fixing old behavior without `stopping_criteria` but with `max_length`, making sure we don't break that in the future
  * Adding more tests for possible inconsistencies between `max_length` and `stopping_criteria`
  * Fixing the torch imports
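The real mechanism described above lives in transformers' `StoppingCriteria` classes; as an illustration of the time-budget idea only, here is a stand-alone sketch with hypothetical names — a criterion that trips once the wall-clock budget is spent, plugged into a toy generation loop:

```python
import time

class MaxTimeCriterion:
    """Stop generation once a wall-clock time budget is exhausted."""

    def __init__(self, max_time: float):
        self.max_time = max_time
        self.start = time.time()

    def __call__(self, generated_ids) -> bool:
        # True once the budget is spent, regardless of how many tokens
        # have been produced so far (generated_ids is unused here).
        return time.time() - self.start > self.max_time

def generate_until(criteria, next_token, max_length=None):
    # Toy decoding loop: stop on max_length or on any tripped criterion,
    # mirroring how max_length and max_time can combine.
    tokens = []
    while max_length is None or len(tokens) < max_length:
        if any(c(tokens) for c in criteria):
            break
        tokens.append(next_token())
    return tokens
```

An exhausted criterion stops the loop immediately; a generous one lets `max_length` take over, which matches the "whichever fires first" combination the commit message describes.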
- Lysandre Debut authored
- 11 Mar, 2021 6 commits
- Benjamin Fineran authored
  * fix typing error for TrainingArguments Optional[bool]
  * updating equality check for Optional[bool]
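The `Optional[bool]` check above can be illustrated with typing's structural equality: `Optional[bool]` is `Union[bool, None]`, and unions compare equal by `==` (not by identity), so an annotation check has to inspect origin and arguments. A hedged sketch, not the actual transformers code:

```python
from typing import Optional, Union, get_args, get_origin

def is_optional_bool(annotation) -> bool:
    # An Optional[bool] annotation is Union[bool, NoneType]; compare the
    # union's origin and argument set rather than using identity checks.
    return (
        get_origin(annotation) is Union
        and set(get_args(annotation)) == {bool, type(None)}
    )
```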
- Sylvain Gugger authored
- WybeKoper authored
  * Fixed broken link
  * fixed max length violation
  Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>
- jeswan authored
  * add deberta to pretraining mapping
  * add deberta_v2 to PRETRAINING_MAPPING
- Lysandre Debut authored
- Sylvain Gugger authored
  * PoC
  * Fix slow tests for the PT1.8 Embedding problem