Commits · a1bbcf3f6c20e15fe799a8659d6b7bd36fdf11ed · chenpangpang / transformers

03 Nov, 2020 1 commit

Refactoring the generate() function (#6949) · a1bbcf3f

Patrick von Platen authored Nov 03, 2020

* first draft

* show design proposition for new generate method

* up

* make better readable

* make first version

* gpt2 tests pass

* make beam search for gpt2 work

* add first encoder-decoder code

* delete typo

* make t5 work

* save indermediate

* make bart work with beam search

* finish beam search bart / t5

* add default kwargs

* make more tests pass

* fix no bad words sampler

* some fixes and tests for all distribution processors

* fix test

* fix rag slow tests

* merge to master

* add nograd to generate

* make all slow tests pass

* speed up generate

* fix edge case bug

* small fix

* correct typo

* add type hints and docstrings

* fix typos in tests

* add beam search tests

* add tests for beam scorer

* fix test rag

* finish beam search tests

* move generation tests in seperate file

* fix generation tests

* more tests

* add aggressive generation tests

* fix tests

* add gpt2 sample test

* add more docstring

* add more docs

* finish doc strings

* apply some more of sylvains and sams comments

* fix some typos

* make fix copies

* apply lysandres and sylvains comments

* final corrections on examples

* small fix for reformer

a1bbcf3f

02 Nov, 2020 7 commits
- 2 SinusoidalPositionalEmbedding fixes (#8226) · 504ff7bb
  Stas Bekman authored Nov 02, 2020
  
  504ff7bb
- fix encoder decoder bug (#8243) · dc26726d
  Patrick von Platen authored Nov 02, 2020
  
  dc26726d
- Add XLMProphetNetTokenizer to tokenization auto (#8245) · 9a23af4a
  Lysandre Debut authored Nov 02, 2020
  
  9a23af4a
- Fix TensorBoardCallback for older versions of PyTorch (#8239) · 5406f31a
  Sylvain Gugger authored Nov 02, 2020
  
  5406f31a
- Fix bad import with PyTorch <= 1.4.1 (#8237) · d1ad4bff
  Sylvain Gugger authored Nov 02, 2020
  
  d1ad4bff
- Fix ignore list behavior in doctests (#8213) · 0c92e7d9
  Santiago Castro authored Nov 02, 2020
  
  0c92e7d9
- Fix the behaviour of DefaultArgumentHandler (removing it). (#8180) · 84caa233
  Nicolas Patry authored Nov 02, 2020
```
* Some work to fix the behaviour of DefaultArgumentHandler by removing it.

* Fixing specific pipelines argument checking.
```
  84caa233
01 Nov, 2020 1 commit
- [Bug fix] Fixed value for BlenderBot pad token (#8205) · 1f12934d
  guillaume-be authored Nov 01, 2020
  
  1f12934d
30 Oct, 2020 8 commits

Fix two bugs with --logging_first_step (#8193) · 8f1c960e

Abi See authored Oct 30, 2020

* make sure that logging_first_step evaluates

* fix bug with incorrect loss on logging_first_step

* fix style

* logging_first_step only logs, not evals

8f1c960e

Minor style improvements for the Flax BERT and RoBERTa examples (#8178) · 689ff74f

Avital Oliver authored Oct 30, 2020

* Minor style improvements:

1. Use `@nn.compact` rather than `@compact` (as to not make it seem
   like compact is a standard Python decorator.
2. Move attribute docstrings from two `__call__` methods to comments
   on the attributes themselves. (This was probably a remnant from
   the pre-Linen version where the attributes were arguments to
   `call`.)

* Use black on the Flax modeling code

689ff74f

Replace swish with silu (#8166) · 00112c35

TFUsers authored Oct 30, 2020



* Replace swish with silu

* revert nn.silu to nn.swish due to older version

* simplify optimized silu conditional and fix format

* Update activations.py

* Update activations_tf.py

* Update modeling_flax_utils.py

* Update modeling_openai.py

* add swish testcase

* add pytorch swish testcase

* Add more robust python version check

* more formatting fixes
Co-authored-by: TFUsers <TFUsers@gmail.com>

00112c35

Doc fixes and filter warning in wandb (#8189) · 089cc101
Sylvain Gugger authored Oct 30, 2020

089cc101

TFMarian, TFMbart, TFPegasus, TFBlenderbot (#7987) · 566b083e

Sam Shleifer authored Oct 30, 2020



* Start plumbing

* Marian close

* Small stubs for all children

* Fixed bart

* marian working

* pegasus test is good, but failing

* Checkin tests

* More model files

* Subtle marian, pegasus integration test failures

* Works well

* rm print

* boom boom

* Still failing model2doc

* merge master

* Equivalence test failing, all others fixed

* cleanup

* Fix embed_scale

* Cleanup marian pipeline test

* Undo extra changes

* Smaller delta

* Cleanup model testers

* undo delta

* fix tests import structure

* cross test decorator

* Cleaner set_weights

* Respect authorized_unexpected_keys

* No warnings

* No warnings

* style

* Nest tf import

* black

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* functional dropout

* fixup

* Fixup

* style_doc

* embs

* shape list

* delete slow force_token_id_to_be_generated func

* fixup
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

566b083e

Fix typo: s/languaged/language/ (#8165) · 6279072f
Santiago Castro authored Oct 30, 2020

6279072f

Ci test tf super slow (#8007) · 10f8c636

Lysandre Debut authored Oct 30, 2020

* Test TF GPU CI

* Change cache

* Fix missing torch requirement

* Fix some model tests


Style

* LXMERT

* MobileBERT

* Longformer skip test

* XLNet

* The rest of the tests

* RAG goes OOM in multi gpu setup

* YAML test files

* Last fixes

* Skip doctests

* Fill mask tests

* Yaml files

* Last test fix

* Style

* Update cache

* Change ONNX tests to slow + use tiny model

10f8c636

Fixing some warnings in DeBerta (#8176) · 7e36deec
Nicolas Patry authored Oct 30, 2020
```
* Fixing some warnings in DeBerta

* Fixing docs with their rewritten version.
```
7e36deec

29 Oct, 2020 8 commits

[CI] Better reports #2 (#8163) · 05388207
Stas Bekman authored Oct 29, 2020

05388207

Fix eval ref miss in Chinese WWM. (#8115) · 9a21b506

wlhgtc authored Oct 30, 2020



* ADD: add whole word mask proxy for both eng and chinese

* MOD: adjust format

* MOD: reformat code

* MOD: update import

* MOD: fix bug

* MOD: add import

* MOD: fix bug

* MOD: decouple code and update readme

* MOD: reformat code

* Update examples/language-modeling/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* change wwm to whole_word_mask

* reformat code

* reformat

* format

* Code quality

* ADD: update chinese ref readme

* MOD: small changes

* MOD: small changes2

* update readme

* fix eval ref file miss bug

* format file

* MOD: move ref code to contrib

* MOD: add delimeter check

* reformat code

* refomat code

* Update examples/language-modeling/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

9a21b506

Fix typo: indinces -> indices (#8159) · fdf893c4

Santiago Castro authored Oct 29, 2020

* Fix typo: indinces -> indices

* Fix some more

* Fix some more

* Fix some more

* Fix CI

fdf893c4

improve error checking (#8157) · c83cec44
Stas Bekman authored Oct 29, 2020

c83cec44

Add a template for examples and apply it for mlm and plm examples (#8153) · 69117628

Sylvain Gugger authored Oct 29, 2020

* Add a template for example scripts and apply it to mlm

* Formatting

* Fix test

* Add plm script

* Add a template for example scripts and apply it to mlm

* Formatting

* Fix test

* Add plm script

* Add a template for example scripts and apply it to mlm

* Formatting

* Fix test

* Add plm script

* Styling

69117628

Smarter prediction loop and no- -> no_ in console args (#8151) · acf56408
Sylvain Gugger authored Oct 29, 2020
```
* Smarter prediction loop and no- -> no_ in console args

* Fix test
```
acf56408
Document tokenizer_class in configurations (#8152) · b0f1c0ee
Sylvain Gugger authored Oct 29, 2020

b0f1c0ee

Fix doc errors and typos across the board (#8139) · 969859d5

Santiago Castro authored Oct 29, 2020

* Fix doc errors and typos across the board

* Fix a typo

* Fix the CI

* Fix more typos

* Fix CI

* More fixes

* Fix CI

* More fixes

* More fixes

969859d5

28 Oct, 2020 5 commits
- Fix typo in `AutoModelForMaskedLM` docs (#8129) · e477eb91
  Santiago Castro authored Oct 28, 2020
  
  e477eb91
- Rename add_start_docstrings_to_callable (#8120) · 378142af
  Sylvain Gugger authored Oct 28, 2020
  
  378142af
- [DOC] Improve pipeline() docstrings for config and tokenizer (#8123) · 5193172f
  Bram Vanroy authored Oct 28, 2020
```
* Improve pipeline() docstrings

* make style

* Update wording for config
```
  5193172f
- fix(trainer_callback]: typo (#8121) · b4cacb7a
  Boris Dayma authored Oct 28, 2020
  
  b4cacb7a
- [testing] port test_trainer_distributed to distributed pytest + TestCasePlus enhancements (#8107) · 5423f2a9
  Stas Bekman authored Oct 28, 2020
```
* move the helper code into testing_utils

* port test_trainer_distributed to work with pytest

* improve docs

* simplify notes

* doc

* doc

* style

* doc

* further improvements

* torch might not be available

* real fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  5423f2a9
27 Oct, 2020 10 commits

Add AzureML in integrations via dedicated callback (#8062) · 995006ea

Davide Fiocco authored Oct 27, 2020

* first attempt to add AzureML callbacks

* func arg fix

* var name fix, but still won't fix error...

* fixing as in https://discuss.huggingface.co/t/how-to-integrate-an-azuremlcallback-for-logging-in-azure/1713/2



* Avoid lint check of azureml import

* black compliance

* Make isort happy

* Fix point typo in docs

* Add AzureML to Callbacks docs

* Attempt to make sphinx happy

* Format callback docs

* Make documentation style happy

* Make docs compliant to style
Co-authored-by: Davide Fiocco <davide.fiocco@frontiersin.net>

995006ea

infer entailment label id on zero shot pipeline (#8059) · 3e58b6b7

Joe Davison authored Oct 27, 2020

* add entailment dim argument

* rename dim -> id

* fix last name change, style

* rm arg, auto-infer only

* typo

* rm superfluous import

3e58b6b7

Fix a bug for `CallbackHandler.callback_list` (#8052) · 7bff0af0

Harutaka Kawamura authored Oct 27, 2020



* Fix callback_list

* Add test
Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

* Fix test
Signed-off-by: harupy <17039389+harupy@users.noreply.github.com>

7bff0af0

Fix assertion error message for MLflowCallback (#8091) · 8e28c327
Harutaka Kawamura authored Oct 27, 2020

8e28c327
Styling fix · 3220f21f
Sylvain Gugger authored Oct 27, 2020

3220f21f
Fix IterableDataset with __len__ in Trainer (#8095) · 286dc19a
Jonathan Chang authored Oct 27, 2020

286dc19a

[CI] generate separate report files as artifacts (#7995) · bfd5e370

Stas Bekman authored Oct 27, 2020

* better reports

* a whole bunch of reports in their own files

* clean up

* improvements

* github artifacts experiment

* style

* complete the report generator with multiple improvements/fixes

* fix

* save all reports under one dir to easy upload

* can remove temp failing tests

* doc fix

* some cleanup

bfd5e370

Fix DeBERTa docs (#8092) · 33f6ef73
Lysandre Debut authored Oct 27, 2020
```
* Fix DeBERTa docs

* Tokenizer and config
```
33f6ef73
Doc styling fixes (#8074) · c42596bc
Sylvain Gugger authored Oct 27, 2020
```
* Fix a few docstrings

* More fixes

* Styling
```
c42596bc

Fix comet_ml import and add ensure availability (#7933) · 1496931b

Doug Blank authored Oct 27, 2020

* Fix comet_ml import and add ensure availability

* Make isort happy

* Make flake8 happy

* Don't show comet_ml warn if COMET_MODE=DISABLED

* Make isort happy

1496931b