Commits · 443b0cad96a7fee62ae7c229536f01ddd7f41241 · chenpangpang / transformers

13 Jul, 2020 1 commit
- rename the function to match the rest of the test convention (#5692) · 443b0cad
  Stas Bekman authored Jul 13, 2020
  
  443b0cad
10 Jul, 2020 1 commit

Change model outputs types to self-document outputs (#5438) · edfd82f5

Sylvain Gugger authored Jul 10, 2020

* [WIP] Proposal for model outputs

* All Bert models

* Make CI green maybe?

* Fix ONNX test

* Isolate ModelOutput from pt and tf

* Formatting

* Add Electra models

* Auto-generate docstrings from outputs

* Add TF outputs

* Add some BERT models

* Revert TF side

* Remove last traces of TF changes

* Fail with a clear error message

* Add Albert and work through Bart

* Add CTRL and DistilBert

* Formatting

* Progress on Bart

* Renames and finish Bart

* Formatting

* Fix last test

* Add DPR

* Finish Electra and add FlauBERT

* Add GPT2

* Add Longformer

* Add MMBT

* Add MobileBert

* Add GPT

* Formatting

* Add Reformer

* Add Roberta

* Add T5

* Add Transformer XL

* Fix test

* Add XLM + fix XLMForTokenClassification

* Style + XLMRoberta

* Add XLNet

* Formatting

* Add doc of return_tuple arg

edfd82f5

01 Jul, 2020 1 commit
- Move tests/utils.py -> transformers/testing_utils.py (#5350) · 13deb95a
  Sam Shleifer authored Jul 01, 2020
  
  13deb95a
16 Jun, 2020 1 commit
- [cleanup] Hoist ModelTester objects to top level (#4939) · c852036b
  Amil Khare authored Jun 16, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
  c852036b
02 Jun, 2020 1 commit

Kill model archive maps (#4636) · d4c2cb40

Julien Chaumond authored Jun 02, 2020

* Kill model archive maps

* Fixup

* Also kill model_archive_map for MaskedBertPreTrainedModel

* Unhook config_archive_map

* Tokenizers: align with model id changes

* make style && make quality

* Fix CI

d4c2cb40

27 May, 2020 1 commit
- [testing] LanguageModelGenerationTests require_tf or require_torch (#4616) · 07797c4d
  Sam Shleifer authored May 27, 2020
  
  07797c4d
19 May, 2020 1 commit
- [Tests, GPU, SLOW] fix a bunch of GPU hardcoded tests in Pytorch (#4468) · aa925a52
  Patrick von Platen authored May 19, 2020
```
* fix gpu slow tests in pytorch

* change model to device syntax
```
  aa925a52
01 May, 2020 1 commit

[ci] Load pretrained models into the default (long-lived) cache · f54dc3f4

Julien Chaumond authored Apr 23, 2020

There's an inconsistency right now where:
- we load some models into CACHE_DIR
- and some models in the default cache
- and often, in both for the same models

When running the RUN_SLOW tests, this takes a lot of disk space, time, and bandwidth.

I'd rather always use the default cache

f54dc3f4

26 Mar, 2020 1 commit

Add missing token classification for XLM (#3277) · 1a6c546c

sakares saengkaew authored Mar 26, 2020



* Add the missing token classification for XLM

* fix styling

* Add XLMForTokenClassification to AutoModelForTokenClassification class

* Fix docstring typo for non-existing class

* Add the missing token classification for XLM

* fix styling

* fix styling

* Add XLMForTokenClassification to AutoModelForTokenClassification class

* Fix docstring typo for non-existing class

* Add missing description for AlbertForTokenClassification

* fix styling

* Add missing docstring for AlBert

* Slow tests should be slow
Co-authored-by: Sakares Saengkaew <s.sakares@gmail.com>
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>

1a6c546c

08 Mar, 2020 2 commits
- fixed all tests, still need to check ctrl tf and pt and xlm tf · fbd02d46
  patrickvonplaten authored Mar 08, 2020
  
  fbd02d46
- updated all tests · 57597614
  patrickvonplaten authored Mar 08, 2020
  
  57597614
25 Feb, 2020 2 commits
- make style · f5b50c6b
  Patrick von Platen authored Feb 25, 2020
  
  f5b50c6b
- add special tokens to pretrain configs of respective lm head models · e645dcbb
  Patrick von Platen authored Feb 25, 2020
  
  e645dcbb
24 Feb, 2020 1 commit

Add slow generate tests for pretrained lm models (#2909) · 17c45c39

Patrick von Platen authored Feb 24, 2020

* add slow generate lm_model tests

* fix conflicts

* merge conflicts

* fix conflicts

* add slow generate lm_model tests

* make style

* delete unused variable

* fix conflicts

* fix conflicts

* fix conflicts

* delete unused variable

* fix conflicts

* finished hard coded tests

17c45c39

21 Feb, 2020 1 commit

Improve special_token_id logic in run_generation.py and add tests (#2885) · fc38d4c8

Patrick von Platen authored Feb 21, 2020



* improving generation

* finalized special token behaviour for no_beam_search generation

* solved modeling_utils merge conflict

* solve merge conflicts in modeling_utils.py

* add run_generation improvements from PR #2749

* adapted language generation to not use hardcoded -1 if no padding token is available

* remove the -1 removal as hard coded -1`s are not necessary anymore

* add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown

* add slow language generation tests for pretrained models using hardcoded output with pytorch seed

* delete ipdb

* check that all generated tokens are valid

* renaming

* renaming Generation -> Generate

* make style

* updated so that generate_beam_search has same token behavior than generate_no_beam_search

* consistent return format for run_generation.py

* deleted pretrain lm generate tests -> will be added in another PR

* cleaning of unused if statements and renaming

* run_generate will always return an iterable

* make style

* consistent renaming

* improve naming, make sure generate function always returns the same tensor, add docstring

* add slow tests for all lmhead models

* make style and improve example comments modeling_utils

* better naming and refactoring in modeling_utils

* improving generation

* finalized special token behaviour for no_beam_search generation

* solved modeling_utils merge conflict

* solve merge conflicts in modeling_utils.py

* add run_generation improvements from PR #2749

* adapted language generation to not use hardcoded -1 if no padding token is available

* remove the -1 removal as hard coded -1`s are not necessary anymore

* add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown

* add slow language generation tests for pretrained models using hardcoded output with pytorch seed

* delete ipdb

* check that all generated tokens are valid

* renaming

* renaming Generation -> Generate

* make style

* updated so that generate_beam_search has same token behavior than generate_no_beam_search

* consistent return format for run_generation.py

* deleted pretrain lm generate tests -> will be added in another PR

* cleaning of unused if statements and renaming

* run_generate will always return an iterable

* make style

* consistent renaming

* improve naming, make sure generate function always returns the same tensor, add docstring

* add slow tests for all lmhead models

* make style and improve example comments modeling_utils

* better naming and refactoring in modeling_utils

* changed fast random lm generation testing design to more general one

* delete in old testing design in gpt2

* correct old variable name

* temporary fix for encoder_decoder lm generation tests - has to be updated when t5 is fixed

* adapted all fast random generate tests to new design

* better warning description in modeling_utils

* better comment

* better comment and error message
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

fc38d4c8

06 Jan, 2020 2 commits
- GPU text generation: mMoved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
  
  81d6841b
- Moved the encoded_prompts to correct device · dd4df80f
  alberduris authored Dec 31, 2019
  
  dd4df80f
22 Dec, 2019 6 commits

Remove __future__ imports. · c824d15a
Aymeric Augustin authored Dec 22, 2019

c824d15a

Replace (TF)CommonTestCases for modeling with a mixin. · 345c23a6

Aymeric Augustin authored Dec 22, 2019

I suspect the wrapper classes were created in order to prevent the
abstract base class (TF)CommonModelTester from being included in test
discovery and running, because that would fail.

I solved this by replacing the abstract base class with a mixin.

Code changes are just de-indenting and automatic reformattings
performed by black to use the extra line space.

345c23a6

Remove unittest.main() in test modules. · 7e98e211

Aymeric Augustin authored Dec 22, 2019

This construct isn't used anymore these days.

Running python tests/test_foo.py puts the tests/ directory on
PYTHONPATH, which isn't representative of how we run tests.

Use python -m unittest tests/test_foo.py instead.

7e98e211

Switch test files to the standard test_*.py scheme. · ced0a942
Aymeric Augustin authored Dec 22, 2019

ced0a942
Move tests outside of library. · 067395d5
Aymeric Augustin authored Dec 22, 2019

067395d5

Sort imports with isort. · 158e82e0

Aymeric Augustin authored Dec 21, 2019

This is the result of:

    $ isort --recursive examples templates transformers utils hubconf.py setup.py

158e82e0

21 Dec, 2019 2 commits

Reformat source code with black. · fa84ae26

Aymeric Augustin authored Dec 21, 2019

This is the result of:

    $ black --line-length 119 examples templates transformers utils hubconf.py setup.py

There's a lot of fairly long lines in the project. As a consequence, I'm
picking the longest widely accepted line length, 119 characters.

This is also Thomas' preference, because it allows for explicit variable
names, to make the code easier to understand.

fa84ae26

Take advantage of the cache when running tests. · b670c266

Aymeric Augustin authored Dec 20, 2019

Caching models across test cases and across runs of the test suite makes
slow tests somewhat more bearable.

Use gettempdir() instead of /tmp in tests. This makes it easier to
change the location of the cache with semi-standard TMPDIR/TEMP/TMP
environment variables.

Fix #2222.

b670c266

13 Dec, 2019 1 commit
- cleaning up configuration classes · 47f0e3cf
  thomwolf authored Dec 13, 2019
  
  47f0e3cf
06 Dec, 2019 1 commit

Remove dependency on pytest for running tests (#2055) · 35401fe5

Aymeric Augustin authored Dec 06, 2019

* Switch to plain unittest for skipping slow tests.

Add a RUN_SLOW environment variable for running them.

* Switch to plain unittest for PyTorch dependency.

* Switch to plain unittest for TensorFlow dependency.

* Avoid leaking open files in the test suite.

This prevents spurious warnings when running tests.

* Fix unicode warning on Python 2 when running tests.

The warning was:

    UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal

* Support running PyTorch tests on a GPU.

Reverts 27e015bd.

* Tests no longer require pytest.

* Make tests pass on cuda

35401fe5

26 Sep, 2019 1 commit
- [BIG] pytorch-transformers => transformers · 31c23bd5
  thomwolf authored Sep 26, 2019
  
  31c23bd5
24 Sep, 2019 1 commit
- bidirectional conversion TF <=> PT - extended tests · e9a103c1
  thomwolf authored Sep 24, 2019
  
  e9a103c1
23 Sep, 2019 1 commit
- fix XLM tests · a31e591d
  thomwolf authored Sep 23, 2019
  
  a31e591d
11 Sep, 2019 1 commit
- XLM passing tests · 4356f791
  thomwolf authored Sep 11, 2019
  
  4356f791
09 Sep, 2019 1 commit
- fixed imports in tests and gpt2 config test · b7175a27
  thomwolf authored Sep 09, 2019
  
  b7175a27
08 Sep, 2019 2 commits
- test suite independent of framework · 518307df
  thomwolf authored Sep 05, 2019
  
  518307df
- split configuration and modeling files · 1efb1f16
  thomwolf authored Sep 05, 2019
  
  1efb1f16
05 Sep, 2019 1 commit
- test suite independent of framework · 7c0baf95
  thomwolf authored Sep 05, 2019
  
  7c0baf95
04 Sep, 2019 1 commit
- split configuration and modeling files · 2a667b1e
  thomwolf authored Sep 05, 2019
  
  2a667b1e
15 Jul, 2019 1 commit
- update QA models tests + run_generation · e691fc09
  thomwolf authored Jul 15, 2019
  
  e691fc09
12 Jul, 2019 1 commit
- updating tests · 2918b7d2
  thomwolf authored Jul 12, 2019
  
  2918b7d2
11 Jul, 2019 1 commit
- embeddings resizing + tie_weights · bd404735
  thomwolf authored Jul 12, 2019
  
  bd404735
09 Jul, 2019 1 commit
- unified tokenizer api and serialization + tests · b1978698
  thomwolf authored Jul 09, 2019
  
  b1978698