Commits · e9a2f772bcce0d48cb9adac24a0e165675f178f0 · chenpangpang / transformers

10 Sep, 2020 2 commits

Add TF Funnel Transformer (#7029) · 15a18904

Sylvain Gugger authored Sep 10, 2020



* Add TF Funnel Transformer

* Proper dummy input

* Formatting

* Update src/transformers/modeling_tf_funnel.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments

* One review comment forgotten
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

15a18904

Add "Leveraging Pretrained Checkpoints for Generation" Seq2Seq models. (#6594) · 7fd1febf

Patrick von Platen authored Sep 10, 2020

* add conversion script

* improve conversion script

* make style

* add tryout files

* fix

* update

* add causal bert

* better names

* add tokenizer file as well

* finish causal_bert

* fix small bugs

* improve generate

* change naming

* renaming

* renaming

* renaming

* remove leftover files

* clean files

* add fix tokenizer

* finalize

* correct slow test

* update docs

* small fixes

* fix link

* adapt check repo

* apply Sam's and Sylvain's recommendations

* fix import

* implement Lysandre's recommendations

* fix logger warn

7fd1febf

02 Sep, 2020 1 commit
- fix QA example for PT (#6890) · 1889e96c
  Patrick von Platen authored Sep 02, 2020
  
  1889e96c
31 Aug, 2020 1 commit
- Patch logging issue · 05c32141
  Lysandre authored Aug 31, 2020
  
  05c32141
26 Aug, 2020 1 commit

Centralize logging (#6434) · 77abd1e7

Lysandre Debut authored Aug 26, 2020



* Logging

* Style

* hf_logging > utils.logging

* Address @thomwolf's comments

* Update test

* Update src/transformers/benchmark/benchmark_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Revert bad change
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

77abd1e7

24 Aug, 2020 1 commit
- Update repo to isort v5 (#6686) · a5737779
  Sylvain Gugger authored Aug 24, 2020
```
* Run new isort

* More changes

* Update CI, CONTRIBUTING and benchmarks
```
  a5737779
20 Aug, 2020 1 commit

Trainer automatically drops unused columns in nlp datasets (#6449) · e5f45227

Sylvain Gugger authored Aug 20, 2020

* Add a classmethod to easily build a Trainer from nlp dataset and metric

* Fix docstrings

* Split train/eval

* Formatting

* Log dropped columns + docs

* Authorize callable activations

* Poc for auto activation

* Be framework-agnostic

* Formatting

* Remove class method

* Remove unnecessary code

e5f45227

05 Aug, 2020 1 commit

Tf model outputs (#6247) · c67d1a02

Sylvain Gugger authored Aug 05, 2020

* TF outputs and test on BERT

* Albert to DistilBert

* All remaining TF models except T5

* Documentation

* One file forgotten

* Add new models and fix issues

* Quality improvements

* Add T5

* A bit of cleanup

* Fix for slow tests

* Style

c67d1a02

30 Jul, 2020 1 commit

Switch from return_tuple to return_dict (#6138) · 91cb9546

Sylvain Gugger authored Jul 30, 2020



* Switch from return_tuple to return_dict

* Fix test

* [WIP] Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleC… (#5614)

* Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleChoice} models and tests

* AutoModels


* Tiny tweaks

* Style

* Final changes before merge

* Re-order for simpler review

* Final fixes

* Addressing @sgugger's comments

* Test MultipleChoice

* Rework TF trainer (#6038)

* Fully rework training/prediction loops

* fix method name

* Fix variable name

* Fix property name

* Fix scope

* Fix method name

* Fix tuple index

* Fix tuple index

* Fix indentation

* Fix variable name

* fix eval before log

* Add drop remainder for test dataset

* Fix step number + fix logging datetime

* fix eval loss value

* use global step instead of step + fix logging at step 0

* Fix logging datetime

* Fix global_step usage

* Fix breaking loop + logging datetime

* Fix step in prediction loop

* Fix step breaking

* Fix train/test loops

* Force TF at least 2.2 for the trainer

* Use assert_cardinality to facilitate the dataset size computation

* Log steps per epoch

* Make tfds compliant with TPU

* Make tfds compliant with TPU

* Use TF dataset enumerate instead of the Python one

* revert previous commit

* Fix data_dir

* Apply style

* rebase on master

* Address Sylvain's comments

* Address Sylvain's and Lysandre's comments

* Trigger CI

* Remove unused import

* Switch from return_tuple to return_dict

* Fix test

* Add recent model
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Plu <plu.julien@gmail.com>

91cb9546

27 Jul, 2020 1 commit
- Fix the return documentation rendering for all model outputs (#6022) · 1246b20f
  Sylvain Gugger authored Jul 27, 2020
```
* Fix the return documentation rendering for all model outputs

* Formatting
```
  1246b20f
21 Jul, 2020 1 commit
- Update doc to new model outputs (#5946) · e714412f
  Sylvain Gugger authored Jul 21, 2020
```
* Update doc to new model outputs

* Fix outputs in quicktour
```
  e714412f
10 Jul, 2020 2 commits

Document model outputs (#5673) · 7fad617d

Sylvain Gugger authored Jul 10, 2020



* Document model outputs

* Update docs/source/main_classes/output.rst
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

7fad617d

Change model outputs types to self-document outputs (#5438) · edfd82f5

Sylvain Gugger authored Jul 10, 2020

* [WIP] Proposal for model outputs

* All Bert models

* Make CI green maybe?

* Fix ONNX test

* Isolate ModelOutput from pt and tf

* Formatting

* Add Electra models

* Auto-generate docstrings from outputs

* Add TF outputs

* Add some BERT models

* Revert TF side

* Remove last traces of TF changes

* Fail with a clear error message

* Add Albert and work through Bart

* Add CTRL and DistilBert

* Formatting

* Progress on Bart

* Renames and finish Bart

* Formatting

* Fix last test

* Add DPR

* Finish Electra and add FlauBERT

* Add GPT2

* Add Longformer

* Add MMBT

* Add MobileBert

* Add GPT

* Formatting

* Add Reformer

* Add Roberta

* Add T5

* Add Transformer XL

* Fix test

* Add XLM + fix XLMForTokenClassification

* Style + XLMRoberta

* Add XLNet

* Formatting

* Add doc of return_tuple arg

edfd82f5

26 Jun, 2020 1 commit

[tokenizers] Updates data processors, docstring, examples and model cards to the new API (#5308) · 601d4d69

Thomas Wolf authored Jun 26, 2020

* remove references to old API in docstring - update data processors

* style

* fix tests - better type checking error messages

* better type checking

* include awesome fix by @LysandreJik for #5310

* updated doc and examples

601d4d69

25 Jun, 2020 1 commit

Refactor Code samples; Test code samples (#5036) · 364a5ae1

Lysandre Debut authored Jun 25, 2020



* Refactor code samples

* Test docstrings

* Style

* Tokenization examples

* Run rest of tests

* First step to testing source docs

* Style and BART comment

* Test the remainder of the code samples

* Style

* let to const

* Formatting fixes

* Ready for merge

* Fix fixture + Style

* Fix last tests

* Update docs/source/quicktour.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Addressing @sgugger's comments + Fix MobileBERT in TF
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

364a5ae1

23 Jun, 2020 1 commit
- [file_utils] Type user-agent · c01480bb
  Julien Chaumond authored Jun 23, 2020
  
  c01480bb
22 Jun, 2020 1 commit

Benchmarks (#4912) · fa0be6d7

Patrick von Platen authored Jun 22, 2020

* finish benchmark

* fix isort

* fix setup cfg

* retab

* fix time measuring of tf graph mode

* fix tf cuda

* clean code

* better error message

fa0be6d7

10 Jun, 2020 1 commit
- Don't init TPU device twice (#4916) · 466aa57a
  Lysandre Debut authored Jun 10, 2020
  
  466aa57a
09 Jun, 2020 1 commit

[Benchmark] add tpu and torchscript for benchmark (#4850) · 2cfb947f

Patrick von Platen authored Jun 09, 2020



* add tpu and torchscript for benchmark

* fix name in tests

* "fix email"

* make style

* better log message for tpu

* add more print and info for tpu

* allow possibility to print tpu metrics

* correct cpu usage

* fix test for non-install

* remove bogus file

* include psutil in testing

* run a couple of times before tracing in torchscript

* do not allow tpu memory tracing for now

* make style

* add torchscript to env

* better name for torch tpu
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2cfb947f

27 May, 2020 1 commit

[Benchmark] Memory benchmark utils (#4198) · 96f57c9c

Patrick von Platen authored May 27, 2020



* improve memory benchmarking

* correct typo

* fix current memory

* check torch memory allocated

* better pytorch function

* add total cached gpu memory

* add total gpu required

* improve torch gpu usage

* update memory usage

* finalize memory tracing

* save intermediate benchmark class

* fix conflict

* improve benchmark

* improve benchmark

* finalize

* make style

* improve benchmarking

* correct typo

* make train function more flexible

* fix csv save

* better repr of bytes

* better print

* fix __repr__ bug

* finish plot script

* rename plot file

* delete csv and small improvements

* fix in plot

* fix in plot

* correct usage of timeit

* remove redundant line

* remove redundant line

* fix bug

* add hf parser tests

* add versioning and platform info

* make style

* add gpu information

* ensure backward compatibility

* finish adding all tests

* Update src/transformers/benchmark/benchmark_args.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/benchmark/benchmark_args_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* delete csv files

* fix isort ordering

* add out of memory handling

* add better train memory handling
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

96f57c9c

11 May, 2020 1 commit

Simplify cache vars and allow for TRANSFORMERS_CACHE env (#4226) · 61d22f9c

Bram Vanroy authored May 11, 2020

* simplify cache vars and allow for TRANSFORMERS_CACHE env

As it currently stands, "TRANSFORMERS_CACHE" is not an accepted variable. It seems that these variables were not updated when moving from pytorch_transformers to transformers. In addition, the fallback procedure could be improved and simplified. Pathlib seems redundant here.

* Update file_utils.py
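
The fallback described in this commit message can be pictured roughly as follows. This is only a sketch, not the code from the commit: the function name `resolve_cache_dir`, the priority order, and the default location are assumptions made for illustration, and the two legacy variable names are a guess at what "these variables" refers to.

```python
import os
from typing import Mapping, Optional, Sequence

# Assumed priority order: the new variable name first, legacy names after it.
CACHE_ENV_VARS: Sequence[str] = (
    "TRANSFORMERS_CACHE",
    "PYTORCH_TRANSFORMERS_CACHE",
    "PYTORCH_PRETRAINED_BERT_CACHE",
)


def resolve_cache_dir(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the first cache directory set in `env`, else a default path.

    Hypothetical helper; plain os.path calls are used instead of pathlib.
    """
    env = os.environ if env is None else env
    for name in CACHE_ENV_VARS:
        value = env.get(name)
        if value:
            return os.path.expanduser(value)
    # Illustrative default only; the real default location may differ.
    base = env.get("XDG_CACHE_HOME") or os.path.join("~", ".cache")
    return os.path.expanduser(os.path.join(base, "torch", "transformers"))
```

Passing the environment as a mapping keeps the lookup easy to test; calling `resolve_cache_dir()` with no argument reads `os.environ`.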

61d22f9c

29 Apr, 2020 1 commit

CDN urls (#4030) · 455c6390

Julien Chaumond authored Apr 28, 2020

* [file_utils] use_cdn + documentation

* Move to cdn. urls for weights

* [urls] Hotfix for bert-base-japanese

455c6390

27 Apr, 2020 1 commit
- rm boto3 dependency · 97a37548
  Julien Chaumond authored Apr 25, 2020
  
  97a37548
22 Apr, 2020 1 commit

Trainer (#3800) · dd9d483d

Julien Chaumond authored Apr 21, 2020

* doc

* [tests] Add sample files for a regression task

* [HUGE] Trainer

* Feedback from @sshleifer

* Feedback from @thomwolf + logging tweak

* [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes

* [glue] Use default max_seq_length of 128 like before

* [glue] move DataTrainingArguments around

* [ner] Change interface of InputExample, and align run_{tf,pl}

* Re-align the pl scripts a little bit

* ner

* [ner] Add integration test

* Fix language_modeling with API tweak

* [ci] Tweak loss target

* Don't break console output

* amp.initialize: model must be on right device before

* [multiple-choice] update for Trainer

* Re-align to 827d6d6e

dd9d483d

09 Apr, 2020 1 commit
- Fix force_download of files on Windows (#3697) · 9384e5f6
  calpt authored Apr 09, 2020
  
  9384e5f6
24 Feb, 2020 1 commit

Add local_files_only parameter to pretrained items (#2930) · a143d947

Bram Vanroy authored Feb 24, 2020

* Add disable_outgoing to pretrained items

Setting disable_outgoing=True disables outgoing traffic:
- etags are not looked up
- models are not downloaded

* parameter name change

* Remove forgotten print
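
The behaviour listed above (no ETag lookup, no download) might look like the following in a cache helper. This is a minimal sketch under stated assumptions, not the library's implementation: `cached_path_sketch`, the injected `fetch_etag` and `download` callables, and the file naming scheme are all hypothetical; only the `local_files_only` parameter name comes from the commit title.

```python
import os
from typing import Callable, Optional


def cached_path_sketch(
    url: str,
    cache_dir: str,
    fetch_etag: Callable[[str], Optional[str]],
    download: Callable[[str, str], None],
    local_files_only: bool = False,
) -> str:
    """Return a local path for `url`, using the network only when allowed."""
    # Hypothetical naming scheme: sanitized URL, optionally suffixed by the ETag.
    filename = url.replace("://", "_").replace("/", "_")

    if local_files_only:
        # No ETag lookup and no download: only an existing cached file is usable.
        if os.path.isdir(cache_dir):
            matches = sorted(f for f in os.listdir(cache_dir) if f.startswith(filename))
            if matches:
                return os.path.join(cache_dir, matches[-1])
        raise FileNotFoundError(f"{url} is not cached and local_files_only=True")

    etag = fetch_etag(url)  # network call (injected so the sketch stays testable)
    name = filename if etag is None else f"{filename}.{etag}"
    path = os.path.join(cache_dir, name)
    if not os.path.exists(path):
        os.makedirs(cache_dir, exist_ok=True)
        download(url, path)  # network call
    return path
```

Injecting the two network callables makes it easy to check that neither is invoked when `local_files_only=True`.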

a143d947

06 Feb, 2020 2 commits
- cleanup · d311f87b
  thomwolf authored Feb 07, 2020
  
  d311f87b
- file_cache has options to extract archives · 7d99e05f
  thomwolf authored Feb 07, 2020
  
  7d99e05f
23 Jan, 2020 4 commits
- Run the examples in slow · 24d5ad1d
  Lysandre authored Jan 22, 2020
  
  24d5ad1d
- Tips + whitespaces · 9ddf60b6
  Lysandre authored Jan 21, 2020
  
  9ddf60b6
- TF ALBERT + TF Utilities + Fix warnings · 3922a249
  Lysandre authored Jan 15, 2020
  
  3922a249
- ALBERT Modeling + required changes to utilities · 00df3d4d
  Lysandre authored Jan 15, 2020
  
  00df3d4d
20 Jan, 2020 3 commits
- Fix style · ca6ce304
  Lysandre authored Jan 20, 2020
  
  ca6ce304
- Make forward asynchronous to avoid long computations timing out. · 908cd5ea
  Morgan Funtowicz authored Jan 10, 2020
```
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
```
  908cd5ea
- Fix bad handling of env variable USE_TF / USE_TORCH leading to invalid framework being used. · 6e6c8c52
  Morgan Funtowicz authored Jan 10, 2020
```
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
```
  6e6c8c52
16 Jan, 2020 1 commit
- Tokenizer.from_pretrained: fetch all possible files remotely · 23a2cea8
  Julien Chaumond authored Jan 15, 2020
  
  23a2cea8
13 Jan, 2020 1 commit
- In a parallel setup this could fail · afc24ea5
  Julien Chaumond authored Jan 13, 2020
  
  afc24ea5
07 Jan, 2020 2 commits
- fixed formatting · 2926852f
  Dima Galat authored Jan 07, 2020
  
  2926852f
- removing redundant .flush · e2810edc
  Dima Galat authored Jan 07, 2020
  
  e2810edc
06 Jan, 2020 1 commit
- GPU text generation: Moved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
  
  81d6841b