Commits · f8d3695e8ceeeac3b0236e1a02e1858114a81d83 · chenpangpang / transformers

16 Oct, 2020 1 commit
- [cleanup] assign todos, faster bart-cnn test (#7835) · 96e47d92
  Sam Shleifer authored Oct 16, 2020
```
* 2 beam output

* unassign/remove TODOs

* remove one more
```
  96e47d92
11 Oct, 2020 1 commit
- [examples] bump pl=0.9.0 (#7053) · 827c5194
  Sam Shleifer authored Oct 11, 2020
  
  827c5194
22 Sep, 2020 1 commit

RAG (#6813) · c754c41c

Ola Piktus authored Sep 22, 2020

* added rag WIP

* path fix

* Formatting / renaming prior to actual work

* added rag WIP

* path fix

* Formatting / renaming prior to actual work

* added rag WIP

* path fix

* Formatting / renaming prior to actual work

* added rag WIP

* Formatting / renaming prior to actual work

* First commit

* improve comments

* Retrieval evaluation scripts

* refactor to include modeling outputs + MPI retriever

* Fix rag-token model + refactor

* Various fixes + finetuning logic

* use_bos fix

* Retrieval refactor

* Finetuning refactoring and cleanup

* Add documentation and cleanup

* Remove set_up_rag_env.sh file

* Fix retrieval wit HF index

* Fix import errors

* Fix quality errors

* Refactor as per suggestions in https://github.com/huggingface/transformers/pull/6813#issuecomment-687208867



* fix quality

* Fix RAG Sequence generation

* minor cleanup plus initial tests

* fix test

* fix tests 2

* Comments fix

* post-merge fixes

* Improve readme + post-rebase refactor

* Extra dependencied for tests

* Fix tests

* Fix tests 2

* Refactor test requirements

* Fix tests 3

* Post-rebase refactor

* rename nlp->datasets

* RAG integration tests

* add tokenizer to slow integration test and allow retriever to run on cpu

* add tests; fix position ids warning

* change structure

* change structure

* add from encoder generator

* save working solution

* make all integration tests pass

* add RagTokenizer.save/from_pretrained and RagRetriever.save/from_pretrained

* don't save paths

* delete unnecessary imports

* pass config to AutoTokenizer.from_pretrained for Rag tokenizers

* init wiki_dpr only once

* hardcode legacy index and passages paths (todo: add the right urls)

* finalize config

* finalize retriver api and config api

* LegacyIndex index download refactor

* add dpr to autotokenizer

* make from pretrained more flexible

* fix ragfortokengeneration

* small name changes in tokenizer

* add labels to models

* change default index name

* add retrieval tests

* finish token generate

* align test with previous version and make all tests pass

* add tests

* finalize tests

* implement thoms suggestions

* add first version of test

* make first tests work

* make retriever platform agnostic

* naming

* style

* add legacy index URL

* docstrings + simple retrieval test for distributed

* clean model api

* add doc_ids to retriever's outputs

* fix retrieval tests

* finish model outputs

* finalize model api

* fix generate problem for rag

* fix generate for other modles

* fix some tests

* save intermediate

* set generate to default

* big refactor generate

* delete rag_api

* correct pip faiss install

* fix auto tokenization test

* fix faiss install

* fix test

* move the distributed logic to examples

* model page

* docs

* finish tests

* fix dependencies

* fix import in __init__

* Refactor eval_rag and finetune scripts

* start docstring

* add psutil to test

* fix tf test

* move require torch to top

* fix retrieval test

* align naming

* finish automodel

* fix repo consistency

* test ragtokenizer save/load

* add rag model output docs

* fix ragtokenizer save/load from pretrained

* fix tokenizer dir

* remove torch in retrieval

* fix docs

* fixe finetune scripts

* finish model docs

* finish docs

* remove auto model for now

* add require torch

* remove solved todos

* integrate sylvains suggestions

* sams comments

* correct mistake on purpose

* improve README

* Add generation test cases

* fix rag token

* clean token generate

* fix test

* add note to test

* fix attention mask

* add t5 test for rag

* Fix handling prefix in finetune.py

* don't overwrite index_name
Co-authored-by: Patrick Lewis <plewis@fb.com>
Co-authored-by: Aleksandra Piktus <piktus@devfair0141.h2.fair>
Co-authored-by: Aleksandra Piktus <piktus@learnfair5102.h2.fair>
Co-authored-by: Aleksandra Piktus <piktus@learnfair5067.h2.fair>
Co-authored-by: Your Name <you@example.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>

c754c41c

30 Aug, 2020 1 commit

clearly indicate shuffle=False (#6312) · 32fe4408

xujiaze13 authored Aug 30, 2020



* Clarify shuffle

* clarify shuffle
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>

32fe4408

28 Aug, 2020 1 commit
- PL: --adafactor option (#6776) · fb78a90d
  Sam Shleifer authored Aug 27, 2020
  
  fb78a90d
26 Aug, 2020 1 commit
- Black 20 release · a75c64d8
  Lysandre authored Aug 26, 2020
  
  a75c64d8
17 Aug, 2020 1 commit
- [lightning_base] fix s2s logging, only make train_loader once (#6404) · 84c265ff
  Sam Shleifer authored Aug 16, 2020
  
  84c265ff
11 Aug, 2020 2 commits

lr_schedulers: add get_polynomial_decay_schedule_with_warmup (#6361) · ece0903e

Stas Bekman authored Aug 11, 2020



* [wip] add get_polynomial_decay_schedule_with_warmup

* style

* add assert

* change lr_end to a much smaller default number

* check for exact equality

* [model_cards] electra-base-turkish-cased-ner (#6350)

* for electra-base-turkish-cased-ner

* Add metadata
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Temporarily de-activate TPU CI

* Update modeling_tf_utils.py (#6372)

fix typo: ckeckpoint->checkpoint

* the test now works again (#6371)

* correct pl link in readme (#6364)

* refactor almost identical tests (#6339)

* refactor almost identical tests

* important to add a clear assert error message

* make the assert error even more descriptive than the original bt

* Small docfile fixes (#6328)

* Patch models (#6326)

* TFAlbertFor{TokenClassification, MultipleChoice}

* Patch models

* BERT and TF BERT info


s

* Update check_repo

* Ci GitHub caching (#6382)

* Cache Github Actions CI

* Remove useless file

* Colab button (#6389)

* Add colab button

* Add colab link for tutorials

* Fix links for open in colab (#6391)

* Update src/transformers/optimization.py

consistently use lr_end=1e-7 default
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* [wip] add get_polynomial_decay_schedule_with_warmup

* style

* add assert

* change lr_end to a much smaller default number

* check for exact equality

* Update src/transformers/optimization.py

consistently use lr_end=1e-7 default
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove dup (leftover from merge)

* convert the test into the new refactored format

* stick to using the current_step as is, without ++
Co-authored-by: M. Yusuf Sarıgöz <yusufsarigoz@gmail.com>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Alexander Measure <ameasure@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

ece0903e

[pl] restore lr logging behavior for glue, ner examples (#6314) · 0203d651
Stas Bekman authored Aug 11, 2020

0203d651

09 Aug, 2020 1 commit
- [s2s] fix --gpus clarg collision (#6358) · 9a5ef837
  Sam Shleifer authored Aug 08, 2020
  
  9a5ef837
06 Aug, 2020 1 commit
- [Fix] text-classification PL example (#6027) · ffceef20
  Bhashithe Abeysinghe authored Aug 06, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
  ffceef20
05 Aug, 2020 1 commit

[WIP] lightning_base: support --lr_scheduler with multiple possibilities (#6232) · 376c02e9

Stas Bekman authored Aug 05, 2020

* support --lr_scheduler with multiple possibilities

* correct the error message

* add a note about supported schedulers

* cleanup

* cleanup2

* needs the argument default

* style

* add another assert in the test

* implement requested changes

* cleanups

* fix relative import

* cleanup

376c02e9

03 Aug, 2020 1 commit
- s2s: fix LR logging, remove some dead code. (#6205) · b6b2f227
  Sam Shleifer authored Aug 03, 2020
  
  b6b2f227
30 Jul, 2020 1 commit
- [s2s] add support for overriding config params (#6149) · 3212b885
  Stas Bekman authored Jul 29, 2020
  
  3212b885
18 Jul, 2020 1 commit
- Lightning Updates for v0.8.5 (#5798) · 529850ae
  Nathan Raw authored Jul 17, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
  529850ae
26 Jun, 2020 1 commit
- [pl_examples] default warmup steps=0 (#5316) · 5543b30a
  Sam Shleifer authored Jun 26, 2020
  
  5543b30a
23 Jun, 2020 2 commits
- [pl_examples] revert deletion of optimizer_step (#5227) · 76e5af4c
  Sam Shleifer authored Jun 23, 2020
  
  76e5af4c
- Upgrade examples to pl=0.8.1(#5146) · f5c2a122
  Sam Shleifer authored Jun 22, 2020
  
  f5c2a122
17 Jun, 2020 1 commit
- [examples] SummarizationModule improvements (#4951) · 043f9f51
  Sam Shleifer authored Jun 17, 2020
  
  043f9f51
07 May, 2020 1 commit

BIG Reorganize examples (#4213) · 0ae96ff8

Julien Chaumond authored May 07, 2020

* Created using Colaboratory

* [examples] reorganize files

* remove run_tpu_glue.py as superseded by TPU support in Trainer

* Bugfix: int, not tuple

* move files around

0ae96ff8

22 Apr, 2020 1 commit

Trainer (#3800) · dd9d483d

Julien Chaumond authored Apr 21, 2020

* doc

* [tests] Add sample files for a regression task

* [HUGE] Trainer

* Feedback from @sshleifer

* Feedback from @thomwolf + logging tweak

* [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes

* [glue] Use default max_seq_length of 128 like before

* [glue] move DataTrainingArguments around

* [ner] Change interface of InputExample, and align run_{tf,pl}

* Re-align the pl scripts a little bit

* ner

* [ner] Add integration test

* Fix language_modeling with API tweak

* [ci] Tweak loss target

* Don't break console output

* amp.initialize: model must be on right device before

* [multiple-choice] update for Trainer

* Re-align to 827d6d6e

dd9d483d

20 Apr, 2020 1 commit
- [examples] fix summarization do_predict (#3866) · a504cb49
  Sam Shleifer authored Apr 20, 2020
  
  a504cb49
15 Apr, 2020 1 commit
- [examples] unit test for run_bart_sum (#3544) · c59b1e68
  Sam Shleifer authored Apr 15, 2020
```
- adds pytorch-lightning dependency
```
  c59b1e68
07 Apr, 2020 1 commit
- [examples] SummarizationDataset cleanup (#3451) · e344e3d4
  Sam Shleifer authored Apr 07, 2020
  
  e344e3d4
25 Mar, 2020 1 commit
- BART for summarization training with CNN/DM using pytorch-lightning · 3d76df3a
  Andre Carrera authored Mar 24, 2020
  
  3d76df3a
17 Mar, 2020 1 commit

[WIP] Lightning glue example (#3290) · 930c9412

Nathan Raw authored Mar 17, 2020

* ✨ Alter base pl transformer to use automodels

* 🐛 Add batch size env variable to function call

* 💄 Apply black code style from Makefile

* 🚚 Move lightning base out of ner directory

* ✨ Add lightning glue example

* 💄 self

* move _feature_file to base class

* ✨ Move eval logging to custom callback

* 💄 Apply black code style

* 🐛 Add parent to pythonpath, remove copy command

* 🐛 Add missing max_length kwarg

930c9412

27 Feb, 2020 1 commit
- Changes to NER examples for PLT and TPU (#3053) · 908fa43b
  srush authored Feb 27, 2020
```
* changes to allow for tpu training

* black

* tpu

* tpu
```
  908fa43b
20 Feb, 2020 2 commits

default arg fix (#2937) · 889d3bfd
srush authored Feb 20, 2020

889d3bfd

Support for torch-lightning in NER examples (#2890) · b662f0e6

srush authored Feb 20, 2020



* initial pytorch lightning commit

* tested multigpu

* Fix learning rate schedule

* black formatting

* fix flake8

* isort

* isort

* .
Co-authored-by: Check your git settings! <chris@chris-laptop>

b662f0e6