- 07 May, 2020 1 commit
-
-
Patrick von Platen authored
* first copy & past commit from Bert and morgans LSH code * add easy way to compare to trax original code * translate most of function * make trax lsh self attention deterministic with numpy seed + copy paste code * add same config * add same config * make layer init work * implemented hash_vectors function for lsh attention * continue reformer translation * hf LSHSelfAttentionLayer gives same output as trax layer * refactor code * refactor code * refactor code * refactor * refactor + add reformer config * delete bogus file * split reformer attention layer into two layers * save intermediate step * save intermediate step * make test work * add complete reformer block layer * finish reformer layer * implement causal and self mask * clean reformer test and refactor code * fix merge conflicts * fix merge conflicts * update init * fix device for GPU * fix chunk length init for tests * include morgans optimization * improve memory a bit * improve comment * factorize num_buckets * better testing parameters * make whole model work * make lm model work * add t5 copy paste tokenizer * add chunking feed forward * clean config * add improved assert statements * make tokenizer work * improve test * correct typo * extend config * add complexer test * add new axial position embeddings * add local block attention layer * clean tests * refactor * better testing * save intermediate progress * clean test file * make shorter input length work for model * allow variable input length * refactor * make forward pass for pretrained model work * add generation possibility * finish dropout and init * make style * refactor * add first version of RevNet Layers * make forward pass work and add convert file * make uploaded model forward pass work * make uploaded model forward pass work * refactor code * add namedtuples and cache buckets * correct head masks * refactor * made reformer more flexible * make style * remove set max length * add attention masks * fix up tests * fix lsh attention mask * make random seed optional for the moment * improve memory in reformer * add tests * make style * make sure masks work correctly * detach gradients * save intermediate * correct backprob through gather * make style * change back num hashes * rename to labels * fix rotation shape * fix detach * update * fix trainer * fix backward dropout * make reformer more flexible * fix conflict * fix * fix * add tests for fixed seed in reformer layer * fix trainer typo * fix typo in activations * add fp16 tests * add fp16 training * support fp16 * correct gradient bug in reformer * add fast gelu * re-add dropout for embedding dropout * better naming * better naming * renaming * finalize test branch * finalize tests * add more tests * finish tests * fix * fix type trainer * fix fp16 tests * fix tests * fix tests * fix tests * fix issue with dropout * fix dropout seeds * correct random seed on gpu * finalize random seed for dropout * finalize random seed for dropout * remove duplicate line * correct half precision bug * make style * refactor * refactor * docstring * remove sinusoidal position encodings for reformer * move chunking to modeling_utils * make style * clean config * make style * fix tests * fix auto tests * pretrained models * fix docstring * update conversion file * Update pretrained_models.rst * fix rst * fix rst * update copyright * fix test path * fix test path * fix small issue in test * include reformer in generation tests * add docs for axial position encoding * finish docs * Update convert_reformer_trax_checkpoint_to_pytorch.py * remove isort * include sams comments * remove wrong comment in utils * correct typos * fix typo * Update reformer.rst * applied morgans optimization * make style * make gpu compatible * remove bogus file * big test refactor * add example for chunking * fix typo * add to README
-
- 16 Apr, 2020 1 commit
-
-
Patrick von Platen authored
* add dialoGPT * update README.md * fix conflict * update readme * add code links to docs * Update README.md * Update dialo_gpt2.rst * Update pretrained_models.rst * Update docs/source/model_doc/dialo_gpt2.rst Co-Authored-By:
Julien Chaumond <chaumond@gmail.com> * change filename of dialogpt Co-authored-by:
Julien Chaumond <chaumond@gmail.com>
-
- 10 Apr, 2020 1 commit
-
-
Sam Shleifer authored
- support mbart-en-ro weights - add MBartTokenizer
-
- 27 Mar, 2020 1 commit
-
-
Patrick von Platen authored
* add t5 docs basis * improve docs * add t5 docs * improve t5 docstring * add t5 tokenizer docstring * finish docstring * make style * add pretrained models * correct typo * make examples work * finalize docs
-
- 02 Mar, 2020 1 commit
-
-
Sam Shleifer authored
`generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.
-
- 20 Feb, 2020 1 commit
-
-
Sam Shleifer authored
* Results same as fairseq * Wrote a ton of tests * Struggled with api signatures * added some docs
-
- 07 Feb, 2020 1 commit
-
-
VictorSanh authored
-
- 30 Jan, 2020 1 commit
-
-
Lysandre authored
-
- 28 Jan, 2020 1 commit
-
-
Wietse de Vries authored
-
- 06 Jan, 2020 2 commits
-
-
alberduris authored
-
alberduris authored
-
- 21 Dec, 2019 1 commit
-
-
Julien Chaumond authored
cc @lysandrejik
-
- 18 Dec, 2019 2 commits
-
-
Stefan Schweter authored
-
Antti Virtanen authored
-
- 13 Dec, 2019 1 commit
-
-
thomwolf authored
-
- 11 Dec, 2019 3 commits
-
-
Julien Chaumond authored
-
Masatoshi Suzuki authored
-
Stefan Schweter authored
-
- 09 Dec, 2019 1 commit
-
-
Pierric Cistac authored
-
- 05 Dec, 2019 1 commit
-
-
VictorSanh authored
-
- 26 Nov, 2019 3 commits
- 19 Nov, 2019 1 commit
-
-
Stefan Schweter authored
-
- 16 Nov, 2019 1 commit
-
-
Louis MARTIN authored
-
- 08 Nov, 2019 1 commit
-
-
thomwolf authored
-
- 06 Nov, 2019 1 commit
-
-
Julien Chaumond authored
converted from https://github.com/openai/gpt-2-output-dataset/tree/master/detector Co-Authored-By:
Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-Authored-By:
Jong Wook Kim <jongwook@nyu.edu> Co-Authored-By:
Jeff Wu <wuthefwasthat@gmail.com>
-
- 05 Nov, 2019 1 commit
-
-
Lysandre authored
-
- 23 Oct, 2019 1 commit
-
-
VictorSanh authored
-
- 11 Oct, 2019 1 commit
-
-
Stefan Schweter authored
-
- 09 Oct, 2019 1 commit
-
-
thomwolf authored
-
- 03 Oct, 2019 2 commits
-
-
VictorSanh authored
-
VictorSanh authored
-
- 02 Oct, 2019 1 commit
-
-
LysandreJik authored
-
- 26 Sep, 2019 2 commits
-
-
LysandreJik authored
-
thomwolf authored
-
- 16 Sep, 2019 1 commit
-
-
thomwolf authored
-
- 28 Aug, 2019 3 commits
-
-
LysandreJik authored
-
LysandreJik authored
-
LysandreJik authored
-