- 16 Apr, 2020 1 commit
-
-
Patrick von Platen authored
* add dialoGPT * update README.md * fix conflict * update readme * add code links to docs * Update README.md * Update dialo_gpt2.rst * Update pretrained_models.rst * Update docs/source/model_doc/dialo_gpt2.rst Co-Authored-By:
Julien Chaumond <chaumond@gmail.com> * change filename of dialogpt Co-authored-by:
Julien Chaumond <chaumond@gmail.com>
-
- 10 Apr, 2020 2 commits
-
-
Julien Chaumond authored
-
Sam Shleifer authored
- support mbart-en-ro weights - add MBartTokenizer
-
- 06 Apr, 2020 2 commits
-
-
Lysandre Debut authored
* Update notebooks * From local to global link * from local links to *actual* global links
-
LysandreJik authored
-
- 04 Apr, 2020 1 commit
-
-
Julien Chaumond authored
-
- 03 Apr, 2020 1 commit
-
-
Lysandre Debut authored
* Electra wip * helpers * Electra wip * Electra v1 * ELECTRA may be saved/loaded * Generator & Discriminator * Embedding size instead of halving the hidden size * ELECTRA Tokenizer * Revert BERT helpers * ELECTRA Conversion script * Archive maps * PyTorch tests * Start fixing tests * Tests pass * Same configuration for both models * Compatible with base + large * Simplification + weight tying * Archives * Auto + Renaming to standard names * ELECTRA is uncased * Tests * Slight API changes * Update tests * wip * ElectraForTokenClassification * temp * Simpler arch + tests Removed ElectraForPreTraining which will be in a script * Conversion script * Auto model * Update links to S3 * Split ElectraForPreTraining and ElectraForTokenClassification * Actually test PreTraining model * Remove num_labels from configuration * wip * wip * From discriminator and generator to electra * Slight API changes * Better naming * TensorFlow ELECTRA tests * Accurate conversion script * Added to conversion script * Fast ELECTRA tokenizer * Style * Add ELECTRA to README * Modeling Pytorch Doc + Real style * TF Docs * Docs * Correct links * Correct model intialized * random fixes * style * Addressing Patrick's and Sam's comments * Correct links in docs
-
- 31 Mar, 2020 3 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
- 30 Mar, 2020 2 commits
-
-
LysandreJik authored
-
Patrick von Platen authored
* Add clear description of how to train T5 * correct docstring in T5 * correct typo * correct docstring format * update t5 model docs * implement collins feedback * fix typo and add more explanation for sentinal tokens * delete unnecessary todos
-
- 27 Mar, 2020 1 commit
-
-
Patrick von Platen authored
* add t5 docs basis * improve docs * add t5 docs * improve t5 docstring * add t5 tokenizer docstring * finish docstring * make style * add pretrained models * correct typo * make examples work * finalize docs
-
- 24 Mar, 2020 1 commit
-
-
LysandreJik authored
-
- 17 Mar, 2020 2 commits
-
-
Sam Shleifer authored
* passing * Undo stupid chg * docs * undo rename * delete-cruft * only import if you have torch * Dont rely on dict ordering * Fix dict ordering upstream * docstring link * docstring link * remove trailing comma for 3.5 compat * new name * delegate kwarging * Update kwargs
-
Thomas Wolf authored
* memory benchmark rss * have both forward pass and line-by-line mem tracing * cleaned up tracing * refactored and cleaning up API * no f-strings yet... * add GPU mem logging * fix GPU memory monitoring * style and quality * clean up and doc * update with comments * Switching to python 3.6+ * fix quality
-
- 10 Mar, 2020 2 commits
-
-
Julien Chaumond authored
Co-Authored-By:Thomas Wolf <thomwolf@users.noreply.github.com>
-
Julien Chaumond authored
-
- 05 Mar, 2020 2 commits
-
-
Sam Shleifer authored
* improved documentation
-
Lysandre authored
-
- 02 Mar, 2020 2 commits
-
-
Lysandre Debut authored
* Pipeline doc initial commit * pipeline abstraction * Remove modelcard argument from pipeline * Task-specific pipelines can be instantiated with no model or tokenizer * All pipelines doc
-
Sam Shleifer authored
`generate` code that produces 99% identical summarizations to fairseq on CNN test data, with caching.
-
- 26 Feb, 2020 1 commit
-
-
Sam Shleifer authored
-
- 25 Feb, 2020 2 commits
-
-
Lysandre Debut authored
* All Tokenizers BertTokenizer + few fixes RobertaTokenizer OpenAIGPTTokenizer + Fixes GPT2Tokenizer + fixes TransfoXLTokenizer Correct rst for TransformerXL XLMTokenizer + fixes XLNet Tokenizer + Style DistilBERT + Fix XLNet RST CTRLTokenizer CamemBERT Tokenizer FlaubertTokenizer XLMRobertaTokenizer cleanup * cleanup
-
Lysandre Debut authored
* Usage: Sequence Classification & Question Answering * Pipeline example * Language modeling * TensorFlow code for Sequence classification * Custom TF/PT toggler in docs * QA + LM for TensorFlow * Finish Usage for both PyTorch and TensorFlow * Addressing Julien's comments * More assertive * cleanup * Favicon - added favicon option in conf.py along with the favicon image - udpated
馃 logo. slightly smaller and should appear more consistent across editing programs (no more tongue on the outside of the mouth) Co-authored-by:joshchagani <joshua@joshuachagani.com>
-
- 24 Feb, 2020 1 commit
-
-
Lysandre authored
-
- 20 Feb, 2020 1 commit
-
-
Sam Shleifer authored
* Results same as fairseq * Wrote a ton of tests * Struggled with api signatures * added some docs
-
- 19 Feb, 2020 1 commit
-
-
Lysandre authored
Welcome Rust Tokenizers
-
- 10 Feb, 2020 1 commit
-
-
Lysandre authored
-
- 07 Feb, 2020 4 commits
-
-
Lysandre authored
-
Lysandre authored
-
VictorSanh authored
-
Julien Chaumond authored
-
- 06 Feb, 2020 2 commits
-
-
Julien Chaumond authored
-
Julien Chaumond authored
-
- 05 Feb, 2020 1 commit
-
-
Julien Chaumond authored
cc @lysandrejik @thomwolf
-
- 04 Feb, 2020 1 commit
-
-
Lysandre authored
-
- 31 Jan, 2020 2 commits
- 30 Jan, 2020 1 commit
-
-
Hang Le authored
-