- 31 Aug, 2020 3 commits
-
-
Lysandre Debut authored
-
Lysandre authored
-
Lysandre authored
-
- 30 Aug, 2020 6 commits
-
-
Sam Shleifer authored
-
xujiaze13 authored
* Clarify shuffle * clarify shuffle Co-authored-by:Kevin Canwen Xu <canwenxu@126.com>
-
Rodolfo De Nadai authored
-
Zane Lim authored
-
Thomas Ashish Cherian authored
-
Stas Bekman authored
-
- 29 Aug, 2020 3 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Sam Shleifer authored
-
- 28 Aug, 2020 9 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Sam Shleifer authored
* broken test * batch parity * tests pass * boom boom * boom boom * split out bart tokenizer tests * fix tests * boom boom * Fixed dataset bug * Fix marian * Undo extra * Get marian working * Fix t5 tok tests * Test passing * Cleanup * better assert msg * require torch * Fix mbart tests * undo extra decoder_attn_mask change * Fix import * pegasus tokenizer can ignore src_lang kwargs * unused kwarg test cov * boom boom * add todo for pegasus issue * cover one word translation edge case * Cleanup * doc
-
RafaelWO authored
* Improved tokenization with sacremoses * The TransfoXLTokenizer is now using sacremoses for tokenization * Added tokenization of comma-separated and floating point numbers. * Removed prepare_for_tokenization() from tokenization_transfo_xl.py because punctuation is handled by sacremoses * Added corresponding tests * Removed test comapring TransfoXLTokenizer and TransfoXLTokenizerFast * Added deprecation warning to TransfoXLTokenizerFast * isort change Co-authored-by:
Teven <teven.lescao@gmail.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Ahmed Elnaggar authored
-
Stas Bekman authored
`make style` with `black` < 20.8b1 is a no go (in case some other package forced a lower version) - so make it explicit to avoid confusion
-
Sam Shleifer authored
-
Stas Bekman authored
-
- 27 Aug, 2020 12 commits
-
-
Lysandre authored
-
Stas Bekman authored
* [doc] multiple corrections to "Summary of the tasks" * add a new "docs" target to validate docs and document it * fix mixup
-
Stas Bekman authored
* [test schedulers] small improvement * cleanup
-
Stas Bekman authored
* [testing] replace hardcoded paths to allow running tests from anywhere * fix the merge conflict
-
Sam Shleifer authored
-
Tom Grek authored
-
Lysandre authored
-
Julien Plu authored
* Align TF NER example over the PT one * Fix Dataset call * Fix gradient accumulation training * Apply style * Address Sylvain's comments * Address Sylvain's comments * Apply style
-
Lysandre Debut authored
-
Nikolai Yakovenko authored
* AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to ADAM. * update PR fixes, add basic test * bug -- incorrect params in test * bugfix -- import Adafactor into test * bugfix -- removed accidental T5 include * resetting T5 to master * bugfix -- include Adafactor in __init__ * longer loop for adafactor test * remove double error class declare * lint * black * isort * Update src/transformers/optimization.py Co-authored-by:
Sam Shleifer <sshleifer@gmail.com> * single docstring * Cleanup docstring Co-authored-by:
Nikolai Y <nikolai.yakovenko@point72.com> Co-authored-by:
Sam Shleifer <sshleifer@gmail.com>
-
Sam Shleifer authored
-
Ahmed Elnaggar authored
-
- 26 Aug, 2020 7 commits
-
-
Sam Shleifer authored
-
Igli Manaj authored
-
Joe Davison authored
-
Ali Safaya authored
* Create README.md * Update README.md
-
Ali Safaya authored
* Create README.md * Update README.md
-
Ali Safaya authored
* Create README.md * Update README.md
-
Sagor Sarker authored
* added model card for codeswitch-spaeng-sentiment-analysis-lince model also update other model card * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * fixed typo * Update README.md
-