- 22 Oct, 2020 1 commit
-
-
Stas Bekman authored
-
- 21 Oct, 2020 1 commit
-
-
Stas Bekman authored
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
- 16 Oct, 2020 1 commit
-
-
Stas Bekman authored
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
- 04 Oct, 2020 1 commit
-
-
Suraj Patil authored
-
- 01 Oct, 2020 2 commits
-
-
Suraj Patil authored
-
Sam Shleifer authored
-
- 30 Sep, 2020 1 commit
-
-
Amanpreet Singh authored
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
- 27 Sep, 2020 1 commit
-
-
Sam Shleifer authored
Co-authored-by: Swetha Mandava <smandava@nvidia.com>
-
- 24 Sep, 2020 1 commit
-
-
Sam Shleifer authored
-
- 21 Sep, 2020 1 commit
-
-
Sam Shleifer authored
* save hostname
-
- 17 Sep, 2020 1 commit
-
-
Sam Shleifer authored
-
- 16 Sep, 2020 2 commits
-
-
Sam Shleifer authored
-
Stas Bekman authored
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
- 14 Sep, 2020 2 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-
- 13 Sep, 2020 1 commit
-
-
Sam Shleifer authored
-
- 10 Sep, 2020 1 commit
-
-
Sam Shleifer authored
-
- 04 Sep, 2020 1 commit
-
-
Sam Shleifer authored
-
- 28 Aug, 2020 1 commit
-
-
Sam Shleifer authored
* broken test
* batch parity
* tests pass
* boom boom
* boom boom
* split out bart tokenizer tests
* fix tests
* boom boom
* Fixed dataset bug
* Fix marian
* Undo extra
* Get marian working
* Fix t5 tok tests
* Test passing
* Cleanup
* better assert msg
* require torch
* Fix mbart tests
* undo extra decoder_attn_mask change
* Fix import
* pegasus tokenizer can ignore src_lang kwargs
* unused kwarg test cov
* boom boom
* add todo for pegasus issue
* cover one word translation edge case
* Cleanup
* doc
-
- 26 Aug, 2020 1 commit
-
-
Lysandre authored
-
- 25 Aug, 2020 1 commit
-
-
Sam Shleifer authored
-
- 13 Aug, 2020 1 commit
-
-
Sam Shleifer authored
-
- 11 Aug, 2020 1 commit
-
-
Sam Shleifer authored
-
- 08 Aug, 2020 1 commit
-
-
Suraj Patil authored
-
- 06 Aug, 2020 1 commit
-
-
Sam Shleifer authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 28 Jul, 2020 2 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
* MBART: support summarization tasks
* fix test
* Style
* add tokenizer test
-
- 21 Jul, 2020 1 commit
-
-
Sam Shleifer authored
-
- 18 Jul, 2020 1 commit
-
-
Sam Shleifer authored
Co-authored-by: Pradhy729 <49659913+Pradhy729@users.noreply.github.com>
-
- 17 Jul, 2020 1 commit
-
-
Sam Shleifer authored
-
- 15 Jul, 2020 2 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-
- 07 Jul, 2020 1 commit
-
-
Sam Shleifer authored
improve unittests for finetuning, especially w.r.t. testing frozen parameters
fix freeze_embeds for T5
add streamlit setup.cfg
-
- 26 Jun, 2020 2 commits
-
-
Sam Shleifer authored
-
Thomas Wolf authored
* remove references to old API in docstring - update data processors
* style
* fix tests - better type checking error messages
* better type checking
* include awesome fix by @LysandreJik for #5310
* updated doc and examples
-
- 25 Jun, 2020 1 commit
-
-
Sam Shleifer authored
-
- 23 Jun, 2020 1 commit
-
-
Sam Shleifer authored
-
- 19 Jun, 2020 1 commit
-
-
Sam Shleifer authored
-
- 17 Jun, 2020 1 commit
-
-
Sam Shleifer authored
-
- 15 Jun, 2020 1 commit
-
-
Anthony MOI authored
[HUGE] Refactoring tokenizers backend - padding - truncation - pre-tokenized pipeline - fast tokenizers - tests (#4510)
* Use tokenizers pre-tokenized pipeline
* failing pretokenized test
* Fix is_pretokenized in python
* add pretokenized tests
* style and quality
* better tests for batched pretokenized inputs
* tokenizers clean up - new padding_strategy - split the files
* [HUGE] refactoring tokenizers - padding - truncation - tests
* style and quality
* bump up required tokenizers version to 0.8.0-rc1
* switched padding/truncation API - simpler better backward compat
* updating tests for custom tokenizers
* style and quality - tests on pad
* fix QA pipeline
* fix backward compatibility for max_length only
* style and quality
* Various cleanups - add verbose
* fix tests
* update docstrings
* Fix tests
* Docs reformatted
* __call__ method documented
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
-