- 09 Nov, 2020 9 commits
-
-
Sylvain Gugger authored
-
Stas Bekman authored
* support lowercase tokenizer * fix arg pos
-
Shashank Gupta authored
-
Philip May authored
* add evaluate doc * fix style with utils/style.doc * Update src/transformers/trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sam Shleifer authored
-
Julien Plu authored
* Fix some tooling for windows * Fix conflict * Trigger CI
-
dartrevan authored
-
smanjil authored
-
Stas Bekman authored
-
- 08 Nov, 2020 7 commits
-
-
Sam Shleifer authored
-
Stas Bekman authored
-
Stas Bekman authored
-
Jonathan Chang authored
-
Manav Rathod authored
-
Chengxi Guo authored
* Create README.md * Update model_cards/mymusise/gpt2-medium-chinese/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Stas Bekman authored
-
- 07 Nov, 2020 2 commits
-
-
Jonathan Chang authored
* Fix DataCollatorForWholeWordMask * Replace all tensorize_batch in data_collator.py
-
Stas Bekman authored
* rewrite modified_py_files in python to be cross-platform * try a different way to test for variable not being "" * improve comment
-
- 06 Nov, 2020 19 commits
-
-
Patrick von Platen authored
-
Yossi Synett authored
[All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add cross-attention weights to outputs (#8071) * Output cross-attention with decoder attention output * Update src/transformers/modeling_bert.py * add cross-attention for t5 and bart as well * fix tests * correct typo in docs * add sylvains and sams comments * correct typo Co-authored-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
hassoudi authored
Fix websitr address
-
Jonathan Chang authored
-
hassoudi authored
fixes
-
ktrapeznikov authored
* Create README.md * Update model_cards/ktrapeznikov/gpt2-medium-topic-news/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
hasantanvir79 authored
* Create README.md Initial commit * Updated Read me Updated * Apply suggestions from code review Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Karthik Uppuluri authored
-
Karthik Uppuluri authored
-
Karthik Uppuluri authored
* Create README.md * Update README.md
-
Karthik Uppuluri authored
* Create README.md Telugu BERTU Readme file * Update model_cards/kuppuluri/telugu_bertu/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Yifan Peng authored
-
smanjil authored
* model details * Apply suggestions from code review Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Jiaxin Pei authored
-
Stefan Schweter authored
-
Manuel Romero authored
-
Manuel Romero authored
* Model card: CodeBERT fine-tuned for Insecure Code Detection * Update model_cards/mrm8488/codebert-base-finetuned-detect-insecure-code/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Manuel Romero authored
-
Stas Bekman authored
* use decorator * remove hardcoded paths * make the test use more data and do real quality tests * shave off 10 secs * add --eval_beams 2, reformat * reduce train size, use smaller custom dataset
-
- 05 Nov, 2020 3 commits
-
-
Leandro von Werra authored
Co-authored-by:Sam Shleifer <sshleifer@gmail.com>
-
Stas Bekman authored
Co-authored-by:Sam Shleifer <sshleifer@gmail.com>
-
Sylvain Gugger authored
* Make Trainer evaluation handle dynamic seq_length * Document behavior. * Fix test * Better fix * Fixes for realsies this time * Address review comments * Without forgetting to save...
-