- 09 Nov, 2020 3 commits
-
-
dartrevan authored
-
smanjil authored
-
Stas Bekman authored
-
- 08 Nov, 2020 7 commits
-
-
Sam Shleifer authored
-
Stas Bekman authored
-
Stas Bekman authored
-
Jonathan Chang authored
-
Manav Rathod authored
-
Chengxi Guo authored
* Create README.md * Update model_cards/mymusise/gpt2-medium-chinese/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Stas Bekman authored
-
- 07 Nov, 2020 2 commits
-
-
Jonathan Chang authored
* Fix DataCollatorForWholeWordMask * Replace all tensorize_batch in data_collator.py
-
Stas Bekman authored
* rewrite modified_py_files in python to be cross-platform * try a different way to test for variable not being "" * improve comment
-
- 06 Nov, 2020 19 commits
-
-
Patrick von Platen authored
-
Yossi Synett authored
[All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add cross-attention weights to outputs (#8071) * Output cross-attention with decoder attention output * Update src/transformers/modeling_bert.py * add cross-attention for t5 and bart as well * fix tests * correct typo in docs * add sylvains and sams comments * correct typo Co-authored-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
hassoudi authored
Fix websitr address
-
Jonathan Chang authored
-
hassoudi authored
fixes
-
ktrapeznikov authored
* Create README.md * Update model_cards/ktrapeznikov/gpt2-medium-topic-news/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
hasantanvir79 authored
* Create README.md Initial commit * Updated Read me Updated * Apply suggestions from code review Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Karthik Uppuluri authored
-
Karthik Uppuluri authored
-
Karthik Uppuluri authored
* Create README.md * Update README.md
-
Karthik Uppuluri authored
* Create README.md Telugu BERTU Readme file * Update model_cards/kuppuluri/telugu_bertu/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Yifan Peng authored
-
smanjil authored
* model details * Apply suggestions from code review Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Jiaxin Pei authored
-
Stefan Schweter authored
-
Manuel Romero authored
-
Manuel Romero authored
* Model card: CodeBERT fine-tuned for Insecure Code Detection * Update model_cards/mrm8488/codebert-base-finetuned-detect-insecure-code/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Manuel Romero authored
-
Stas Bekman authored
* use decorator * remove hardcoded paths * make the test use more data and do real quality tests * shave off 10 secs * add --eval_beams 2, reformat * reduce train size, use smaller custom dataset
-
- 05 Nov, 2020 9 commits
-
-
Leandro von Werra authored
Co-authored-by:Sam Shleifer <sshleifer@gmail.com>
-
Stas Bekman authored
Co-authored-by:Sam Shleifer <sshleifer@gmail.com>
-
Sylvain Gugger authored
* Make Trainer evaluation handle dynamic seq_length * Document behavior. * Fix test * Better fix * Fixes for realsies this time * Address review comments * Without forgetting to save...
-
Guillaume Filion authored
* Output global_attentions in Longformer models * make style * small refactoring * fix tests * make fix-copies * add for tf as well * remove comments in test * make fix-copies * make style * add docs * make docstring pretty Co-authored-by:patrickvonplaten <patrick.v.platen@gmail.com>
-
Sam Shleifer authored
-
Bobby Donchev authored
* change TokenClassificationTask class methods to static methods Since we do not require self in the class methods of TokenClassificationTask we should probably switch to static methods. Also, since the class TokenClassificationTask does not contain a constructor it is currently unusable as is. By switching to static methods this fixes the issue of having to document the intent of the broken class. Also, since the get_labels and read_examples_from_file methods are ought to be implemented. Static method definitions are unchanged even after inheritance, which means that it can be overridden, similar to other class methods. * Trigger Build Co-authored-by:Lysandre <lysandre.debut@reseau.eseo.fr>
-
Guillem Garc铆a Subies authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-