- 29 Sep, 2020 6 commits
-
-
Sylvain Gugger authored
-
Teven authored
* GPT2 gradient checkpointing * find_unused_parameters removed if checkpointing * find_unused_parameters removed if checkpointing * Update src/transformers/configuration_gpt2.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Added a test for generation with checkpointing * Update src/transformers/configuration_gpt2.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
* Add automatic best model loading to Trainer * Some small fixes * Formatting
-
Sylvain Gugger authored
-
GmailB authored
Hi, just corrected the example code, add 2 links and fixed some typos
-
Sam Shleifer authored
-
- 28 Sep, 2020 14 commits
-
-
Typicasoft authored
* Create README.md MagBERT-NER : Added widget (Text) * Rename model_cards/README.md to model_cards/TypicaAI/magbert-ner/README.md
-
Manuel Romero authored
Add links to models fine-tuned on a downstream task
-
Stas Bekman authored
* [makefile] check/fix only modified since branching files * fix phonies * parametrize dirs * have only one source for dirs to check * look ma, no autoformatters here
-
Lysandre authored
-
Lysandre authored
-
Sylvain Gugger authored
* Reorganize documentation navbar * Update css to have clear sections
-
Lysandre Debut authored
co-authored-by:
sgugger <sylvain.gugger@gmail.com> Co-authored-by:
sgugger <sylvain.gugger@gmail.com>
-
Sylvain Gugger authored
Do not merge before Monday
-
Sylvain Gugger authored
-
Boris Dayma authored
* docs: fix model sharing file names * Update docs/source/model_sharing.rst Co-authored-by:
Julien Chaumond <chaumond@gmail.com> * docs(model_sharing.rst): fix new line Co-authored-by:
Julien Chaumond <chaumond@gmail.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
-
Marcin Zab艂ocki authored
-
Ola Piktus authored
* Improve README + consolidation script * Reformat README * Reformat README Co-authored-by:Your Name <you@example.com>
-
Sam Shleifer authored
* Working assymmetrical T5 * rename decoder_layers -> num_decoder_layers * Fix docstring * Allow creation of asymmetric t5 students
-
- 27 Sep, 2020 2 commits
-
-
Sam Shleifer authored
Co-authored-by:Swetha Mandava <smandava@nvidia.com>
-
Suraj Patil authored
Co-authored-by:
Suraj Patil <surajp815@gmail.com> Co-authored-by:
Sam Shleifer <sshleifer@gmail.com>
-
- 25 Sep, 2020 18 commits
-
-
Patrick von Platen authored
* fix multi-gpu * fix longformer * force to delete unnecessary layers * fix notifications * fix warning * fix roberta * fix tests * remove hasattr * fix tests * fix roberta * merge and clean authorized keys
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Ola Piktus authored
Co-authored-by:Your Name <you@example.com>
-
Sylvain Gugger authored
* Fix #7371 * Fix training * Fix test values * Apply the fix to TF as well
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Stas Bekman authored
* [code quality] merge style and quality targets Any reason why we don't run `flake8` in `make style`? I find myself needing to run `make style` and `make quality` all the time, but I need the latter just for the last 2 checks. Since we have no control over the source code why bother with separating checking and fixing - let's just have one target that fixes and then performs the remaining checks, as we know the first two have been done already. This PR suggests to merge the 2 targets into one efficient target. I will edit the docs if this change resonates with the team. * move checks into style, re-use target * better name * add fixup target * document new target
-
Sam Shleifer authored
-
Patrick von Platen authored
-
Quentin Lhoest authored
* Fix retrieval offset in RAG's HfIndex * update slow tests * style * fix new test * style * add better tests Co-authored-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
Patrick von Platen authored
* fix_rag_sequence * add second bug fix
-
Suraj Patil authored
-
Patrick von Platen authored
* add docs * add missing docs and attention_mask in fine-tune
-
Lysandre Debut authored
* Check config type instead of instance Bad merge * Remove for loops * Style
-