- 26 Jan, 2021 3 commits
-
-
Yusuke Mori authored
* Fix broken links in the converting tf ckpt document * Update docs/source/converting_tensorflow_models.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Reflect the review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
* fix ci * fix ci * renaming * fix dup line
-
Stas Bekman authored
* normalize, group, sort + add myself for deepspeed * new structure * add ray * typo * more suggestions * more suggestions * white space * Update .github/ISSUE_TEMPLATE/bug-report.md Co-authored-by:
Suraj Patil <surajp815@gmail.com> * add bullets * sync * Apply suggestions from code review Co-authored-by:
Suraj Patil <surajp815@gmail.com> * Apply suggestions from code review Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * sync Co-authored-by:
Suraj Patil <surajp815@gmail.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
- 25 Jan, 2021 7 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Auto-resume training from checkpoint * Update examples/text-classification/run_glue.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Roll out to other examples Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Lysandre Debut authored
-
Stas Bekman authored
* onnx triu workaround * style * working this time * add test * more efficient version
-
Sorami Hisamoto authored
`compute_objectie` => `compute_objective`
-
Kai Fricke authored
-
Maria Janina Sarol authored
* Fix TFTrainer prediction output * Update trainer_tf.py * Fix TFTrainer prediction output * Fix evaluation_loss update in TFTrainer * Fix TFTrainer prediction output
-
- 23 Jan, 2021 2 commits
-
-
Wilfried L. Bounsi authored
-
Stas Bekman authored
-
- 22 Jan, 2021 5 commits
-
-
Julien Plu authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Fixes to run_seq2seq and instructions * Add more defaults for summarization
-
Julien Plu authored
* Fix saved model tests + fix a graph issue in longformer * Apply style
-
Stefan Schweter authored
-
- 21 Jan, 2021 11 commits
-
-
Sylvain Gugger authored
* Fix memory regression in Seq2Seq example * Fix test and properly deal with -100 * Easier condition with device safety * Patch for MBartTokenzierFast
-
Julien Plu authored
* Fix Seq2Seq models for serving * Apply style * Fix lonfgormer * Fix mBart/Pegasus/Blenderbot * Apply style * Add a main intermediate layer * Apply style * Remove import * Apply tf.function to Longformer * Fix utils check_copy * Update S2S template * Fix BART + Blenderbot * Fix BlenderbotSmall * Fix BlenderbotSmall * Fix BlenderbotSmall * Fix MBart * Fix Marian * Fix Pegasus + template * Apply style * Fix common attributes test * Forgot to fix the LED test * Apply Patrick's comment on LED Decoder
-
Nicolas Patry authored
* Changing model default for TableQuestionAnsweringPipeline. - Discussion: https://discuss.huggingface.co/t/table-question-answering-is-not-an-available-task-under-pipeline/3284/6 * Updating slow tests that were out of sync.
-
Julien Plu authored
* Fix Gelu precision * Fix gelu_fast * Naming * Fix usage and apply style * add TF gelu approximate version * add TF gelu approximate version * add TF gelu approximate version * Apply style * Fix albert * Remove the usage of the Activation layer
-
Suraj Patil authored
* fix head mask in model_parallel * pass correct head mask
-
Patrick von Platen authored
-
Patrick von Platen authored
-
guillaume-be authored
* Moved ProphetNetForCausalLM's parent initialization after config update * Added unit tests for generation for ProphetNetForCausalLM
-
Lysandre Debut authored
-
Muennighoff authored
* fix typo Co-authored-by:Suraj Patil <surajp815@gmail.com>
-
Stas Bekman authored
* no --deepspeed and --sharded_ddp together * Update src/transformers/trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * style Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 20 Jan, 2021 12 commits
-
-
Sylvain Gugger authored
-
Darigov Research authored
* fix: Makes small typo corrections & standardises glossary * feat: Adds introduction & links to transformer flashcards * feat: Adds attribution & adjustments requested in #8949 * feat: Adds flashcards to community.md * refactor: Removes flashcards from glossary
-
Sylvain Gugger authored
* Fix WAND_DISABLED test * Remove duplicate import * Make a test that actually works... * Fix style
-
Sylvain Gugger authored
-
Stas Bekman authored
-
Gunjan Chhablani authored
* Fix Trainer and Args to mention AdamW, not Adam. * Update the docs for Training Arguments. * Change arguments adamw_* to adam_* * Fixed links to AdamW in TrainerArguments docs * Fix line length in Training Args docs.
-
NielsRogge authored
-
NielsRogge authored
* Add DebertaForMaskedLM, DebertaForTokenClassification, DebertaForQuestionAnswering * Add docs and fix quality * Fix Deberta not having pooler
-
Sylvain Gugger authored
-
acul3 authored
* Update run_mlm.py * add t5 model to transformers-cli convert * update rum_mlm.py same as master * update converting model docs * update converting model docs * Update convert.py * Trigger notification * update import sorted * fix typo t5
-
Julien Plu authored
-
Julien Plu authored
* Create new embeddings + add to BERT * Add Albert * Add DistilBert * Add Albert + Electra + Funnel * Add Longformer + Lxmert * Add last models * Apply style * Update the template * Remove unused imports * Rename attribute * Import embeddings in their own model file * Replace word_embeddings per weight * fix naming * Fix Albert * Fix Albert * Fix Longformer * Fix Lxmert Mobilebert and MPNet * Fix copy * Fix template * Update the get weights function * Update src/transformers/modeling_tf_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/electra/modeling_tf_electra.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address Sylvain's comments Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-