- 20 Nov, 2020 6 commits
-
-
Patrick von Platen authored
-
Binoy Dalal authored
* refactored exisiting nested loops to vectorized implementation * replaced explicit indexing with torch.where * modifying score for previous input_ids only
-
Roman Kalyakin authored
-
Quentin Lhoest authored
* replace init_ddp_connection for index init * style * add finetune test * add test data * move generate tensors to device * add test on EM metric * style * allow multi process test * keep gloo process group for retrieval * add multi-gpu test * use custom accelerator * clean test finetune * minor * style * style * typo * use python call instead of imported main fumction * return_dict fix in modeling_rag * use float32 in retrieval * store as float32 as well in the custom knowledge dataset example * style * rename to finetune_rag * style * update readme * rename utils and callbacks to utils_rag and callbacks_rag * fix test * patrick's comments * generate dummy data in the finetue test script * remove dummy data files * style
-
Sylvain Gugger authored
-
Kevin Canwen Xu authored
* Update the bibtex with EMNLP demo * Update README.md * Update README.md
-
- 19 Nov, 2020 19 commits
-
-
Sylvain Gugger authored
* Fix the CI and tests * Fix quality * Remove that m form nowhere
-
Stas Bekman authored
* fix deprecation warning * fix
-
Arindum Roy authored
The heading was originally uncased, which did not reflect the contents of this README. Changed it to cased.
-
Stas Bekman authored
-
Stas Bekman authored
You may be unaware but you're running some software that meddles with every commit on https://github.com/huggingface/transformers/ Something is wrong with the software you're using. It adds a reference to almost every PR in the master tree. Which is very wrong. Please check your software and please don't do it again. Example: see the bottom of this PR and most other PRs: https://github.com/huggingface/transformers/pull/8639
-
Stas Bekman authored
* don't reconvert when the type is already right * better name * adjust logic as suggested * merge
-
Sylvain Gugger authored
* Fix run_ner script * Pin datasets
-
Zhylko Dima authored
* `disable_ngram_loss` fix for prophetnet * add changes documentation * fix _compute_loss to use mean reduction and -100 to masked tokens & remove unnecessary arguments * mean label smoothing loss * small refactor * fix test Co-authored-by:patrickvonplaten <patrick.v.platen@gmail.com>
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
LysandreJik authored
-
Sylvain Gugger authored
-
Matthias authored
Fixed a small typo on the XLNet and permutation language modelling section
-
Sylvain Gugger authored
* Better filtering of the model outputs in Trainer * Fix examples tests * Add test for Lysandre
-
Lysandre Debut authored
* CI should install `sentencepiece` * Requiring TF * Fixing some TFDPR bugs * remove return_dict=False/True hack Co-authored-by:patrickvonplaten <patrick.v.platen@gmail.com>
-
elk-cloner authored
* working on LongformerForSequenceClassification * add TFLongformerForMultipleChoice * add TFLongformerForTokenClassification * use add_start_docstrings_to_model_forward * test TFLongformerForSequenceClassification * test TFLongformerForMultipleChoice * test TFLongformerForTokenClassification * remove test from repo * add test and doc for TFLongformerForSequenceClassification, TFLongformerForTokenClassification, TFLongformerForMultipleChoice * add requested classes to modeling_tf_auto.py update dummy_tf_objects fix tests fix bugs in requested classes * pass all tests except test_inputs_embeds * sync with master * pass all tests except test_inputs_embeds * pass all tests * pass all tests * work on test_inputs_embeds * fix style and quality * make multi choice work * fix TFLongformerForTokenClassification signature * fix TFLongformerForMultipleChoice, TFLongformerForSequenceClassification signature * fix mult choice * fix mc hint * fix input embeds * fix input embeds * refactor input embeds * fix copy issue * apply sylvains changes and clean more Co-authored-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
Quentin Lhoest authored
-
Amine Abdaoui authored
the table was badly formatted because of a single line break
-
Amine Abdaoui authored
* docs(bert-base-15lang-cased): add model card * add cards for all Geotrend models * [model cards] fix language tag for all Geotrend models
-
- 18 Nov, 2020 15 commits
-
-
cronoik authored
* Updated the Extractive Question Answering code snippets The Extractive Question Answering code snippets do not work anymore since the models return task-specific output objects. This commit fixes the pytorch and tensorflow examples but adding `.values()` to the model call. * Update task_summary.rst
-
Tim Isbister authored
-
cronoik authored
-
Stas Bekman authored
* apex breaks return_dict obj * style
-
Perez Ogayo authored
* Updated ModelCard * Apply suggestions from code review Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Yifan Peng authored
-
Manuel Romero authored
* Model card: T5-base fine-tuned on QuaRTz * Update model_cards/mrm8488/t5-base-finetuned-quartz/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Yifan Peng authored
-
Divyanshu Kakwani authored
-
dartrevan authored
* Update README.md * Update README.md
-
Abhilash Majumder authored
* Model Card for abhilash1910/financial_roberta * Update model_cards/abhilash1910/financial_roberta/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Vishal Singh authored
Modified Model in Action section. The class `AutoModelWithLMHead` is deprecated so changed it to `AutoModelForSeq2SeqLM` for encoder-decoder models. Removed duplicate eos token.
-
smanjil authored
* replace performance table with markdown * Update model_cards/smanjil/German-MedBERT/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
hhou435 authored
-
Sylvain Gugger authored
-