- 04 Sep, 2020 2 commits
-
-
Patrick von Platen authored
-
Yih-Dar authored
* Remove hard-coded uses of float32 to fix mixed precision use in TF Distilbert * fix style * fix gelu dtype issue in TF Distilbert * fix numeric overflow while using half precision
-
- 03 Sep, 2020 12 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-
krfricke authored
* move wandb/comet logger init to train() to allow parallel logging * Setup wandb/comet loggers on first call to log()
-
Sam Shleifer authored
-
Sam Shleifer authored
Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
brett koonce authored
-
Stefan Engl authored
-
David Mark Nemeskey authored
-
abdullaholuk-loodos authored
Loodos model cards had errors on "Usage" section. It is fixed. Also "electra-base-turkish-uncased" model removed from s3 and re-uploaded as "electra-base-turkish-uncased-discriminator". Its README added. (#6921) Co-authored-by:Abdullah Oluk <abdullaholuk123@gmail.com>
-
Julien Chaumond authored
cc @psorianom @rachelker
-
Sylvain Gugger authored
-
Antonio V Mendoza authored
Adding the LXMERT pretraining model (MultiModal languageXvision) to HuggingFace's suite of models (#5793) * added template files for LXMERT and competed the configuration_lxmert.py * added modeling, tokization, testing, and finishing touched for lxmert [yet to be tested] * added model card for lxmert * cleaning up lxmert code * Update src/transformers/modeling_lxmert.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/modeling_tf_lxmert.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/modeling_tf_lxmert.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/modeling_lxmert.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * tested torch lxmert, changed documtention, updated outputs, and other small fixes * Update src/transformers/convert_pytorch_checkpoint_to_tf2.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/convert_pytorch_checkpoint_to_tf2.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/convert_pytorch_checkpoint_to_tf2.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * renaming, other small issues, did not change TF code in this commit * added lxmert question answering model in pytorch * added capability to edit number of qa labels for lxmert * made answer optional for lxmert question answering * add option to return hidden_states for lxmert * changed default qa labels for lxmert * changed config archive path * squshing 3 commits: merged UI + testing improvments + more UI and testing * changed some variable names for lxmert * TF LXMERT * Various fixes to LXMERT * Final touches to LXMERT * AutoTokenizer order * Add LXMERT to index.rst and README.md * Merge commit test fixes + Style update * TensorFlow 2.3.0 sequential model changes variable names Remove inherited test * Update src/transformers/modeling_tf_pytorch_utils.py * Update docs/source/model_doc/lxmert.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/model_doc/lxmert.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_tf_lxmert.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * added suggestions * Fixes * Final fixes for TF model * Fix docs Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 02 Sep, 2020 12 commits
-
-
Puneetha Pai authored
-
Stas Bekman authored
Since `generate()` does: ``` num_beams = num_beams if num_beams is not None else self.config.num_beams ``` This test fails if `model.config.num_beams > 1` (which is the case in the model I'm porting). This fix makes the test setup unambiguous by passing an explicit `num_beams=1` to `generate()`. Thanks. -
Sylvain Gugger authored
* Fix output_attention -> output_attentions * Formatting * One unsaved file
-
Yohei Tamura authored
-
Suraj Patil authored
* add Text2TextGenerationPipeline * remove max length warning * remove comments * remove input_length * fix typo * add tests * use TFAutoModelForSeq2SeqLM * doc * typo * add the doc below TextGenerationPipeline * doc nit * style * delete comment
-
Prajjwal Bhargava authored
-
Stas Bekman authored
* [doc] typos fixed typos * Update README.md
-
Harry Wang authored
-
Patrick von Platen authored
-
Parthe Pandit authored
outptus -> outputs in example of BertForPreTraining
-
David Mark Nemeskey authored
* Create README.md Model card for huBERT. * Update README.md lowercase h * Update model_cards/SZTAKI-HLT/hubert-base-cc/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Patrick von Platen authored
-
- 01 Sep, 2020 14 commits
-
-
Julien Chaumond authored
-
Rohan Rajpal authored
-
Rohan Rajpal authored
-
Igli Manaj authored
Fix range of possible score, add inference .
-
Tom Grek authored
-
zolekode authored
Co-authored-by:zolekode <pascal.zoleko@fau.de>
-
hakan authored
-
Manuel Romero authored
Add language meta attribute
-
Manuel Romero authored
Add language meta attribute
-
Abed khooli authored
* Create README.md model card for akhooli/xlm-r-large-arabic-sent * Update model_cards/akhooli/xlm-r-large-arabic-sent/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Abed khooli authored
-
Patrick von Platen authored
* finish xlm-roberta * finish docs * expose XLMRobertaForCausalLM
-
Patrick von Platen authored
* Create README.md * Update README.md
-
Jin Young (Daniel) Sohn authored
* Add cache_dir to save features TextDataset This is in case the dataset is in a RO filesystem, for which is the case in tests (GKE TPU tests). * style
-