- 13 Nov, 2020 4 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Branden Chan authored
* Update README.md * Update README.md
-
Sylvain Gugger authored
-
- 12 Nov, 2020 9 commits
-
-
Julien Plu authored
* Add pretraining loss computation for TF Bert pretraining * Fix labels creation * Fix T5 model * restore T5 kwargs * try a generic fix for pretraining models * Apply style * Overide the prepare method for the BERT tests
-
Julien Plu authored
-
Julien Plu authored
-
Forrest Iandola authored
-
Sylvain Gugger authored
* Model sharing doc * Style
-
Chengxi Guo authored
* fix doc bug Signed-off-by:
mymusise <mymusise1@gmail.com> * fix example bug Signed-off-by:
mymusise <mymusise1@gmail.com>
-
zeyuyun1 authored
-
Antonio Lanza authored
-
Julien Chaumond authored
cc @Pierrci
-
- 11 Nov, 2020 11 commits
-
-
Funtowicz Morgan authored
* Update deploy-docs dependencies on CI to enable Flax Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co> * Added pair of "" Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co>
-
Sumithra Bhakthavatsalam authored
Co-authored-by:Sam Shleifer <sshleifer@gmail.com>
-
Funtowicz Morgan authored
* First addition of Flax/Jax documentation Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co> * make style * Ensure input order match between Bert & Roberta Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co> * Install dependencies "all" when building doc Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co> * wraps build_doc deps with "" Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co> * Addressing @sgugger comments. Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co> * Use list to highlight JAX features. Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co> * Make style. Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co> * Let's not look to much into the future for now. Signed-off-by:
Morgan Funtowicz <morgan@huggingface.co> * Style Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr>
-
Lysandre authored
-
Beomsoo Kim authored
-
Ratthachat (Jung) authored
* Create modeling_tf_dpr.py * Add TFDPR * Add back TFPegasus, TFMarian, TFMBart, TFBlenderBot last commit accidentally deleted these 4 lines, so I recover them back * Add TFDPR * Add TFDPR * clean up some comments, add TF input-style doc string * Add TFDPR * Make return_dict=False as default * Fix return_dict bug (in .from_pretrained) * Add get_input_embeddings() * Create test_modeling_tf_dpr.py The current version is already passed all 27 tests! Please see the test run at : https://colab.research.google.com/drive/1czS_m9zy5k-iSJbzA_DP1k1xAAC_sdkf?usp=sharing * fix quality * delete init weights * run fix copies * fix repo consis * del config_class, load_tf_weights They shoud be 'pytorch only' * add config_class back after removing it, test failed ... so totally only removing "use_tf_weights = None" on Lysandre suggestion * newline after .. note:: * import tf, np (Necessary for ModelIntegrationTest) * slow_test from_pretrained with from_pt=True At the moment we don't have TF weights (since we don't have official official TF model) Previously, I did not run slow test, so I missed this bug * Add simple TFDPRModelIntegrationTest Note that this is just a test that TF and Pytorch gives approx. the same output. However, I could not test with the official DPR repo's output yet * upload correct tf model * remove position_ids as missing keys Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
patrickvonplaten <patrick@huggingface.co>
-
sarnoult authored
The new run_ner.py script tries to run prediction on the input test set `datasets["test"]`, but it should be the tokenized set `tokenized_datasets["test"]`
-
Julien Plu authored
-
Julien Plu authored
* Add next sentence prediction loss computation * Apply style * Fix tests * Add forgotten import * Add forgotten import * Use a new parameter * Remove kwargs and use positional arguments
-
Julien Plu authored
-
Julien Chaumond authored
-
- 10 Nov, 2020 16 commits
-
-
Pedro authored
Bug fix for modeling utilities function: apply_chunking_to_forward, chunking should be in the chunking dimension, an exception was raised if the complete shape of the inputs was not the same rather than only the chunking dimension (#8391) Co-authored-by:pedro <pe25171@mit.edu>
-
Patrick von Platen authored
-
Lysandre Debut authored
* stash * TF Integration testing for ELECTRA, BERT, Longformer * Trigger slow tests * Apply suggestions from code review
-
Santiago Castro authored
-
Stas Bekman authored
* s|multiple_gpu|multi_gpu|g; s|multigpu|multi_gpu|g' * doc
-
Patrick von Platen authored
-
Julien Plu authored
* Add missing import * Fix dummy objects
-
Stas Bekman authored
* [testing utils] get_auto_remove_tmp_dir default change Now that I have been using `get_auto_remove_tmp_dir default change` for a while, I realized that the defaults aren't most optimal. 99% of the time we want the tmp dir to be empty at the beginning of the test - so changing the default to `before=True` - this shouldn't impact any tests since this feature is used only during debug. * simplify things * update docs * fix doc layout * style * Update src/transformers/testing_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * better 3-state doc * style * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * s/tmp/temporary/ + style * correct the statement Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Julien Plu authored
* Add a Windows dev section in the contributing file. * Forgotten link * Trigger CI * Rework description * Trigger CI
-
Julien Plu authored
* Add auto next sentence prediction * Fix style * Add mobilebert next sentence prediction
-
Sam Shleifer authored
-
Sylvain Gugger authored
* Remove SO from question template * Styling
-
Stas Bekman authored
-
Shichao Sun authored
-
Lysandre authored
-
Lysandre authored
-