- 30 Apr, 2021 1 commit
-
-
Manuel Romero authored
Add link to code
-
- 29 Apr, 2021 1 commit
-
-
Sylvain Gugger authored
* Split checkpoint from model_name_or_path in examples * Address review comments * Address review comments
-
- 26 Apr, 2021 4 commits
-
-
Jaimeen Ahn authored
As the error comes from the inconsistency of variable meaning number of gpus in parser and its actual usage in the train.py script, 'gpus' and 'n_gpu' respectively, the correction makes the example work
-
Bhadresh Savani authored
* added changes for uniformity * modified files * corrected typo * fixed qa scripts * fix typos * fixed predict typo in qa no trainer * fixed test file * reverted trainer changes * reverted trainer changes in custom exmaples * updated readme * added changes in deepspeed test * added changes for predict and eval
-
Amine Abdaoui authored
-
Patrick von Platen authored
-
- 23 Apr, 2021 5 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Initial support for upload to hub * push -> upload * Fixes + examples * Fix torchhub test * Torchhub test I hate you * push_model_to_hub -> push_to_hub * Apply mixin to other pretrained models * Remove ABC inheritance * Add tests * Typo * Run tests * Install git-lfs * Change approach * Add push_to_hub to all * Staging test suite * Typo * Maybe like this? * More deps * Cache * Adapt name * Quality * MOAR tests * Put it in testing_utils * Docs + torchhub last hope * Styling * Wrong method * Typos * Update src/transformers/file_utils.py Co-authored-by:
Julien Chaumond <julien@huggingface.co> * Address review comments * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Julien Chaumond <julien@huggingface.co> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Yoshitomo Matsubara authored
-
Max Del authored
-
Patrick von Platen authored
-
- 22 Apr, 2021 2 commits
-
-
Matt authored
-
johnson7788 authored
fix typo Co-authored-by:johnson <johnson@github.com>
-
- 21 Apr, 2021 3 commits
-
-
Matt authored
And update README to explain the work-in-progress!
-
Matt authored
First of the new and more idiomatic TF examples!
-
Sylvain Gugger authored
* Base move * Examples reorganization * Update references * Put back test data * Move conftest * More fixes * Move test data to test fixtures * Update path * Apply suggestions from code review Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Address review comments and clean Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
- 20 Apr, 2021 2 commits
-
-
Sylvain Gugger authored
* Update to use datasets remove_cloumns method * Quality
-
rajvi-k authored
* initial changes * modified evaluation * updated evaluation * updated evaluation on text translation example script * added translation example script * Formatted translation example script * Reformatted translation example * Fixed evaluation bug and added support for other tokenisers * Fixed evaluation bug and added support for other tokenisers * Added translation example script * Formatted summarization example script * Removed typos from summarization example script
-
- 14 Apr, 2021 2 commits
-
-
Sudharsan S T authored
Co-authored-by:Sudharsan Thirumalai <sudharsan.t@sprinklr.com>
-
Nithin Holla authored
Co-authored-by:nithin19 <nithin@amberscript.com>
-
- 13 Apr, 2021 1 commit
-
-
Philipp Schmid authored
-
- 12 Apr, 2021 2 commits
-
-
Takuya Makino authored
-
Masatoshi TSUCHIYA authored
* model_path is refered as the path of the trainer, and should be ignored as the checkpoint path. * Improved according to Sgugger's comment.
-
- 09 Apr, 2021 3 commits
-
-
Stas Bekman authored
* fix _LazyModule hasher error * reword
-
Suraj Patil authored
* keep a list of multilingual tokenizers * add forced_bos_token argument
-
Saviour Owolabi authored
Corrected a typo ('Downlowd' to 'Download')
-
- 08 Apr, 2021 4 commits
-
-
Stas Bekman authored
* relocate core integration tests * add sys.path context manager * cleanup * try * try2 * fix path * doc * style * add dep * add 2 more deps
-
Andrea Cappelli authored
* Add mlm collator pad to multiple option (#10627) * Use padding to 8x in run mlm (#10627)
-
Stas Bekman authored
* synced gpus * fix * fix * need to use t5-small for quality tests * notes * complete merge * fix a disappearing std stream problem * start zero3 tests * wip * tune params * sorting out the pre-trained model loading * reworking generate loop wip * wip * style * fix tests * split the tests * refactor tests * wip * parameterized * fix * workout the resume from non-ds checkpoint pass + test * cleanup * remove no longer needed code * split getter/setter functions * complete the docs * suggestions * gpus and their compute capabilities link * Apply suggestions from code review Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * style * remove invalid paramgd * automatically configure zero3 params that rely on hidden size * make _get_resized_embeddings zero3-aware * add test exercising resize_token_embeddings() * add docstring Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Stas Bekman authored
* clarify why we get the warning here * Update examples/language-modeling/run_clm.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * wording * style Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 07 Apr, 2021 2 commits
-
-
Stas Bekman authored
these get concatenated without whitespace, so fix it
-
Stas Bekman authored
* The 'warn' method is deprecated * fix test
-
- 06 Apr, 2021 5 commits
-
-
Sylvain Gugger authored
-
SHYAM SUNDER KUMAR authored
* accelerate question answering examples with no trainer * removed train and eval flags also fixed fill np array function * Update examples/question-answering/run_qa_beam_search_no_trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/question-answering/run_qa_no_trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Lysandre authored
-
Lysandre authored
-
Hemil Desai authored
-
- 05 Apr, 2021 2 commits
-
-
Hemil Desai authored
* Initial draft for clm no trainer * Remove unwanted args * Fix bug * Update examples/language-modeling/run_clm_no_trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Stas Bekman authored
-
- 02 Apr, 2021 1 commit
-
-
versis authored
-