- 12 Aug, 2020 1 commit
-
-
Lysandre Debut authored
-
- 11 Aug, 2020 9 commits
-
-
Sam Shleifer authored
-
Stas Bekman authored
* [wip] add get_polynomial_decay_schedule_with_warmup * style * add assert * change lr_end to a much smaller default number * check for exact equality * [model_cards] electra-base-turkish-cased-ner (#6350) * for electra-base-turkish-cased-ner * Add metadata Co-authored-by:
Julien Chaumond <chaumond@gmail.com> * Temporarily de-activate TPU CI * Update modeling_tf_utils.py (#6372) fix typo: ckeckpoint->checkpoint * the test now works again (#6371) * correct pl link in readme (#6364) * refactor almost identical tests (#6339) * refactor almost identical tests * important to add a clear assert error message * make the assert error even more descriptive than the original bt * Small docfile fixes (#6328) * Patch models (#6326) * TFAlbertFor{TokenClassification, MultipleChoice} * Patch models * BERT and TF BERT info s * Update check_repo * Ci GitHub caching (#6382) * Cache Github Actions CI * Remove useless file * Colab button (#6389) * Add colab button * Add colab link for tutorials * Fix links for open in colab (#6391) * Update src/transformers/optimization.py consistently use lr_end=1e-7 default Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * [wip] add get_polynomial_decay_schedule_with_warmup * style * add assert * change lr_end to a much smaller default number * check for exact equality * Update src/transformers/optimization.py consistently use lr_end=1e-7 default Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove dup (leftover from merge) * convert the test into the new refactored format * stick to using the current_step as is, without ++ Co-authored-by:
M. Yusuf Sarıgöz <yusufsarigoz@gmail.com> Co-authored-by:
Julien Chaumond <chaumond@gmail.com> Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by:
Alexander Measure <ameasure@gmail.com> Co-authored-by:
Rohit Gupta <rohitgr1998@gmail.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Stas Bekman authored
-
Sam Shleifer authored
-
Sam Shleifer authored
Co-authored-by:Jingqing Zhang <jingqing.zhang15@imperial.ac.uk>
-
Stas Bekman authored
-
Stas Bekman authored
-
Stas Bekman authored
* add pl_glue example test * for now just test that it runs, next validate results of eval or predict? * complete the run_pl_glue test to validate the actual outcome * worked on my machine, CI gets less accuracy - trying higher epochs * match run_pl.sh hparms * more epochs? * trying higher lr * for now just test that the script runs to a completion * correct the comment * if cuda is available, add --fp16 --gpus=1 to cover more bases * style
-
Sam Shleifer authored
-
- 10 Aug, 2020 2 commits
-
-
Rohit Gupta authored
-
Stas Bekman authored
-
- 09 Aug, 2020 1 commit
-
-
Sam Shleifer authored
-
- 08 Aug, 2020 3 commits
-
-
Suraj Patil authored
-
Sam Shleifer authored
-
Stas Bekman authored
as discussed with @sshleifer, removing this TODO to switch to a tiny model, since it won't be able to test the results of the evaluation (i.e. the results are meaningless).
-
- 07 Aug, 2020 3 commits
-
-
zcain117 authored
* Add setup for TPU CI to run every hour. * Re-organize config.yml Co-authored-by:Lysandre <lysandre.debut@reseau.eseo.fr>
-
Stas Bekman authored
-
Stas Bekman authored
-
- 06 Aug, 2020 4 commits
-
-
Bhashithe Abeysinghe authored
Co-authored-by:Sam Shleifer <sshleifer@gmail.com>
-
xujiaze13 authored
-
Sam Shleifer authored
Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Doug Blank authored
* Support for Comet.ml * Need to import comet first * Log this model, not the one in the backprop step * Log args as hyperparameters; use framework to allow fine control * Log hyperparameters with context * Apply black formatting * isort fix integrations * isort fix __init__ * Update src/transformers/trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer_tf.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Address review comments * Style + Quality, remove Tensorboard import test Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr>
-
- 05 Aug, 2020 1 commit
-
-
Stas Bekman authored
* support --lr_scheduler with multiple possibilities * correct the error message * add a note about supported schedulers * cleanup * cleanup2 * needs the argument default * style * add another assert in the test * implement requested changes * cleanups * fix relative import * cleanup
-
- 03 Aug, 2020 3 commits
-
-
Sam Shleifer authored
* Document better MT command * improve multigpu command
-
Victor SANH authored
-
Sam Shleifer authored
-
- 01 Aug, 2020 1 commit
-
-
Stas Bekman authored
Co-authored-by:Sam Shleifer <sshleifer@gmail.com>
-
- 31 Jul, 2020 1 commit
-
-
Stas Bekman authored
* enable easy checkout switch allow having multiple repository checkouts and not needing to remember to rerun 'pip install -e .[dev]' when switching between checkouts and running tests. * make isort happy * examples needs one too
-
- 30 Jul, 2020 2 commits
-
-
Sylvain Gugger authored
* Switch from return_tuple to return_dict * Fix test * [WIP] Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleC… (#5614) * Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleChoice} models and tests * AutoModels Tiny tweaks * Style * Final changes before merge * Re-order for simpler review * Final fixes * Addressing @sgugger's comments * Test MultipleChoice * Rework TF trainer (#6038) * Fully rework training/prediction loops * fix method name * Fix variable name * Fix property name * Fix scope * Fix method name * Fix tuple index * Fix tuple index * Fix indentation * Fix variable name * fix eval before log * Add drop remainder for test dataset * Fix step number + fix logging datetime * fix eval loss value * use global step instead of step + fix logging at step 0 * Fix logging datetime * Fix global_step usage * Fix breaking loop + logging datetime * Fix step in prediction loop * Fix step breaking * Fix train/test loops * Force TF at least 2.2 for the trainer * Use assert_cardinality to facilitate the dataset size computation * Log steps per epoch * Make tfds compliant with TPU * Make tfds compliant with TPU * Use TF dataset enumerate instead of the Python one * revert previous commit * Fix data_dir * Apply style * rebase on master * Address Sylvain's comments * Address Sylvain's and Lysandre comments * Trigger CI * Remove unused import * Switch from return_tuple to return_dict * Fix test * Add recent model Co-authored-by:Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Julien Plu <plu.julien@gmail.com>
-
Stas Bekman authored
-
- 29 Jul, 2020 2 commits
-
-
Julien Plu authored
* Fully rework training/prediction loops * fix method name * Fix variable name * Fix property name * Fix scope * Fix method name * Fix tuple index * Fix tuple index * Fix indentation * Fix variable name * fix eval before log * Add drop remainder for test dataset * Fix step number + fix logging datetime * fix eval loss value * use global step instead of step + fix logging at step 0 * Fix logging datetime * Fix global_step usage * Fix breaking loop + logging datetime * Fix step in prediction loop * Fix step breaking * Fix train/test loops * Force TF at least 2.2 for the trainer * Use assert_cardinality to facilitate the dataset size computation * Log steps per epoch * Make tfds compliant with TPU * Make tfds compliant with TPU * Use TF dataset enumerate instead of the Python one * revert previous commit * Fix data_dir * Apply style * rebase on master * Address Sylvain's comments * Address Sylvain's and Lysandre comments * Trigger CI * Remove unused import
-
Lysandre Debut authored
-
- 28 Jul, 2020 5 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Stas Bekman authored
* add a link to README.md * Update README.md
-
Sam Shleifer authored
* MBART: support summarization tasks * fix test * Style * add tokenizer test
-
Sam Shleifer authored
-
- 27 Jul, 2020 2 commits
-
-
Sam Shleifer authored
-
Sam Shleifer authored
-