- 12 Aug, 2020 9 commits
-
-
Sam Shleifer authored
-
Sylvain Gugger authored
* Data collator with padding * Add type annotation * Support tensors as well * Add comment * Fix for labels wrong shape * Data collator with padding * Add type annotation * Support tensors as well * Add comment * Fix for labels wrong shape * Remove changes rendered unnecessary
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Activate check on the CI * Fix repo inconsistencies * Don't document too much
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* allow using tokenizer.pad as a collate_fn in pytorch * allow using tokenizer.pad as a collate_fn in pytorch * Add documentation and tests * Make attention mask the right shape * Better test Co-authored-by:Thomas Wolf <thomwolf@users.noreply.github.com>
-
Stas Bekman authored
* replace capsys with the more refined CaptureStderr/CaptureStdout * Update examples/seq2seq/test_seq2seq_examples.py Co-authored-by:Sam Shleifer <sshleifer@gmail.com>
-
Jared T Nielsen authored
* Fix FFN dropout in TFAlbertLayer, and split dropout in TFAlbertAttention into two separate dropout layers. * Same dropout fixes for PyTorch.
-
Lysandre Debut authored
-
- 11 Aug, 2020 28 commits
-
-
Rohan Rajpal authored
* Create README.md * Update model_cards/rohanrajpal/bert-base-codemixed-uncased-sentiment/README.md * Update model_cards/rohanrajpal/bert-base-codemixed-uncased-sentiment/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Manuel Romero authored
-
Manuel Romero authored
-
Manuel Romero authored
-
Manuel Romero authored
-
Manuel Romero authored
-
Rohan Rajpal authored
* Create README.md * add results on SAIL dataset * Update model_cards/rohanrajpal/bert-base-multilingual-codemixed-cased-sentiment/README.md Co-authored-by:
Julien Chaumond <chaumond@gmail.com> Co-authored-by:
Julien Chaumond <chaumond@gmail.com>
-
Sam Shleifer authored
-
Stas Bekman authored
* [wip] add get_polynomial_decay_schedule_with_warmup * style * add assert * change lr_end to a much smaller default number * check for exact equality * [model_cards] electra-base-turkish-cased-ner (#6350) * for electra-base-turkish-cased-ner * Add metadata Co-authored-by:
Julien Chaumond <chaumond@gmail.com> * Temporarily de-activate TPU CI * Update modeling_tf_utils.py (#6372) fix typo: ckeckpoint->checkpoint * the test now works again (#6371) * correct pl link in readme (#6364) * refactor almost identical tests (#6339) * refactor almost identical tests * important to add a clear assert error message * make the assert error even more descriptive than the original bt * Small docfile fixes (#6328) * Patch models (#6326) * TFAlbertFor{TokenClassification, MultipleChoice} * Patch models * BERT and TF BERT info s * Update check_repo * Ci GitHub caching (#6382) * Cache Github Actions CI * Remove useless file * Colab button (#6389) * Add colab button * Add colab link for tutorials * Fix links for open in colab (#6391) * Update src/transformers/optimization.py consistently use lr_end=1e-7 default Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * [wip] add get_polynomial_decay_schedule_with_warmup * style * add assert * change lr_end to a much smaller default number * check for exact equality * Update src/transformers/optimization.py consistently use lr_end=1e-7 default Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove dup (leftover from merge) * convert the test into the new refactored format * stick to using the current_step as is, without ++ Co-authored-by:
M. Yusuf Sarıgöz <yusufsarigoz@gmail.com> Co-authored-by:
Julien Chaumond <chaumond@gmail.com> Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by:
Alexander Measure <ameasure@gmail.com> Co-authored-by:
Rohit Gupta <rohitgr1998@gmail.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
cedspam authored
* Create README.md * Update README.md
-
Stas Bekman authored
-
Sam Shleifer authored
-
Sam Shleifer authored
Co-authored-by:Jingqing Zhang <jingqing.zhang15@imperial.ac.uk>
-
Stas Bekman authored
-
Stas Bekman authored
-
Pranav Vadrevu authored
-
Abed khooli authored
* Create README.md Model card for https://huggingface.co/akhooli/gpt2-small-arabic * Update model_cards/akhooli/gpt2-small-arabic/README.md Co-authored-by:
Julien Chaumond <chaumond@gmail.com>
-
Nick Doiron authored
-
Abed khooli authored
* Create README.md * Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md * Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md * Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md * Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
guillaume-be authored
* Optimized banned token masking * Avoid duplicate EOS masking if in bad_words_id * Updated mask generation to handle empty banned token list * Addition of unit tests for the updated bad_words_ids masking * Updated timeout handling in `test_postprocess_next_token_scores_large_bad_words_list` unit test * Updated timeout handling in `test_postprocess_next_token_scores_large_bad_words_list` unit test (timeout does not work on Windows) * Moving Marian import to the test context to allow TF only environments to run * Moving imports to torch_available test * Updated operations device and test * Updated operations device and test * Added docstring and comment for in-place scores modification * Moving test to own test_generation_utils, use of lighter models for testing * removed unneded imports in test_modeling_common * revert formatting change for ModelTesterMixin * Updated caching, simplified eos token id test, removed unnecessary @require_torch * formatting compliance
-
David LaPalomento authored
* Warn if debug requested without TPU fixes (#6308) Check whether a PyTorch compatible TPU is available before attempting to print TPU metrics after training has completed. This way, users who apply `--debug` without reading the documentation aren't suprised by a stacktrace. * Style Co-authored-by:Lysandre <lysandre.debut@reseau.eseo.fr>
-
Junyuan Zheng authored
* fix tokenizer saving and loading bugs when adding AddedToken to additional special tokens * Add tokenizer test * Style * Style 2 Co-authored-by:Lysandre <lysandre.debut@reseau.eseo.fr>
-
Stas Bekman authored
* testing utils: capturing std streams context manager * style * missing import * add the origin of this code
-
Stas Bekman authored
* add pl_glue example test * for now just test that it runs, next validate results of eval or predict? * complete the run_pl_glue test to validate the actual outcome * worked on my machine, CI gets less accuracy - trying higher epochs * match run_pl.sh hparms * more epochs? * trying higher lr * for now just test that the script runs to a completion * correct the comment * if cuda is available, add --fp16 --gpus=1 to cover more bases * style
-
Pradhy729 authored
* Chunked feed forward for Bert This is an initial implementation to test applying feed forward chunking for BERT. Will need additional modifications based on output and benchmark results. * Black and cleanup * Feed forward chunking in BertLayer class. * Isort * add chunking for all models * fix docs * Fix typo Co-authored-by:patrickvonplaten <patrick.v.platen@gmail.com>
-
Lysandre authored
-
zcain117 authored
-
Sam Shleifer authored
-
- 10 Aug, 2020 3 commits
-
-
Patrick von Platen authored
* improve names and tests longformer * more and better tests for longformer * add first tf test * finalize tf basic op functions * fix merge * tf shape test passes * narrow down discrepancies * make longformer local attn tf work * correct tf longformer * add first global attn function * add more global longformer func * advance tf longformer * finish global attn * upload big model * finish all tests * correct false any statement * fix common tests * make all tests pass except keras save load * fix some tests * fix torch test import * finish tests * fix test * fix torch tf tests * add docs * finish docs * Update src/transformers/modeling_longformer.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update src/transformers/modeling_tf_longformer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply Lysandres suggestions * reverse to assert statement because function will fail otherwise * applying sylvains recommendations * Update src/transformers/modeling_longformer.py Co-authored-by:
Sam Shleifer <sshleifer@gmail.com> * Update src/transformers/modeling_tf_longformer.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sam Shleifer <sshleifer@gmail.com>
-
Patrick von Platen authored
* correct encoder decoder model * Apply suggestions from code review * apply sylvains suggestions
-
Sylvain Gugger authored
-