- 17 Sep, 2020 9 commits
-
-
Sohee Yang authored
* Move 'from transformers' statements to relative imports in some files * Add python prompt symbols in front of the example codes * Reformat the code * Add one missing space Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Stas Bekman authored
* [model cards] ported allenai Deep Encoder, Shallow Decoder models * typo * fix references * add allenai/wmt19-de-en-6-6 model cards * fill-in the missing info for the build script as provided by the searcher.
-
Stas Bekman authored
* ready for PR * cleanup * correct FSMT_PRETRAINED_MODEL_ARCHIVE_LIST * fix * perfectionism * revert change from another PR * odd, already committed this one * non-interactive upload workaround * backup the failed experiment * store langs in config * workaround for localizing model path * doc clean up as in https://github.com/huggingface/transformers/pull/6956 * style * back out debug mode * document: run_eval.py --num_beams 10 * remove unneeded constant * typo * re-use bart's Attention * re-use EncoderLayer, DecoderLayer from bart * refactor * send to cuda and fp16 * cleanup * revert (moved to another PR) * better error message * document run_eval --num_beams * solve the problem of tokenizer finding the right files when model is local * polish, remove hardcoded config * add a note that the file is autogenerated to avoid losing changes * prep for org change, remove u...
-
Sylvain Gugger authored
* Trainer accep multiple labels * Missing import * Fix dosctrings
-
RafaelWO authored
* Removed 'tgt_len' and 'ext_len' from Transfomer-XL * Some changes are still to be done * Removed 'tgt_len' and 'ext_len' from Transfomer-XL (2) * Removed comments * Fixed quality * Changed warning to info
-
Dhaval Taunk authored
* added multilabel classification using distilbert notebook to community notebooks * added multilabel classification using distilbert notebook to community notebooks
-
Stas Bekman authored
``` /home/circleci/.local/lib/python3.6/site-packages/isort/main.py:915: UserWarning: W0501: The following deprecated CLI flags were used and ignored: --recursive! "W0501: The following deprecated CLI flags were used and ignored: " ```
-
Stas Bekman authored
-
Sam Shleifer authored
-
- 16 Sep, 2020 11 commits
-
-
Sam Shleifer authored
-
sgugger authored
-
Stas Bekman authored
-
Stas Bekman authored
Co-authored-by:Sam Shleifer <sshleifer@gmail.com>
-
Antoine Louis authored
* Create README.md * Update README.md
-
Sylvain Gugger authored
* Rewrite and update README * Typo and migration guide * Apply suggestions from code review Co-authored-by:
Thomas Wolf <thomwolf@users.noreply.github.com> * Address Clem's comments Co-authored-by:
Thomas Wolf <thomwolf@users.noreply.github.com>
-
Donna Choi authored
-
Stas Bekman authored
-
Stas Bekman authored
* build/eval/gen-card scripts for fsmt * adjust for model renames
-
Xi Ye authored
-
Julien Plu authored
* Refactoring the activations functions into a common file * Apply style * remove unused import * fix tests * Fix tests.
-
- 15 Sep, 2020 15 commits
-
-
Stas Bekman authored
* [docs] add testing documentation * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * tweaks as suggested * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * tweaks * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update docs/source/testing.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * more tweaks * suggestions from @LysandreJik Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
-
Yih-Dar authored
* fix ZeroDivisionError and epoch counting * Add test for num_train_epochs calculation in trainer.py * Remove @require_non_multigpu for test_num_train_epochs_in_training
-
Patrick von Platen authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Allow multiple outputs * Formatting * Move the unwrapping before metrics * Fix typo * Add test for non-supported config options
-
Pedro Lima authored
From **Language-Agnostic BERT Sentence Embedding** https://ai.googleblog.com/2020/08/language-agnostic-bert-sentence.html
-
tuner007 authored
Model card for PEGASUS finetuned for paraphrasing task
-
Philip May authored
* changed eval table model order * Update install * update mc
-
Siddharth Jain authored
-
Paul O'Leary McCann authored
Without quotes directories with spaces in them will fail to be processed correctly.
-
Yih-Dar authored
* create branch for issue #6968 * First attempt to fix incorrect tf trainer loss calculation * Fix training loss in metric * fix tf trainer evaluation loss * apply count_instances_in_batch() for eval and test datasets * prototype of using a new argument in trainer_tf.py to fix loss issue * some renaming and fix, in particular for evaluation methods * fix bugs to have a running version * change to @staticmethod * apply style
-
Stas Bekman authored
-
Sylvain Gugger authored
* Fix reproducible tests in Trainer * Deal with multiple GPUs
-
Sam Shleifer authored
-
- 14 Sep, 2020 5 commits
-
-
Sam Shleifer authored
-
sgugger authored
-
Kevin Canwen Xu authored
* Add Tuna Mirror for Downloads from China * format fix * Use preset instead of hardcoding URL * Fix * make style * update the mirror option doc * update the mirror
-
Antonio V Mendoza authored
Demoing LXMERT with raw images by incorporating the FRCNN model for roi-pooled extraction and bounding-box predction on the GQA answer set. (#6986) * adding demo * Update examples/lxmert/requirements.txt Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update examples/lxmert/checkpoint.sh Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * added user input for .py demo * updated model loading, data extrtaction, checkpoints, and lots of other automation * adding normalizing for bounding boxes * Update requirements.txt * some optimizations for extracting data * added data extracting file * added data extraction file * minor fixes to reqs and readme * Style * remove options Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr>
-
sgugger authored
-