- 07 Sep, 2021 4 commits
-
-
Anton Lozhkov authored
-
guillaume-be authored
* Optimized bad word ids generation * Fixed optimized bad token ids * Updated style
-
Nicolas Patry authored
-
Nathan Raw authored
*
✅ Update image-classification example's tests *🔥 remove cats_and_dogs test samples *💄 fix flake8
-
- 06 Sep, 2021 11 commits
-
-
Anurag Kumar authored
-
Shiv Dhar authored
-
Anton Lozhkov authored
-
NielsRogge authored
* Add conversion of TapasForMaskedLM * Add copied from statements
-
Suraj Patil authored
-
Nils Reimers authored
* refactor GPT Config to allow dyn. properties * make attribute_map a class attribute * remove old code * update unit test to test config: Add test for common properties setter * update unit test to test config: Add test for common properties passed as parameters to __init__ * update to black code format * Allow that setters are not defined for certain config classes * update config classes to implement attribute_map * bugfix lxmert config - id2labels was not defined when num_labels was set * update broken configs - add attribute_maps * update bart config * update black codestyle * update documentation on common config attributes * update GPTJ config to new attribute map * update docs on common attributes * gptj config: add max_position_embeddings * gptj config: format with black * update speech to text 2 config * format doc file to max_len 119 * update config template
-
Nicolas Patry authored
* Adding a test for multibytes unicode. * Adding some accents. * Making sure decoding works. * Make tests passing by being cheesy.
-
Patrick von Platen authored
-
Suraj Patil authored
-
Anton Lozhkov authored
* Add inputs to pretrained tests * Make style
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 02 Sep, 2021 11 commits
-
-
Nathan Raw authored
*
✨ add pytorch image classification example *🔥 remove utils.py *💄 fix flake8 style issues *🔥 remove unnecessary line *✨ limit dataset sizes *📌 update reqs *🎨 restructure - use datasets lib *🎨 import transforms directly *📝 add comments *💄 style *🔥 remove flag *📌 update requirement warning *📝 add vision README.md *📝 update README.md *📝 update README.md *🎨 add image-classification tag to model card *🚚 rename vision➡ ️ image-classification *📝 update image-classification README.md -
Patrick von Platen authored
-
Patrick von Platen authored
-
Aman Madaan authored
* [docs] Update perplexity.rst to use negative log likelihood Model `forward` returns the negative log likelihood. The document correctly defines and calculates perplexity, but the description and variable names are inconsistent, which might cause confusion. * [docs] restyle perplexity.rst
-
Apoorv Garg authored
* correct order of overflowing_tokens for slow tokenizer (issue fix #13148) * python 3.9 requires sentencepiece version 0.1.94 or above * slicing of ids fixed in truncated_sequence() * Update setup.py * Correct order of overflowing tokens for pair of sentences * code reformatted * Update tokenization_utils_base.py * reformatting file * test to check single_input added * missing function restored * test to check pair_input overflowing tokens order * test to check pair_input overflowing tokens order * test to check pair_input overflowing tokens order * added an error message for pair of seq and longest_first strategy * test for pair_input modified * variable name corrected * fixed a typo in error message * requested changes implemented * required test added * Corrected the message to match test message * added error message for Luke Tokenizer * lost test recovered * docstring for truncate_sequences and prepare_for_model updated * docstring for luke tokenizer updated * updated ENCODE_PLUS_ADDITIONAL_KWARGS_DOCSTRING * aligned text and fixed puncuatations * improved style and quality of code * fixed error_msg in truncate_sequences * replaced encode_plus method with regular call method * clean up * rephrased the docstring
-
Nicolas Patry authored
`audio-classification`.
-
Suraj Patil authored
-
NielsRogge authored
-
Sachin Abeywardana authored
* Update clip loss calculation Hello, I'm the author of the blog you took the snippet from. I think this way of calculating is possibly slightly more accurate for calculation. * Apply suggestions from code review Co-authored-by:Suraj Patil <surajp815@gmail.com>
-
Eduardo Gonzalez Ponferrada authored
[Flax/run_hybrid_clip] Fix duplicating images when captions_per_image exceeds the number of captions, enable truncation
-
Sylvain Gugger authored
-
- 01 Sep, 2021 14 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
* finish * finish
-
Lysandre Debut authored
-
Lysandre Debut authored
-
Anton Lozhkov authored
* Add Hubert to the auto feature extractor * Fix import structure
-
Sylvain Gugger authored
-
NielsRogge authored
-
SaulLu authored
* add test in trainer and test tokenizer saving wi th trainer * quality * reverse trainer changes * replace test in test_trainer by a test for all the tokenizers * format * add can_save_slow_tokenizer attribute to all tokenizers * fix Herbert * format * Change comment in error * add comments and a new assert * Update src/transformers/models/albert/tokenization_albert_fast.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * change ValueError barthez * change ValueError BigBird * change ValueError Camembert * change ValueError Mbart50 * change ValueError Pegasus * change ValueError ReFormer * change ValueError T5 * change ValueError RoBERTa * XLNET fast * Update tests/test_tokenization_common.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * change `assert` into `self.assertIn` * format Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
-
Li-Huai (Allan) Lin authored
This reverts commit ffecfea9.
-
NielsRogge authored
* Remove disclaimer * First draft * Fix rebase * Improve docs some more * Add inference section * Improve example scripts section * Improve code examples of modeling files * Add docs regarding task prefix * Address @craffel's comments * Apply suggestions from @patrickvonplaten's review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Add suggestions from code review * Apply @sgugger's suggestions * Fix Flax code examples * Fix index.rst Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
donggyukimc authored
-
Sylvain Gugger authored
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * up * correct some bugs * correct model * finish speech2text extension * up * up * up * up * Update utils/custom_init_isort.py * up * up * update with tokenizer * correct old tok * correct old tok * fix bug * up * up * add more tests * up * fix docs * up * fix some more tests * add better config * correct some more things " * fix tests * improve docs * Apply suggestions from code review * Apply suggestions from code review * final fixes * finalize * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * apply suggestions Lysandre and Sylvain * apply nicos suggestions * upload everything * finish Co-authored-by:
Patrick von Platen <patrick@huggingface.co> Co-authored-by: your_github_username <your_github_email> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-