- 09 Aug, 2021 4 commits
-
-
Lysandre Debut authored
* Add to ONNX docs * Add MBART example * Update docs/source/serialization.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Lysandre Debut authored
* Add MBART to models exportable with ONNX * unittest mock * Add tests * Misc fixes
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * improve docs for clm * speed-ups * correct t5 example as well * push final touches * Update examples/flax/language-modeling/README.md * correct docs for mlm * Update examples/flax/language-modeling/README.md Co-authored-by:Patrick von Platen <patrick@huggingface.co>
-
abhishek thakur authored
-
- 08 Aug, 2021 2 commits
-
-
Patrick von Platen authored
-
lewtun authored
* Use min version for huggingface-hub dependency * Update dependency version table
-
- 06 Aug, 2021 6 commits
-
-
Sylvain Gugger authored
* Fix tied weights on TPU * Manually tie weights in no trainer examples * Fix for test * One last missing * Gettning owned by my scripts * Address review comments * Fix test * Fix tests * Fix reformer tests
-
Lysandre Debut authored
-
Michael Benayoun authored
T5 with past ONNX export, and more explicit past_key_values inputs and outputs names for ONNX model Authored-by:Michael Benayoun <michael@huggingface.co>
-
Michael Benayoun authored
Changed the way dynamically inserted submodules are named and the method used to insert them Authored-by:Michael Benayoun <michael@huggingface.co>
-
Sylvain Gugger authored
* Initial work * All auto models * All tf auto models * All flax auto models * Tokenizers * Add feature extractors * Fix typos * Fix other typo * Use the right config * Remove old mapping names and update logic in AutoTokenizer * Update check_table * Fix copies and check_repo script * Fix last test * Add back name * clean up * Update template * Update template * Forgot a ) * Use alternative to fixup * Fix TF model template * Address review comments * Address review comments * Style
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * update * up * fix * remove f-stings * correct readme * up Co-authored-by:Patrick von Platen <patrick@huggingface.co>
-
- 05 Aug, 2021 4 commits
-
-
Patrick von Platen authored
* finish PR * add tests * correct tests * finish * correct other flax tests * better naming * correct naming * finish * apply sylvains suggestions
-
Nils Reimers authored
-
Michael Benayoun authored
GPT-Neo ONNX export and task / feature refactoring Authored-by:Michael Benayoun <michael@huggingface.co>
-
Sasha Luccioni authored
Updating the import for load_dataset
-
- 04 Aug, 2021 10 commits
-
-
NielsRogge authored
* First pass * Make conversion script work * Improve conversion script * Fix bug, conversion script working * Improve conversion script, implement BEiTFeatureExtractor * Make conversion script work based on URL * Improve conversion script * Add tests, add documentation * Fix bug in conversion script * Fix another bug * Add support for converting masked image modeling model * Add support for converting masked image modeling * Fix bug * Add print statement for debugging * Fix another bug * Make conversion script finally work for masked image modeling models * Move id2label for datasets to JSON files on the hub * Make sure id's are read in as integers * Add integration tests * Make style & quality * Fix test, add BEiT to README * Apply suggestions from @sgugger's review * Apply suggestions from code review * Make quality * Replace nielsr by microsoft in tests, add docs * Rename BEiT to Beit * Minor fix * Fix docs of BeitForMaskedImageModeling Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Lysandre Debut authored
-
Arman Cohan authored
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * fix flax docs * correct more docs in flax * another correction * fix flax docs * Apply suggestions from code review
-
Patrick von Platen authored
* finish PR * finish mt5 * push * up * Update tests/test_modeling_flax_mt5.py Co-authored-by:
Suraj Patil <surajp815@gmail.com> Co-authored-by:
Suraj Patil <surajp815@gmail.com>
-
Patrick von Platen authored
* [Flax] Align device name in docs * make style * fix import error
-
Aktsvigun authored
* pad_to_multiple_of added to DataCollatorForWholeWordMask * pad_to_multiple_of added to DataCollatorForWholeWordMask Co-authored-by:Цвигун Аким Олегович <AOTsvigun@sberbank.ru>
-
Lysandre Debut authored
* Return raw outputs in TextClassificationPipeline * Style * Support for problem type * Update src/transformers/pipelines/text_classification.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply Nicolas' comments Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
* Fix from_pretrained with corrupted state_dict * Adapt test * Use better checkpoint * Style * Clean up
-
NielsRogge authored
-
- 03 Aug, 2021 3 commits
-
-
Michal Szutenberg authored
This change enables tf.keras.mixed_precision with bf16
-
Philip May authored
* fix #12970 * Update tests/test_trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/test_trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/test_trainer.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove unnecessary issue link * fix test formatting Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
-
- 02 Aug, 2021 3 commits
-
-
Chungman Lee authored
* fix typo in example/text-classification README * add space to align the table
-
Sylvain Gugger authored
-
Tadej Svetina authored
-
- 01 Aug, 2021 1 commit
-
-
Alex Hedges authored
-
- 30 Jul, 2021 6 commits
-
-
Stefan Schweter authored
-
Sylvain Gugger authored
-
Kevin Canwen Xu authored
* Add multilingual documentation support * Add multilingual documentation support * make style * make style * revert
-
wulu473 authored
Co-authored-by:Lukas Wutschitz <lukas.wutschitz@microsoft.com>
-
harshithapv authored
* minor change to log azureml only for rank 0 * fix typo
-
21jun authored
help for `ModelArguments.gradient_checkpointing` should be "If True, use gradient checkpointing to save memory at the expense of slower backward pass." not "Whether to freeze the feature extractor layers of the model." (which is duplicated from `freeze_feature_extractor` arg)
-
- 29 Jul, 2021 1 commit
-
-
Kevin Canwen Xu authored
* Add CpmTokenizerFast * Fix isort * Overwrite _batch_encode_plus
-