- 01 Oct, 2021 3 commits
-
-
Anton Lozhkov authored
* Restore broken merge * Additional args, DDP, remove CommonLanguage * Update examples for V100, add training results * Style * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Remove custom datasets for simplicity, apply suggestions from code review * Add the attention_mask flag, reorganize README Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Arfon Smith authored
-
Yuta Hayashibe authored
* Removed wrong warning * Raise a warning when `max_length` is given with wrong `truncation` * Update the error message * Update the warning message Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 30 Sep, 2021 8 commits
-
-
Suraj Patil authored
-
Patrick von Platen authored
* update * add to docs and init * make fix-copies
-
Patrick von Platen authored
-
Gunjan Chhablani authored
* Init multibert checkpoint conversion script * Rename conversion script * Fix MultiBerts Conversion Script * Apply suggestions from code review Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
Stas Bekman authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Suraj Patil authored
* use Repository for push_to_hub * update readme * update other flax scripts * update readme * update qa example * fix push_to_hub call * fix typo * fix more typos * update readme * use abosolute path to get repo name * fix glue script
-
- 29 Sep, 2021 10 commits
-
-
Stas Bekman authored
* missing requirement * list both
-
Suraj Patil authored
* add a note about tokenizer * add tips to load model is less RAM * fix link * fix more links
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Matt authored
-
Sylvain Gugger authored
* Fix length of IterableDatasetShard and add test * Add comments
-
Li-Huai (Allan) Lin authored
* Enable readme link synchronization * Style * Reuse regex pattern * Apply suggestions * Update
-
Nishant Prabhu authored
Fix LayoutLM ONNX test error
-
Matt authored
* Keras callback to push to hub each epoch, or after N steps * Reworked the callback to use Repository * Use an Enum for save_strategy * Style pass * Correct type for tokenizer * Update src/transformers/keras_callbacks.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/keras_callbacks.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/keras_callbacks.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/keras_callbacks.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/keras_callbacks.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/keras_callbacks.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Adding print message to the final upload * Adding print message to the final upload * Change how we wait for the last process to finish * is_done is a property, not a method, derp * Docstrings and documentation * Style pass * Style edit * Docstring reformat * Docstring rewrite * Replacing print with internal logger Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
-
- 28 Sep, 2021 2 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
- 27 Sep, 2021 9 commits
-
-
Sylvain Gugger authored
-
Lysandre authored
-
Lysandre authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Matt authored
Allows models to be compiled without a loss, and to use the internal loss computations for training with fit()
-
Sylvain Gugger authored
-
Sylvain Gugger authored
Co-authored-by:
quantitative-technologies <james.hirschorn@quantitative-technologies.com> Co-authored-by:
quantitative-technologies <james.hirschorn@quantitative-technologies.com>
-
Xiaohan Zou authored
* Fix type annotations for `distributed_concat()` * Use Any
-
- 26 Sep, 2021 4 commits
-
-
Anton Lozhkov authored
-
Stas Bekman authored
-
Patrick von Platen authored
[Trainer] Make sure shown loss in distributed training is correctly averaged over all workers (#13681) * push * improve tr loss gather
-
Sylvain Gugger authored
-
- 25 Sep, 2021 2 commits
-
-
Patrick von Platen authored
-
Sylvain Gugger authored
-
- 24 Sep, 2021 2 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
-