- 02 Jun, 2022 2 commits
-
-
Stas Bekman authored
* [trainer/deepspeed] load_best_model * to sync with DS PR #1947 * simplify * rework load_best_model test * cleanup * bump deepspeed>=0.6.5 Co-authored-by:Olatunji Ruwase <olruwase@microsoft.com>
-
Sylvain Gugger authored
-
- 26 May, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 23 May, 2022 1 commit
-
-
Sylvain Gugger authored
* Initial work * More or less finished with first draft * Update src/transformers/modeling_utils.py Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/modeling_utils.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Fix randomly initialized weights * Update src/transformers/modeling_utils.py Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comments * Rename DeepSpeed folder to temporarily fix the test issue? * Revert to try if Accelerate fix works * Use latest Accelerate release * Quality and fixes * Style * Quality * Add doc * Test + fix * More blocks Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr>
-
- 20 May, 2022 1 commit
-
-
Sylvain Gugger authored
* Pin dill for now * Try this version? * force install * Actually use dep in testing * Try a larger pin
-
- 12 May, 2022 2 commits
-
-
Sylvain Gugger authored
* Black preview * Fixup too! * Fix check copies * Use the same version as the CI * Bump black
-
Lysandre Debut authored
-
- 10 May, 2022 1 commit
-
-
Stas Bekman authored
* model zoo take 2 * add deberta * new param for zero2 * doc update * doc update * add layoutlm * bump deepspeed * add deberta-v2, funnel, longformer * new models * style * add t5_v1 * update TAPAS status * reorg problematic models * move doc to another PR * style * fix checkpoint check test * making progress on more models running * cleanup * new version * cleanup
-
- 09 May, 2022 1 commit
-
-
Zachary Mueller authored
Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com> - Adds auto_batch_size finder - Moves training loop to an inner training loop
-
- 04 May, 2022 1 commit
-
-
lewtun authored
* Skip RoFormer ONNX test if rjieba not installed * Update deps table * Skip RoFormer serialization test * Fix RoFormer vocab * Add rjieba to CircleCI
-
- 02 May, 2022 2 commits
-
-
Sylvain Gugger authored
* Clean up setup.py * Trigger CI * Upgrade Python used
-
Lysandre Debut authored
* Make sacremoses optional * Pickle
-
- 29 Apr, 2022 1 commit
-
-
Sylvain Gugger authored
* Result of new doc style with fixes * Add last two files * Bump hf-doc-builder
-
- 28 Apr, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 17 Apr, 2022 1 commit
-
-
Sylvain Gugger authored
* Pin Jax to last working release * Try lower * Try lower
-
- 15 Apr, 2022 1 commit
-
-
Stas Bekman authored
* [trainer / deepspeed] fix hyperparameter_search * require optuna * style * oops * add dep in the right place * create deepspeed-testing dep group * Trigger CI
-
- 06 Apr, 2022 1 commit
-
-
Lysandre Debut authored
-
- 01 Apr, 2022 1 commit
-
-
Lysandre Debut authored
* Pin tokenizers version <0.13 * Style
-
- 28 Mar, 2022 1 commit
-
-
Sylvain Gugger authored
* Config update * Use doc-builder styler * Cleanup * Adapt import * We need it there too!
-
- 24 Mar, 2022 1 commit
-
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 23 Mar, 2022 1 commit
-
-
Lysandre Debut authored
* Updates the default branch from master to main * Links from `master` to `main` * Typo * Update examples/flax/README.md Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 18 Mar, 2022 1 commit
-
-
Suraj Patil authored
-
- 12 Mar, 2022 1 commit
-
-
Stas Bekman authored
* [WIP] add support for bf16 mode * prep for bf16 * prep for bf16 * fix; zero2/bf16 is ok * check bf16 is available * test fixes * enable zero3_bf16 * config files * docs * split stage_dtype; merge back to non-dtype-specific config file * fix doc * cleanup * cleanup * bfloat16 => bf16 to match the PR changes * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/ * test fixes/skipping * move * fix * Update docs/source/main_classes/deepspeed.mdx Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * backticks * cleanup * cleanup * cleanup * new version * add note about grad accum in bf16 Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 03 Mar, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 01 Mar, 2022 1 commit
-
-
Lysandre Debut authored
* Add PT + TF automatic builds * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Wrap up Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
- 18 Feb, 2022 1 commit
-
-
SaulLu authored
Very big changes concerning the tokenizer fast of CLIP which did not correspond to the tokenizer slow of CLIP Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 15 Feb, 2022 1 commit
-
-
Alex Hedges authored
`tokenizers` version that supports the feature to choose the direction of truncation
-
- 09 Feb, 2022 1 commit
-
-
Lysandre Debut authored
* Upgrade black to version ~=22.0 * Check copies * Fix code
-
- 28 Jan, 2022 1 commit
-
-
Stas Bekman authored
* [deepspeed] saving checkpoint fallback when fp16 weights aren't saved * Bump required deepspeed version to match usage when saving checkpoints * update version Co-authored-by:Mihai Balint <balint.mihai@gmail.com>
-
- 27 Jan, 2022 2 commits
- 18 Jan, 2022 1 commit
-
-
Patrick von Platen authored
* [ASR pipeline] correct with lm pipeline * improve error
-
- 17 Jan, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 14 Jan, 2022 1 commit
-
-
Joao Gante authored
-
- 30 Dec, 2021 1 commit
-
-
Nicolas Patry authored
* Enabling `tokenizers` upgrade. * Moved ugly comment. * Tokenizers==0.11.1 needs an update to keep borrow checker happy in highly contiguous calls. * Support both 0.11.1 and 0.11.0
-
- 22 Dec, 2021 2 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
-
- 17 Dec, 2021 1 commit
-
-
Patrick von Platen authored
* up * add tokenizer * improve more * finish tokenizer * finish * adapt speech recognition script * adapt convert * more fixes * more fixes * update phonemizer wav2vec2 * better naming * fix more tests * more fixes swedish * correct tests * finish * improve script * remove file * up * lets get those 100 model architectures until the end of the month * make fix-copies * correct more * correct script * more fixes * more fixes * add to docs * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * replace assert * fix copies * fix docs * new try docs * boom boom * update * add phonemizer to audio tests * make fix-copies * up * upload models * some changes * Update tests/test_tokenization_wav2vec2_phoneme.py Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com> * more fixes * remove @ Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com>
-
- 16 Dec, 2021 1 commit
-
-
Sylvain Gugger authored
* Clean up sphinx * Update contributing guide * Update docs README * No example title * Fix copies * Update CONTRIBUTING.md Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
- 15 Dec, 2021 1 commit
-
-
Lysandre authored
-