"model_cards/vscode:/vscode.git/clone" did not exist on "e93ccb3290ec4fb0076495c86af9de33f27048bd"
- 01 Sep, 2022 1 commit
-
-
Albert Villanova del Moral authored
-
- 31 Aug, 2022 2 commits
-
-
Sylvain Gugger authored
* Pin ffspec * Typo
-
Joao Gante authored
-
- 08 Aug, 2022 2 commits
-
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * Fix compatibility with 1.12 * Remove pin from examples requirements * Update torch scatter version * fix torch.onnx.symbolic_opset12 import * Reject bad version Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 05 Aug, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 03 Aug, 2022 1 commit
-
-
Omar Sanseviero authored
* Update pinned hhub version * Make style
-
- 01 Aug, 2022 2 commits
-
-
Sylvain Gugger authored
* Fix ROUGE add example check and update README * Stay consistent in values
-
Sylvain Gugger authored
-
- 27 Jul, 2022 1 commit
-
-
Lysandre authored
-
- 08 Jul, 2022 1 commit
-
-
Sylvain Gugger authored
* Fix slow CI by pinning resampy * Actually put it in the speech dependencies
-
- 05 Jul, 2022 1 commit
-
-
Sanchit Gandhi authored
-
- 28 Jun, 2022 2 commits
-
-
Sylvain Gugger authored
-
Lysandre Debut authored
-
- 27 Jun, 2022 1 commit
-
-
Matt authored
* Add a TF in-graph tokenizer for BERT * Add from_pretrained * Add proper truncation, option handling to match other tokenizers * Add proper imports and guards * Add test, fix all the bugs exposed by said test * Fix truncation of paired texts in graph mode, more test updates * Small fixes, add a (very careful) test for savedmodel * Add tensorflow-text dependency, make fixup * Update documentation * Update documentation * make fixup * Slight changes to tests * Add some docstring examples * Update tests * Update tests and add proper lowercasing/normalization * make fixup * Add docstring for padding! * Mark slow tests * make fixup * Fall back to BertTokenizerFast if BertTokenizer is unavailable * Fall back to BertTokenizerFast if BertTokenizer is unavailable * make fixup * Properly handle tensorflow-text dummies
-
- 17 Jun, 2022 1 commit
-
-
Sourab Mangrulkar authored
* Migrate HFDeepSpeedConfig from trfrs to accelerate * add `accelerate` to testing dep * addressing comments * addressing comments Using `_shared_state` and avoiding object creation. This is necessary as `notebook_launcher` in `launcers.py` checks `len(AcceleratorState._shared_state)>0` to throw an error. * resolving comments 1. Use simple API from accelerate to manage the deepspeed config integration 2. Update the related documentation * reverting changes and addressing comments * docstring correction * addressing nits * addressing nits * addressing nits 3 * bumping up the accelerate version to 0.10.0 * resolving import * update setup.py to include deepspeed dependencies * Update dependency_versions_table.py * fixing imports * reverting changes to CI dependencies for "run_tests_pipelines_tf*" tests These changes didn't help with resolving the failures and I believe this needs to be addressed in another PR. * removing `accelerate` as hard dependency Resolves issues related to CI Tests * adding `accelerate` as dependency for building docs resolves failure in Build PR Documentation test * adding `accelerate` as dependency in "dev" to resolve doc build issue * resolving comments 1. adding `accelerate` to extras["all"] 2. Including check for accelerate too before import HFDeepSpeedConfig from there Co-Authored-By:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * resolving comments Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 16 Jun, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 02 Jun, 2022 2 commits
-
-
Stas Bekman authored
* [trainer/deepspeed] load_best_model * to sync with DS PR #1947 * simplify * rework load_best_model test * cleanup * bump deepspeed>=0.6.5 Co-authored-by:Olatunji Ruwase <olruwase@microsoft.com>
-
Sylvain Gugger authored
-
- 26 May, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 23 May, 2022 1 commit
-
-
Sylvain Gugger authored
* Initial work * More or less finished with first draft * Update src/transformers/modeling_utils.py Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/modeling_utils.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Fix randomly initialized weights * Update src/transformers/modeling_utils.py Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Address review comments * Rename DeepSpeed folder to temporarily fix the test issue? * Revert to try if Accelerate fix works * Use latest Accelerate release * Quality and fixes * Style * Quality * Add doc * Test + fix * More blocks Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr>
-
- 20 May, 2022 1 commit
-
-
Sylvain Gugger authored
* Pin dill for now * Try this version? * force install * Actually use dep in testing * Try a larger pin
-
- 12 May, 2022 2 commits
-
-
Sylvain Gugger authored
* Black preview * Fixup too! * Fix check copies * Use the same version as the CI * Bump black
-
Lysandre Debut authored
-
- 10 May, 2022 1 commit
-
-
Stas Bekman authored
* model zoo take 2 * add deberta * new param for zero2 * doc update * doc update * add layoutlm * bump deepspeed * add deberta-v2, funnel, longformer * new models * style * add t5_v1 * update TAPAS status * reorg problematic models * move doc to another PR * style * fix checkpoint check test * making progress on more models running * cleanup * new version * cleanup
-
- 09 May, 2022 1 commit
-
-
Zachary Mueller authored
Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com> - Adds auto_batch_size finder - Moves training loop to an inner training loop
-
- 04 May, 2022 1 commit
-
-
lewtun authored
* Skip RoFormer ONNX test if rjieba not installed * Update deps table * Skip RoFormer serialization test * Fix RoFormer vocab * Add rjieba to CircleCI
-
- 02 May, 2022 2 commits
-
-
Sylvain Gugger authored
* Clean up setup.py * Trigger CI * Upgrade Python used
-
Lysandre Debut authored
* Make sacremoses optional * Pickle
-
- 29 Apr, 2022 1 commit
-
-
Sylvain Gugger authored
* Result of new doc style with fixes * Add last two files * Bump hf-doc-builder
-
- 28 Apr, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 17 Apr, 2022 1 commit
-
-
Sylvain Gugger authored
* Pin Jax to last working release * Try lower * Try lower
-
- 15 Apr, 2022 1 commit
-
-
Stas Bekman authored
* [trainer / deepspeed] fix hyperparameter_search * require optuna * style * oops * add dep in the right place * create deepspeed-testing dep group * Trigger CI
-
- 06 Apr, 2022 1 commit
-
-
Lysandre Debut authored
-
- 01 Apr, 2022 1 commit
-
-
Lysandre Debut authored
* Pin tokenizers version <0.13 * Style
-
- 28 Mar, 2022 1 commit
-
-
Sylvain Gugger authored
* Config update * Use doc-builder styler * Cleanup * Adapt import * We need it there too!
-
- 24 Mar, 2022 1 commit
-
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 23 Mar, 2022 1 commit
-
-
Lysandre Debut authored
* Updates the default branch from master to main * Links from `master` to `main` * Typo * Update examples/flax/README.md Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 18 Mar, 2022 1 commit
-
-
Suraj Patil authored
-
- 12 Mar, 2022 1 commit
-
-
Stas Bekman authored
* [WIP] add support for bf16 mode * prep for bf16 * prep for bf16 * fix; zero2/bf16 is ok * check bf16 is available * test fixes * enable zero3_bf16 * config files * docs * split stage_dtype; merge back to non-dtype-specific config file * fix doc * cleanup * cleanup * bfloat16 => bf16 to match the PR changes * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/ * test fixes/skipping * move * fix * Update docs/source/main_classes/deepspeed.mdx Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * backticks * cleanup * cleanup * cleanup * new version * add note about grad accum in bf16 Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-