- 12 Oct, 2023 1 commit
-
-
Tom Aarsen authored
Add missing spaces in adjacent strings
-
- 22 Sep, 2023 1 commit
-
-
Phuc Van Phan authored
* feat: adding num_proc to load_dataset * feat: add add_num_proc for run_mlm_flax * feat: add num_proc for bart and t5 * chorse: remove
-
- 11 Sep, 2023 2 commits
-
-
Phuc Van Phan authored
* docs: add space to docs * docs: remove reduntant space
-
Phuc Van Phan authored
-
- 02 Aug, 2023 1 commit
-
-
Yih-Dar authored
* fix * fix * fix * fix * fix * fix * fix --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 28 Jul, 2023 2 commits
-
-
Yih-Dar authored
* pytorch examples * tensorflow examples * flax examples --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Lucain authored
* Fix .push_to_hub and cleanup get_full_repo_name usage * Do not rely on Python bool conversion magic * request changes
-
- 02 May, 2023 1 commit
-
-
Alex Punnen authored
-
- 22 Feb, 2023 1 commit
-
-
Aaron Gokaslan authored
-
- 06 Feb, 2023 1 commit
-
-
Sylvain Gugger authored
* Result of black 23.1 * Update target to Python 3.7 * Switch flake8 to ruff * Configure isort * Configure isort * Apply isort with line limit * Put the right black version * adapt black in check copies * Fix copies
-
- 18 Jan, 2023 1 commit
-
-
Sylvain Gugger authored
* Adapt repository creation to latest hf_hub * Update all examples * Fix other tests, add Flax examples * Address review comments
-
- 13 Oct, 2022 1 commit
-
-
Duong A. Nguyen authored
* Compute true loss * fixup * final * final * final * Update examples/flax/language-modeling/run_bart_dlm_flax.py Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * jax.tree_map => jax.tree_util.tree_map * Compute true loss * final * fixup * final * final * Update examples/flax/language-modeling/run_bart_dlm_flax.py Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * jax.tree_map => jax.tree_util.tree_map Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
-
- 10 Oct, 2022 1 commit
-
-
Kaiyu Yang authored
-
- 09 Sep, 2022 1 commit
-
-
Sanchit Gandhi authored
* [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* * fix double tree_util
-
- 06 Aug, 2022 1 commit
-
-
Julien Chaumond authored
* zero chance anyone's using that constant no? * `transformers-cli login` => `huggingface-cli login` * `transformers-cli repo create` => `huggingface-cli repo create` * `make style`
-
- 01 Aug, 2022 1 commit
-
-
Duong A. Nguyen authored
* add bart pretraining flax script * fixup * add bart pretraining flax script * add BART to README * add BART to README * add BART to README * add BART to README * add BART to README * add bos eos document * Update README.md * Update README.md * Update examples/flax/language-modeling/run_bart_dlm_flax.py Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * final * final * final * remove use_auth_token ing from_config Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
-
- 27 Jul, 2022 2 commits
-
-
Sanchit Gandhi authored
* [Flax] Fix incomplete batches in example scripts * fix dataloader batching * convert jnp batch idxs to np array * add missing `pad_shard_unpad` to final prediction generate step * only `pad_shard_unpad` at inference time * merge conflicts * remove incomplete batch step from eval * fix run_qa.py * add `pad_shard_unpad` to run_flax_ner.py * add `pad_shard_unpad` to run_flax_glue.py * add `pad_shard_unpad` to run_image_classification.py * make style * fix mlm flax eval batches * remove redundant imports
-
Duong A. Nguyen authored
* generalize decay_mask_fn to find all layernorm params * fixup * generalising decay_mask_fn
-
- 19 Jul, 2022 1 commit
-
-
Duong A. Nguyen authored
* remove use_auth_token from from_config * restore use_auth_token from_pretrained run_t5_mlm_flax
-
- 11 Jul, 2022 1 commit
-
-
Duong A. Nguyen authored
* Fix RESOURCE_EXHAUSTED error for large datasets on Flax example scripts * using np.permutation for creating batch_idx * train_samples_idx -> training_samples_idx * fix type hints
-
- 07 Jun, 2022 1 commit
-
-
Sylvain Gugger authored
* Add examples telemetry * Alternative approach * Add to all other examples * Add to templates as well * Put framework separately * Same for TensorFlow
-
- 12 May, 2022 1 commit
-
-
Sylvain Gugger authored
* Black preview * Fixup too! * Fix check copies * Use the same version as the CI * Bump black
-
- 11 Apr, 2022 1 commit
-
-
Ahmed Elnaggar authored
* Fix t5 shard on TPU Pods The current script doesn't work properly on a TPU pod because the global batch is not divided correctly per host. This pull request fixes this issue by dividing the global batch to each host before it is shared on each host. * fix style Co-authored-by:ahmed-elnaggar <ahmed.elnaggar@allianz.com>
-
- 04 Apr, 2022 1 commit
-
-
Karim Foda authored
* fix formatting and remove use_auth * Add use_auth_token to Flax examples
-
- 28 Mar, 2022 1 commit
-
-
Yongrae Jo authored
Fix typo in comment: proprocessed -> preprocessed
-
- 23 Mar, 2022 1 commit
-
-
Sylvain Gugger authored
* Split file_utils in several submodules * Fixes * Add back more objects * More fixes * Who exactly decided to import that from there? * Second suggestion to code with code review * Revert wront move * Fix imports * Adapt all imports * Adapt all imports everywhere * Revert this import, will fix in a separate commit
-
- 08 Mar, 2022 1 commit
-
-
Yeb Havinga authored
Speedup training by using numpy instead of jnp for batch shuffling Co-authored-by:Yeb Havinga <y.t.havinga@mgrid.net>
-
- 04 Mar, 2022 1 commit
-
-
Patrick von Platen authored
-
- 13 Jan, 2022 1 commit
-
-
Stas Bekman authored
-
- 12 Dec, 2021 1 commit
-
-
Suraj Patil authored
* use custom training arguments * update tests
-
- 06 Dec, 2021 1 commit
-
-
Suraj Patil authored
* add test for glue * add tests for clm * fix clm test * add summrization tests * more tests * fix few tests * add test for t5 mlm * fix t5 mlm test * fix tests for multi device * cleanup * ci job * fix metric file name * make t5 more robust
-
- 29 Nov, 2021 1 commit
-
-
Rahul Nadkarni authored
-
- 22 Nov, 2021 1 commit
-
-
Nicholas Broad authored
* remove sum for list flattening * change to chain(*) * make chain object a list * delete empty lines per sgugger's suggestions Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Nicholas Broad <nicholas@nmbroad.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 30 Sep, 2021 1 commit
-
-
Suraj Patil authored
* use Repository for push_to_hub * update readme * update other flax scripts * update readme * update qa example * fix push_to_hub call * fix typo * fix more typos * update readme * use abosolute path to get repo name * fix glue script
-
- 06 Aug, 2021 1 commit
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * update * up * fix * remove f-stings * correct readme * up Co-authored-by:Patrick von Platen <patrick@huggingface.co>
-
- 20 Jul, 2021 1 commit
-
-
fgaim authored
* Allow validation split when loading dataset from local file * Flax clm & t5, enable validation split for datasets loaded from local file
-
- 13 Jul, 2021 1 commit
-
-
Nick Doiron authored
* Allow ByT5 type in Flax T5 script * use T5TokenizerFast * change up tokenizer config * model_args * reorder imports * Update run_t5_mlm_flax.py
-
- 09 Jul, 2021 1 commit
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * fix save problem
-
- 08 Jul, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 07 Jul, 2021 1 commit
-
-
Ibraheem Moosa authored
Successfully logging this requires Pytorch. For the purposes of this script we are not using Pytorch.
-