- 19 Apr, 2022 1 commit
-
-
Jeevesh Juneja authored
* Correct Logging of Eval metric to Tensorboard An empty dictionary ``eval_metrics`` was being logged, is replaced by ``eval_metric`` which is the output dictionary of ``metric.compute()``. * Remove unused variable
-
- 11 Apr, 2022 2 commits
-
-
Zachary Mueller authored
Move declaration of log streams to before tests, so that results won't get compounded on top of each other
-
Ahmed Elnaggar authored
* Fix t5 shard on TPU Pods The current script doesn't work properly on a TPU pod because the global batch is not divided correctly per host. This pull request fixes this issue by dividing the global batch to each host before it is shared on each host. * fix style Co-authored-by:ahmed-elnaggar <ahmed.elnaggar@allianz.com>
-
- 06 Apr, 2022 1 commit
-
-
Lysandre Debut authored
-
- 04 Apr, 2022 1 commit
-
-
Karim Foda authored
* fix formatting and remove use_auth * Add use_auth_token to Flax examples
-
- 30 Mar, 2022 1 commit
-
-
Stas Bekman authored
* [examples] max samples can't be bigger than then len of dataset * do tf and flax
-
- 28 Mar, 2022 1 commit
-
-
Yongrae Jo authored
Fix typo in comment: proprocessed -> preprocessed
-
- 25 Mar, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 23 Mar, 2022 2 commits
-
-
Sylvain Gugger authored
* Split file_utils in several submodules * Fixes * Add back more objects * More fixes * Who exactly decided to import that from there? * Second suggestion to code with code review * Revert wront move * Fix imports * Adapt all imports * Adapt all imports everywhere * Revert this import, will fix in a separate commit
-
Lysandre Debut authored
* Updates the default branch from master to main * Links from `master` to `main` * Typo * Update examples/flax/README.md Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 08 Mar, 2022 1 commit
-
-
Yeb Havinga authored
Speedup training by using numpy instead of jnp for batch shuffling Co-authored-by:Yeb Havinga <y.t.havinga@mgrid.net>
-
- 04 Mar, 2022 1 commit
-
-
Patrick von Platen authored
-
- 03 Mar, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 23 Feb, 2022 1 commit
-
-
Lysandre Debut authored
* Per-folder tests reorganization Co-authored-by:
sgugger <sylvain.gugger@gmail.com> Co-authored-by:
Stas Bekman <stas@stason.org>
-
- 01 Feb, 2022 2 commits
-
-
Sylvain Gugger authored
* Harder check for IndexErrors in QA scripts * Make test stronger
-
Kamal Raj authored
fix typo
-
- 31 Jan, 2022 1 commit
-
-
Suraj Patil authored
* add a section about GPUs * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
- 27 Jan, 2022 2 commits
- 19 Jan, 2022 1 commit
-
-
Kamal Raj authored
* refactor run_flax_glue.py * updated readme * rm unused import and args typo fix * refactor * make consistent arg name across task * has_tensorboard check * argparse -> argument dataclasses * refactor according to review * fix
-
- 13 Jan, 2022 1 commit
-
-
Stas Bekman authored
-
- 06 Jan, 2022 1 commit
-
-
Yih-Dar authored
* add image captioning example * update README * fix style & quality * simplify * apply review suggestions * Apply suggestions from code review Co-authored-by:
Suraj Patil <surajp815@gmail.com> * Apply suggestions from code review Co-authored-by:
Suraj Patil <surajp815@gmail.com> * Apply review suggestions * add comments about using np instead jax array * remove unused lines * add model creation script * only support from_pretrained * fix style * fix * not use cache_dir when creating model * fix tokenizer creation * update README * fix quality * apply suggestion * simplify some blocks * Update examples/flax/image-captioning/README.md * Update examples/flax/image-captioning/run_image_captioning_flax.py Co-authored-by:
Suraj Patil <surajp815@gmail.com> * apply suggestion Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
Suraj Patil <surajp815@gmail.com>
-
- 22 Dec, 2021 2 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
-
- 15 Dec, 2021 3 commits
-
-
Lysandre authored
-
Lysandre authored
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 14 Dec, 2021 1 commit
-
-
Benjamin Minixhofer authored
* use jax and jnp instead of numpy in data_loader * return batches as np.ndarray
-
- 12 Dec, 2021 1 commit
-
-
Suraj Patil authored
* use custom training arguments * update tests
-
- 09 Dec, 2021 2 commits
- 06 Dec, 2021 4 commits
-
-
Suraj Patil authored
* make tensorboard optional * update test_fetcher for flax examples * make the tests slow
-
Suraj Patil authored
-
Julien Chaumond authored
* Replace outdated model tags with their now-canonical pipeline types * spam the CI till it's green
-
Suraj Patil authored
* add test for glue * add tests for clm * fix clm test * add summrization tests * more tests * fix few tests * add test for t5 mlm * fix t5 mlm test * fix tests for multi device * cleanup * ci job * fix metric file name * make t5 more robust
-
- 29 Nov, 2021 2 commits
-
-
Rahul Nadkarni authored
-
Kamal Raj authored
* step start from 1 * Updated cur_step calcualtion
-
- 22 Nov, 2021 1 commit
-
-
Nicholas Broad authored
* remove sum for list flattening * change to chain(*) * make chain object a list * delete empty lines per sgugger's suggestions Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Nicholas Broad <nicholas@nmbroad.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 09 Nov, 2021 1 commit
-
-
Suraj Patil authored
-
- 28 Oct, 2021 1 commit
-
-
Lysandre authored
-