- 06 Aug, 2021 1 commit
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * update * up * fix * remove f-strings * correct readme * up Co-authored-by: Patrick von Platen <patrick@huggingface.co>
-
- 02 Aug, 2021 1 commit
-
-
Chungman Lee authored
* fix typo in example/text-classification README * add space to align the table
-
- 30 Jul, 2021 1 commit
-
-
Stefan Schweter authored
-
- 27 Jul, 2021 1 commit
-
-
Stefan Schweter authored
* readme: fix retrieval of vocab size for flax clm example * examples: fix flax clm example when using training/evaluation files
-
- 20 Jul, 2021 2 commits
-
-
Patrick von Platen authored
cc @patil-suraj
-
fgaim authored
* Allow validation split when loading dataset from local file * Flax clm & t5, enable validation split for datasets loaded from local file
-
- 14 Jul, 2021 1 commit
-
-
Patrick von Platen authored
-
- 13 Jul, 2021 1 commit
-
-
Nick Doiron authored
* Allow ByT5 type in Flax T5 script * use T5TokenizerFast * change up tokenizer config * model_args * reorder imports * Update run_t5_mlm_flax.py
-
- 12 Jul, 2021 1 commit
-
-
Bhadresh Savani authored
-
- 09 Jul, 2021 1 commit
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * fix save problem
-
- 08 Jul, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 07 Jul, 2021 3 commits
-
-
Ibraheem Moosa authored
Successfully logging this requires PyTorch. For the purposes of this script we are not using PyTorch.
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * finish
-
Suraj Patil authored
* add adafactor * Update examples/flax/language-modeling/run_mlm_flax.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
- 06 Jul, 2021 1 commit
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * up * up * correct * upload Co-authored-by: Patrick von Platen <patrick@huggingface.co>
-
- 05 Jul, 2021 4 commits
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * up
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * push
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * add logging steps * correct training scripts * correct readme * correct
-
Suraj Patil authored
* begin script * clean example, add readme * update readme * remove decay mask * remove masking * update readme & make flake happy
-
- 29 Jun, 2021 2 commits
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * finish * finish * correct style
-
Suraj Patil authored
* add readme * update readme and add requirements * Update examples/flax/summarization/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
- 28 Jun, 2021 2 commits
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * add length computation * finish masking * finish * upload * fix some bugs * finish * fix dependency table * correct tensorboard * Apply suggestions from code review * correct processing * slight change init * correct some more mistakes * apply suggestions * improve readme * fix indent * Apply suggestions from code review Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com> * correct tokenizer * finish * finish * finish * finish Co-authored-by: Patrick von Platen <patrick@huggingface.co> Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * finish * correct summary writer * correct push to hub * fix indent * finish * finish * finish * finish * finish Co-authored-by: Patrick von Platen <patrick@huggingface.co>
-
- 25 Jun, 2021 1 commit
-
-
Stas Bekman authored
-
- 24 Jun, 2021 1 commit
-
-
Suraj Patil authored
-
- 23 Jun, 2021 1 commit
-
-
Suraj Patil authored
* add summarization script * fix arguments, preprocessing, metrics * add generation and metrics * auto model, prediction loop * prettify * label smoothing * address Sylvain's and Patrick's suggestions * dynamically import shift_tokens_right * fix shift_tokens_right_fn call
-
- 15 Jun, 2021 1 commit
-
-
Avital Oliver authored
Use a released version of optax rather than installing from Git
-
- 14 Jun, 2021 3 commits
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * add colab links
-
Suraj Patil authored
* add readme for flax clm * use section link for tokenizer * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * update metrics Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * upload * Apply suggestions from code review * Apply suggestions from code review * Apply suggestions from code review * Update examples/flax/language-modeling/README.md * add more info * finish * fix Co-authored-by: Patrick von Platen <patrick@huggingface.co>
-
- 11 Jun, 2021 1 commit
-
-
Suraj Patil authored
* first draft * max_seq_length => block_size * fix arg names * fix typos * fix loss calculation * add max examples, fix train eval steps, metrics * optimizer mask * fix perplexity, metric logging * fix logging * data_collator => data_loader * refactor loss_fn * support single GPU * pass distributed to write_metric * fix jitting * fix single device training * fix single device metrics * close inner progress bars once finished * add overwrite_cache arg * fix dataset caching issue * add more logs * few small fixes * address Nicholas's suggestions * fix docstr * address Patrick's suggestions * make flake happy * pass new new_dropout_rng to apply_gradients * reset train metrics after every epoch * remove distributed logic, small fixes
-
- 09 Jun, 2021 1 commit
-
-
Suraj Patil authored
-
- 03 Jun, 2021 2 commits
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * finish refactor Co-authored-by: Patrick von Platen <patrick@huggingface.co>
-
Nicholas Vadivelu authored
* Fix weight decay masking in `run_flax_glue.py` Issues with the previous implementation: - The `dict` from `traverse_util.flatten_dict` has keys which are tuples of strings, not one long string with the path separated by periods. - `optax.masked` applies the transformation wherever the mask is True, so the masks are flipped. - Flax's LayerNorm calls the scale parameter `scale`, not `weight`. * Fix formatting with black * adapt results Co-authored-by: Patrick von Platen <patrick@huggingface.co>
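The three points in this commit message can be illustrated with a small sketch. It uses only the standard library: `flatten_dict` and `unflatten_dict` below are simplified stand-ins written for this example (assumptions, not the Flax implementations), the parameter tree is a hypothetical toy, and `decay_mask_fn` is an illustrative name rather than a quote from the script.

```python
def flatten_dict(tree, prefix=()):
    """Flatten a nested dict into {tuple_of_keys: leaf}.

    Simplified stand-in for flax.traverse_util.flatten_dict: note that the
    keys are tuples of strings, not dotted path strings.
    """
    flat = {}
    for key, value in tree.items():
        path = prefix + (key,)
        if isinstance(value, dict):
            flat.update(flatten_dict(value, path))
        else:
            flat[path] = value
    return flat


def unflatten_dict(flat):
    """Inverse of flatten_dict: rebuild the nested dict from tuple keys."""
    tree = {}
    for path, value in flat.items():
        node = tree
        for key in path[:-1]:
            node = node.setdefault(key, {})
        node[path[-1]] = value
    return tree


def decay_mask_fn(params):
    """Return a tree of booleans: True where weight decay SHOULD be applied.

    Biases and LayerNorm scales are excluded. The LayerNorm parameter is
    matched by the name `scale`, as the commit message describes for Flax.
    """
    flat = flatten_dict(params)
    flat_mask = {
        path: path[-1] != "bias" and path[-2:] != ("LayerNorm", "scale")
        for path in flat
    }
    return unflatten_dict(flat_mask)


# Hypothetical toy parameter tree, for illustration only.
params = {
    "encoder": {
        "dense": {"kernel": 1.0, "bias": 0.0},
        "LayerNorm": {"scale": 1.0, "bias": 0.0},
    }
}
mask = decay_mask_fn(params)
```

In a real training script, a function like this would presumably be handed to the optimizer as its mask (for example the `mask` argument of `optax.adamw`), so that decay touches only the entries marked True.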
-
- 31 May, 2021 1 commit
-
-
Nicholas Vadivelu authored
* Remove redundant `nn.log_softmax` in `run_flax_glue.py` `optax.softmax_cross_entropy` expects unnormalized logits, and so it already calls `nn.log_softmax`, so I believe it is not needed here. `nn.log_softmax` is idempotent so mathematically it shouldn't have made a difference. * Remove unused 'flax.linen' import
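The idempotence claim in this message can be checked numerically. The sketch below is plain Python, not the code from `run_flax_glue.py`: `log_softmax` and `softmax_cross_entropy` are small reimplementations written for this example, and the logits and labels are made-up values.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_sum_exp for x in logits]


def softmax_cross_entropy(logits, labels):
    """Cross entropy that normalizes its input internally.

    Written in the spirit of optax.softmax_cross_entropy, which takes
    unnormalized logits; this is an illustrative reimplementation.
    """
    return -sum(t * lp for t, lp in zip(labels, log_softmax(logits)))


# Made-up example values.
logits = [2.0, -1.0, 0.5]
labels = [0.0, 1.0, 0.0]

once = log_softmax(logits)
twice = log_softmax(once)  # applying it again should change nothing

loss_raw = softmax_cross_entropy(logits, labels)
loss_prenormalized = softmax_cross_entropy(once, labels)
```

Because the exponentials of `once` already sum to one, their log-sum-exp is zero and the second application subtracts nothing. That is why the extra call was redundant rather than incorrect: both losses agree up to floating-point rounding.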
-
- 26 May, 2021 1 commit
-
-
Avital Oliver authored
-
- 24 May, 2021 1 commit
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * change pytorch import to flax import
-
- 21 May, 2021 3 commits
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * add flax glue link
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * correct best seed for flax fine-tuning Co-authored-by: Patrick von Platen <patrick@huggingface.co>
-
Patrick von Platen authored
* speed up flax glue * remove unnecessary line * remove folder * remove run in loop Co-authored-by: Patrick von Platen <patrick@huggingface.co>
-