- 30 Jan, 2024 1 commit
-
-
Matt authored
* Port core files + ESM (because ESM code is odd) * Search-replace in modelling code * Fix up transfo_xl as well * Fix other core files + tests (still need to add correct import to tests) * Fix cookiecutter * make fixup, fix imports in some more core files * Auto-add imports to tests * Cleanup, add imports to sagemaker tests * Use correct exception for importing tf_keras * Fixes in modeling_tf_utils * make fixup * Correct version parsing code * Ensure the pipeline tests correctly revert to float32 after each test * Ensure the pipeline tests correctly revert to float32 after each test * More tf.keras -> keras * Add dtype cast * Better imports of tf_keras * Add a cast for tf.assign, just in case * Fix callback imports
-
- 29 Jan, 2024 1 commit
-
-
Klaus Hipp authored
-
- 19 Jan, 2024 1 commit
-
-
Amy Roberts authored
-
- 12 Jan, 2024 1 commit
-
-
Joao Gante authored
-
- 11 Jan, 2024 1 commit
-
-
Alex Hedges authored
While using `run_clm.py`,[^1] I noticed that some files were being added to my global cache, not the local cache. I set the `cache_dir` parameter for the one call to `evaluate.load()`, which partially solved the problem. I figured that while I was fixing the one script upstream, I might as well fix the problem in all other example scripts that I could. There are still some files being added to my global cache, but this appears to be a bug in `evaluate` itself. This commit at least moves some of the files into the local cache, which is better than before.

To create this PR, I made the following regex-based transformation: `evaluate\.load\((.*?)\)` -> `evaluate\.load\($1, cache_dir=model_args.cache_dir\)`. After using that, I manually fixed all modified files with `ruff` serving as useful guidance. During the process, I removed one existing usage of the `cache_dir` parameter in a script that did not have a corresponding `--cache-dir` argument declared.

[^1]: I specifically used `pytorch/language-modeling/run_clm.py` from v4.34.1 of the library. For the original code, see the following URL: https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.
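The regex transformation described in this commit message can be sketched in Python roughly as follows. This is an illustration only, not the tooling the author used: the helper name `add_cache_dir` is hypothetical, and the replacement uses Python's `\1` group syntax instead of the `$1` form quoted in the message.

```python
import re

# Pattern taken from the commit message; it captures the existing arguments
# of an evaluate.load(...) call, non-greedily, up to the first ")".
PATTERN = re.compile(r"evaluate\.load\((.*?)\)")

# Python's re.sub refers to the first capture group as \1 (the message uses $1).
REPLACEMENT = r"evaluate.load(\1, cache_dir=model_args.cache_dir)"


def add_cache_dir(source: str) -> str:
    """Append cache_dir=model_args.cache_dir to each evaluate.load(...) call.

    Hypothetical helper for illustration; it rewrites text only and does not
    check that `model_args` exists in the rewritten script.
    """
    return PATTERN.sub(REPLACEMENT, source)


before = 'metric = evaluate.load("accuracy")'
print(add_cache_dir(before))
# prints: metric = evaluate.load("accuracy", cache_dir=model_args.cache_dir)
```

Because the pattern is non-greedy, it stops at the first closing parenthesis, so a call whose arguments themselves contain parentheses would be rewritten incorrectly. Reviewing each modified file by hand, as the message describes, guards against that.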
-
- 13 Dec, 2023 1 commit
-
-
Lysandre authored
-
- 17 Nov, 2023 1 commit
-
-
V.Prasanna kumar authored
Fixed the broken links belonging to the dataset library of transformers
-
- 16 Nov, 2023 1 commit
-
-
Matt authored
* Move the TF pin for 2.15 * make fixup
-
- 15 Nov, 2023 1 commit
-
-
Matt authored
* Remove the torch main_process_first context manager from TF examples * Correctly set num_beams=1 in our examples, and add a guard in GenerationConfig.validate() * Update src/transformers/generation/configuration_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 02 Nov, 2023 1 commit
-
-
Lysandre authored
-
- 27 Oct, 2023 1 commit
-
-
Lucain authored
-
- 19 Oct, 2023 1 commit
-
-
Matt authored
* Pin Keras for now out of paranoia * Add the keras pin to _tests_requirements.txt too * Make sure the Keras version matches the TF one * make fixup
-
- 12 Oct, 2023 1 commit
-
-
Tom Aarsen authored
Add missing spaces in adjacent strings
-
- 04 Oct, 2023 1 commit
-
-
Phuc Van Phan authored
* refactor: change default block_size * fix: return tf to origin * fix: change files to origin * rebase * rebase * rebase * rebase * rebase * rebase * rebase * rebase * refactor: add min block_size to files * reformat: add min block_size for run_clm tf
-
- 03 Oct, 2023 1 commit
-
-
Lysandre authored
-
- 18 Sep, 2023 1 commit
-
-
Nino Risteski authored
Fixed a few typos
-
- 11 Sep, 2023 2 commits
-
-
Phuc Van Phan authored
* docs: add space to docs * docs: remove redundant space
-
Phuc Van Phan authored
-
- 04 Sep, 2023 1 commit
-
-
Lysandre authored
-
- 01 Sep, 2023 1 commit
-
-
Zach Mueller authored
* Revert frozen training arguments * TODO
-
- 22 Aug, 2023 1 commit
-
-
Matt authored
* Update the TF pin and see if anything breaks * make fixup * make fixup * make fixup
-
- 21 Aug, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 15 Aug, 2023 1 commit
-
-
Zach Mueller authored
* Make training args fully immutable * Working tests, PyTorch * In test_trainer * during testing * Use proper dataclass way * Fix test * Another one * Fix tf * Lingering slow * Exception * Clean
-
- 08 Aug, 2023 1 commit
-
-
Yih-Dar authored
* add missing tokens * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 07 Aug, 2023 1 commit
-
-
Jackmin801 authored
* pytorch examples * pytorch mim no trainer * cookiecutter * flax examples * missed line in pytorch run_glue * tensorflow examples * tensorflow run_clip * tensorflow run_mlm * tensorflow run_ner * tensorflow run_clm * pytorch example from_configs * pytorch no trainer examples * Revert "tensorflow run_clip" This reverts commit 261f86ac1f1c9e05dd3fd0291e1a1f8e573781d5. * fix: duplicated argument
-
- 02 Aug, 2023 1 commit
-
-
Yih-Dar authored
* fix * fix * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 28 Jul, 2023 1 commit
-
-
Yih-Dar authored
* pytorch examples * tensorflow examples * flax examples --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 17 Jul, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 03 Jul, 2023 1 commit
-
-
Gema Parreño authored
* fix loading dataset link * Update examples/tensorflow/translation/run_translation.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update examples/tensorflow/translation/run_translation.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 23 Jun, 2023 1 commit
-
-
Matt authored
* An end to accursed version-specific imports * No more K.is_keras_tensor() either * Update dependency tables * Use a cleaner call context function getter * Add a cap to <2.14 * Add cap to examples requirements too
-
- 07 Jun, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 02 Jun, 2023 1 commit
-
-
Matt authored
* Add an option to reduce compile() console spam * Add annotations to the example scripts * Add notes to the quicktour docs as well * minor fix
-
- 26 May, 2023 1 commit
-
-
Ran Ran authored
* Fix no such file or directory error * Address comment * Fix formatting issue
-
- 09 May, 2023 2 commits
-
-
Sylvain Gugger authored
-
Nicolas Patry authored
* Proposed fix for TF example now running on safetensors. * Adding more warnings and returning keys. * Trigger CI * Trigger CI --------- Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>
-
- 08 May, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 20 Apr, 2023 1 commit
-
-
Sayak Paul authored
minor refactoring to allow compatible datasets to work.
-
- 17 Apr, 2023 2 commits
-
-
Zachary Mueller authored
Remove accelerate from tf
-
Matt authored
-
- 14 Apr, 2023 1 commit
-
-
Sayak Paul authored
* add: tokenizer training script for TF TPU LM training. * add: script for preparing the TFRecord shards. * add: sequence of execution to readme. * remove limit from the tfrecord shard name. * Add initial train_model.py * Add basic training arguments and model init * Get up to the point of writing the data collator * Pushing progress so far! * Complete first draft of model training code * feat: grouping of texts efficiently. Co-authored-by: Matt <rocketknight1@gmail.com> * Add proper masking collator and get training loop working * fix: things. * Read sample counts from filenames * Read sample counts from filenames * Draft README * Improve TPU warning * Use distribute instead of distribute.experimental * Apply suggestions from code review Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Modularize loading and add MLM probability as arg * minor refactoring to better use the cli args. * readme fillup. * include tpu and inference sections in the readme. * table of contents. * parallelize maps. * polish readme. * change script name to run_mlm.py * address PR feedback (round I). --------- Co-authored-by: Matt <rocketknight1@gmail.com> Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
-