1. 19 Jan, 2024 1 commit
  2. 12 Jan, 2024 1 commit
  3. 11 Jan, 2024 1 commit
    • Alex Hedges's avatar
      Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15
      Alex Hedges authored
      While using `run_clm.py`,[^1] I noticed that some files were being added
      to my global cache, not the local cache. I set the `cache_dir` parameter
      for the one call to `evaluate.load()`, which partially solved the
      problem. I figured that while I was fixing the one script upstream, I
      might as well fix the problem in all other example scripts that I could.
      
      There are still some files being added to my global cache, but this
      appears to be a bug in `evaluate` itself. This commit at least moves
      some of the files into the local cache, which is better than before.
      
      To create this PR, I made the following regex-based transformation:
      `evaluate\.load\((.*?)\)` -> `evaluate\.load\($1,
      cache_dir=model_args.cache_dir\)`. After using that, I manually fixed
      all modified files with `ruff` serving as useful guidance. During the
      process, I removed one existing usage of the `cache_dir` parameter in a
      script that did not have a corresponding `--cache-dir` argument
      declared.
      
      [^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
      v4.34.1 of the library. For the original code, see the following URL:
      https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.
      95091e15
  4. 13 Dec, 2023 1 commit
  5. 17 Nov, 2023 1 commit
  6. 16 Nov, 2023 1 commit
  7. 15 Nov, 2023 1 commit
  8. 02 Nov, 2023 1 commit
  9. 27 Oct, 2023 1 commit
  10. 19 Oct, 2023 1 commit
    • Matt's avatar
      Pin Keras for now (#26904) · cbd278f0
      Matt authored
      * Pin Keras for now out of paranoia
      
      * Add the keras pin to _tests_requirements.txt too
      
      * Make sure the Keras version matches the TF one
      
      * make fixup
      cbd278f0
  11. 12 Oct, 2023 1 commit
  12. 04 Oct, 2023 1 commit
    • Phuc Van Phan's avatar
      refactor: change default block_size (#26229) · 6015f91a
      Phuc Van Phan authored
      * refactor: change default block_size
      
      * fix: return tf to origin
      
      * fix: change files to origin
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * rebase
      
      * refactor: add min block_size to files
      
      * reformat: add min block_size for run_clm tf
      6015f91a
  13. 03 Oct, 2023 1 commit
  14. 18 Sep, 2023 1 commit
  15. 11 Sep, 2023 2 commits
  16. 04 Sep, 2023 1 commit
  17. 01 Sep, 2023 1 commit
  18. 22 Aug, 2023 1 commit
  19. 21 Aug, 2023 1 commit
  20. 15 Aug, 2023 1 commit
    • Zach Mueller's avatar
      Make training args fully immutable (#25435) · ca514992
      Zach Mueller authored
      * Make training args fully immutable
      
      * Working tests, PyTorch
      
      * In test_trainer
      
      * during testing
      
      * Use proper dataclass way
      
      * Fix test
      
      * Another one
      
      * Fix tf
      
      * Lingering slow
      
      * Exception
      
      * Clean
      ca514992
  21. 08 Aug, 2023 1 commit
  22. 07 Aug, 2023 1 commit
    • Jackmin801's avatar
      Allow `trust_remote_code` in example scripts (#25248) · 14510938
      Jackmin801 authored
      * pytorch examples
      
      * pytorch mim no trainer
      
      * cookiecutter
      
      * flax examples
      
      * missed line in pytorch run_glue
      
      * tensorflow examples
      
      * tensorflow run_clip
      
      * tensorflow run_mlm
      
      * tensorflow run_ner
      
      * tensorflow run_clm
      
      * pytorch example from_configs
      
      * pytorch no trainer examples
      
      * Revert "tensorflow run_clip"
      
      This reverts commit 261f86ac1f1c9e05dd3fd0291e1a1f8e573781d5.
      
      * fix: duplicated argument
      14510938
  23. 02 Aug, 2023 1 commit
  24. 28 Jul, 2023 1 commit
  25. 17 Jul, 2023 1 commit
  26. 03 Jul, 2023 1 commit
  27. 23 Jun, 2023 1 commit
    • Matt's avatar
      Improved keras imports (#24448) · 8e164c54
      Matt authored
      * An end to accursed version-specific imports
      
      * No more K.is_keras_tensor() either
      
      * Update dependency tables
      
      * Use a cleaner call context function getter
      
      * Add a cap to <2.14
      
      * Add cap to examples requirements too
      8e164c54
  28. 07 Jun, 2023 1 commit
  29. 02 Jun, 2023 1 commit
  30. 26 May, 2023 1 commit
  31. 09 May, 2023 2 commits
  32. 08 May, 2023 1 commit
  33. 20 Apr, 2023 1 commit
  34. 17 Apr, 2023 2 commits
  35. 14 Apr, 2023 1 commit
    • Sayak Paul's avatar
      [Examples] TPU-based training of a language model using TensorFlow (#21657) · 390e121f
      Sayak Paul authored
      
      
      * add: tokenizer training script for TF TPU LM training.
      
      * add: script for preparing the TFRecord shards.
      
      * add: sequence of execution to readme.
      
      * remove limit from the tfrecord shard name.
      
      * Add initial train_model.py
      
      * Add basic training arguments and model init
      
      * Get up to the point of writing the data collator
      
      * Pushing progress so far!
      
      * Complete first draft of model training code
      
      * feat: grouping of texts efficiently.
      Co-authored-by: default avatarMatt <rocketknight1@gmail.com>
      
      * Add proper masking collator and get training loop working
      
      * fix: things.
      
      * Read sample counts from filenames
      
      * Read sample counts from filenames
      
      * Draft README
      
      * Improve TPU warning
      
      * Use distribute instead of distribute.experimental
      
      * Apply suggestions from code review
      Co-authored-by: default avatarMatt <Rocketknight1@users.noreply.github.com>
      
      * Modularize loading and add MLM probability as arg
      
      * minor refactoring to better use the cli args.
      
      * readme fillup.
      
      * include tpu and inference sections in the readme.
      
      * table of contents.
      
      * parallelize maps.
      
      * polish readme.
      
      * change script name to run_mlm.py
      
      * address PR feedback (round I).
      
      ---------
      Co-authored-by: default avatarMatt <rocketknight1@gmail.com>
      Co-authored-by: default avatarMatt <Rocketknight1@users.noreply.github.com>
      390e121f
  36. 13 Apr, 2023 1 commit
  37. 24 Mar, 2023 1 commit