1. 12 Jan, 2024 1 commit
  2. 11 Jan, 2024 1 commit
    • Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15
      Alex Hedges authored
      While using `run_clm.py`,[^1] I noticed that some files were being added
      to my global cache, not the local cache. I set the `cache_dir` parameter
      for the one call to `evaluate.load()`, which partially solved the
      problem. I figured that while I was fixing the one script upstream, I
      might as well fix the problem in all other example scripts that I could.
      
      There are still some files being added to my global cache, but this
      appears to be a bug in `evaluate` itself. This commit at least moves
      some of the files into the local cache, which is better than before.
      
      To create this PR, I made the following regex-based transformation:
      search for `evaluate\.load\((.*?)\)` and replace it with
      `evaluate.load($1, cache_dir=model_args.cache_dir)`. After applying it, I
      manually cleaned up all modified files, with `ruff` serving as useful
      guidance. During the process, I removed one existing usage of the
      `cache_dir` parameter in a script that did not have a corresponding
      `--cache-dir` argument declared.
      
      [^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
      v4.34.1 of the library. For the original code, see the following URL:
      https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.
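      A minimal sketch of the change applied across the example scripts: the
      script's cache directory is passed through to `evaluate.load()` so metric
      files land in the local cache rather than the global one. The "accuracy"
      metric and the literal path are illustrative; the real scripts pass
      `model_args.cache_dir`.

      ```python
      import evaluate

      cache_dir = "./cache"  # illustrative; the scripts use model_args.cache_dir

      # Before: metric files were written to the global Hugging Face cache.
      # metric = evaluate.load("accuracy")

      # After: metric files are stored under the script's local cache directory.
      metric = evaluate.load("accuracy", cache_dir=cache_dir)
      ```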
  3. 13 Dec, 2023 1 commit
  4. 11 Dec, 2023 1 commit
    • fix no sequence length models error (#27522) · 4850aaba
      Adam Louly authored
      * fix no sequence length models error
      
      * block size check
      
      ---------
      
      Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
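      A minimal sketch, assuming a run_clm-style script, of the kind of block-size
      guard this commit describes: handling models whose config does not expose a
      usable maximum sequence length. The names (`max_position_embeddings`, the
      1024 fallback) follow common Transformers conventions but are illustrative
      here, not the exact upstream diff.

      ```python
      from types import SimpleNamespace


      def pick_block_size(tokenizer_max_length: int, config, fallback: int = 1024) -> int:
          # Some configs define no max_position_embeddings at all, which would
          # otherwise crash the comparison against the tokenizer's limit.
          max_pos = getattr(config, "max_position_embeddings", None)
          if max_pos is None:
              return min(tokenizer_max_length, fallback)
          if tokenizer_max_length > max_pos:
              # Tokenizers without a real limit report a huge sentinel value.
              return min(fallback, max_pos)
          return tokenizer_max_length


      print(pick_block_size(int(1e30), SimpleNamespace()))                              # -> 1024
      print(pick_block_size(int(1e30), SimpleNamespace(max_position_embeddings=2048)))  # -> 1024
      ```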
  5. 30 Nov, 2023 1 commit
  6. 27 Nov, 2023 1 commit
  7. 20 Nov, 2023 1 commit
  8. 17 Nov, 2023 1 commit
  9. 15 Nov, 2023 3 commits
  10. 09 Nov, 2023 3 commits
  11. 08 Nov, 2023 3 commits
  12. 02 Nov, 2023 1 commit
  13. 31 Oct, 2023 1 commit
  14. 30 Oct, 2023 1 commit
  15. 29 Oct, 2023 1 commit
  16. 27 Oct, 2023 1 commit
  17. 24 Oct, 2023 1 commit
  18. 23 Oct, 2023 1 commit
  19. 12 Oct, 2023 1 commit
  20. 11 Oct, 2023 1 commit
  21. 10 Oct, 2023 1 commit
  22. 04 Oct, 2023 1 commit
    • refactor: change default block_size (#26229) · 6015f91a
      Phuc Van Phan authored
      * refactor: change default block_size
      
      * fix: return tf to origin
      
      * fix: change files to origin
      
      * rebase
      
      * refactor: add min block_size to files
      
      * reformat: add min block_size for run_clm tf
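      A minimal sketch of why a minimum block_size check matters in a
      run_clm-style script: grouping texts into blocks of size 0 or 1 produces
      degenerate training examples, so the requested value is clamped. The floor
      value and function name are illustrative, not the library's exact code.

      ```python
      def clamp_block_size(requested: int, model_max_length: int, minimum: int = 2) -> int:
          # Reject block sizes too small to form a next-token prediction pair.
          if requested < minimum:
              raise ValueError(f"block_size must be at least {minimum}, got {requested}")
          # Never exceed what the tokenizer/model can actually handle.
          return min(requested, model_max_length)


      print(clamp_block_size(512, 1024))   # -> 512
      print(clamp_block_size(4096, 1024))  # -> 1024
      ```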
  23. 03 Oct, 2023 1 commit
  24. 28 Sep, 2023 1 commit
  25. 12 Sep, 2023 1 commit
  26. 11 Sep, 2023 2 commits
  27. 05 Sep, 2023 2 commits
  28. 04 Sep, 2023 1 commit
  29. 01 Sep, 2023 1 commit
  30. 23 Aug, 2023 1 commit
  31. 21 Aug, 2023 1 commit
  32. 15 Aug, 2023 1 commit
    • Make training args fully immutable (#25435) · ca514992
      Zach Mueller authored
      * Make training args fully immutable
      
      * Working tests, PyTorch
      
      * In test_trainer
      
      * during testing
      
      * Use proper dataclass way
      
      * Fix test
      
      * Another one
      
      * Fix tf
      
      * Lingering slow
      
      * Exception
      
      * Clean
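      A minimal sketch, assuming a frozen-after-init approach, of how an
      arguments dataclass can be made immutable so that later code (such as a
      trainer) cannot silently mutate it. The `_frozen` flag and class name are
      illustrative, not the actual TrainingArguments implementation.

      ```python
      from dataclasses import FrozenInstanceError, dataclass, field


      @dataclass
      class ImmutableArgsSketch:
          learning_rate: float = 5e-5
          num_train_epochs: int = 3
          _frozen: bool = field(default=False, init=False, repr=False)

          def __post_init__(self):
              # Derived values would be computed here; afterwards the instance is sealed.
              self._frozen = True

          def __setattr__(self, name, value):
              if getattr(self, "_frozen", False):
                  raise FrozenInstanceError(f"cannot assign to field {name!r} after init")
              super().__setattr__(name, value)


      args = ImmutableArgsSketch(learning_rate=3e-5)
      try:
          args.learning_rate = 1e-4  # tests would build a new instance instead
      except FrozenInstanceError as err:
          print(err)
      ```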