1. 05 Jul, 2024 1 commit
    • Anton Vlasjuk's avatar
      Fix galore lr display with schedulers (#31710) · a01b033c
      Anton Vlasjuk authored
      * fix galore lr display with lr schedulers
      
      * style
      
      * add some tests to check for displayed lrs
      
      * copy-paste err for warmup steps
      
      * standardize the default lr to be only in the optimizer
      
      * trying out my luck with the reads
      a01b033c
  2. 28 Jun, 2024 1 commit
  3. 26 Jun, 2024 1 commit
  4. 17 Jun, 2024 2 commits
    • Albert Villanova del Moral's avatar
      Pass datasets trust_remote_code (#31406) · a14b055b
      Albert Villanova del Moral authored
      * Pass datasets trust_remote_code
      
      * Pass trust_remote_code in more tests
      
      * Add trust_remote_dataset_code arg to some tests
      
      * Revert "Temporarily pin datasets upper version to fix CI"
      
      This reverts commit b7672826.
      
      * Pass trust_remote_code in librispeech_asr_dummy docstrings
      
      * Revert "Pin datasets<2.20.0 for examples"
      
      This reverts commit 833fc17a.
      
      * Pass trust_remote_code to all examples
      
      * Revert "Add trust_remote_dataset_code arg to some tests" to research_projects
      
      * Pass trust_remote_code to tests
      
      * Pass trust_remote_code to docstrings
      
      * Fix flax examples tests requirements
      
      * Pass trust_remote_dataset_code arg to tests
      
      * Replace trust_remote_dataset_code with trust_remote_code in one example
      
      * Fix duplicate trust_remote_code
      
      * Replace args.trust_remote_dataset_code with args.trust_remote_code
      
      * Replace trust_remote_dataset_code with trust_remote_code in parser
      
      * Replace trust_remote_dataset_code with trust_remote_code in dataclasses
      
      * Replace trust_remote_dataset_code with trust_remote_code arg
      a14b055b
    • Bastien Le Chenadec's avatar
      Support multiple validation datasets when `dataloader_persistent_workers=True` (#30627) · 485fd814
      Bastien Le Chenadec authored
      * Support multiple validation datasets when dataloader_persistent_workers=True
      
      * Test support of multiple validation datasets
      485fd814
  5. 07 Jun, 2024 1 commit
  6. 29 May, 2024 1 commit
  7. 21 May, 2024 3 commits
  8. 20 May, 2024 1 commit
  9. 13 May, 2024 1 commit
    • fxmarty's avatar
      CI: update to ROCm 6.0.2 and test MI300 (#30266) · 37bba2a3
      fxmarty authored
      
      
      * update to ROCm 6.0.2 and test MI300
      
      * add callers for mi300
      
      * update dockerfile
      
      * fix trainer tests
      
      * remove apex
      
      * style
      
      * Update tests/trainer/test_trainer_seq2seq.py
      
      * Update tests/trainer/test_trainer_seq2seq.py
      
      * Update tests/trainer/test_trainer_seq2seq.py
      
      * Update tests/trainer/test_trainer_seq2seq.py
      
      * update to torch 2.3
      
      * add workflow dispatch target
      
      * we may need branches: mi300-ci after all
      
      * nit
      
      * fix docker build
      
      * nit
      
      * add check runner
      
      * remove docker-gpu
      
      * fix issues
      
      * fix
      
      ---------
      Co-authored-by: default avatarYih-Dar <2521628+ydshieh@users.noreply.github.com>
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      37bba2a3
  10. 08 May, 2024 1 commit
    • Anton Vlasjuk's avatar
      Immutability for data collators (#30603) · 71c19850
      Anton Vlasjuk authored
      * immutability fix for seq2seq as well as immutability tests for the collators
      
      * ensure we don't act on none labels and formatting
      
      * remove tf/pt in respective tests as they are not required
      
      * more type error fixes tf/np
      
      * remove todo
      
      * apply suggestions from code review
      
      * formatting / style
      71c19850
  11. 06 May, 2024 2 commits
  12. 30 Apr, 2024 1 commit
    • Anton Vlasjuk's avatar
      Fix seq2seq collator padding (#30556) · 9112520b
      Anton Vlasjuk authored
      * fix seq2seq data collator to respect the given padding strategy
      
      further added tests for the seq2seq data collator in the style of the `data_collator_for_token_classification` (pt, tf, np)
      
      * formatting and change bool equals "==" to "is"
      
      * add missed return types in tests
      
      * update numpy test as it can handle unequal shapes, not like pt or tf
      9112520b
  13. 25 Apr, 2024 1 commit
  14. 19 Apr, 2024 1 commit
  15. 18 Apr, 2024 1 commit
  16. 17 Apr, 2024 1 commit
    • Pavel Iakubovskii's avatar
      Add strategy to store results in evaluation loop (#30267) · c15aad09
      Pavel Iakubovskii authored
      * Add evaluation loop container for interm. results
      
      * Add tests for EvalLoopContainer
      
      * Formatting
      
      * Fix padding_index in test and typo
      
      * Move EvalLoopContainer to pr_utils to avoid additional imports
      
      * Fix `eval_do_concat_batches` arg description
      
      * Fix EvalLoopContainer import
      c15aad09
  17. 16 Apr, 2024 1 commit
  18. 31 Mar, 2024 1 commit
  19. 28 Mar, 2024 2 commits
  20. 26 Mar, 2024 2 commits
  21. 20 Mar, 2024 1 commit
  22. 19 Mar, 2024 1 commit
  23. 15 Mar, 2024 1 commit
  24. 08 Mar, 2024 1 commit
  25. 04 Mar, 2024 1 commit
  26. 01 Mar, 2024 1 commit
    • Zach Mueller's avatar
      Fix deprecated arg issue (#29372) · 1a7c117d
      Zach Mueller authored
      * Fix deprecated arg issue
      
      * Trainer check too
      
      * Check for dict or dataclass
      
      * Simplify, make config always AcceleratorConfig
      
      * Upstream to Trainer
      1a7c117d
  27. 20 Feb, 2024 2 commits
  28. 16 Feb, 2024 2 commits
  29. 14 Feb, 2024 3 commits
  30. 22 Jan, 2024 1 commit