1. 19 Apr, 2022 1 commit
  2. 29 Mar, 2022 1 commit
  3. 23 Mar, 2022 1 commit
    • Sylvain Gugger's avatar
      Reorganize file utils (#16264) · 4975002d
      Sylvain Gugger authored
      * Split file_utils in several submodules
      
      * Fixes
      
      * Add back more objects
      
      * More fixes
      
      * Who exactly decided to import that from there?
      
      * Second suggestion to code with code review
      
      * Revert wront move
      
      * Fix imports
      
      * Adapt all imports
      
      * Adapt all imports everywhere
      
      * Revert this import, will fix in a separate commit
      4975002d
  4. 08 Mar, 2022 1 commit
  5. 23 Feb, 2022 1 commit
  6. 09 Feb, 2022 1 commit
  7. 03 Feb, 2022 1 commit
  8. 02 Feb, 2022 1 commit
    • Ayush Chaurasia's avatar
      Add W&B backend for hyperparameter sweep (#14582) · c74f3d4c
      Ayush Chaurasia authored
      # Add support for W&B hyperparameter sweep
      This PR:
      * allows using wandb for running hyperparameter search.
      * The runs are visualized on W&B sweeps dashboard
      * This supports runnning sweeps on parallel devices, all reporting to the same central dashboard.
      
      ### Usage
      **To run new a hyperparameter search:**
      ```
      trainer.hyperparameter_search(
          backend="wandb", 
          project="transformers_sweep", # name of the project
          n_trials=5,
          metric="eval/loss", # metric to be optimized, default 'eval/loss'. A warning is raised if the passed metric is not found
      )
      ```
      This outputs a sweep id. Eg. `my_project/sweep_id`
      
      **To run sweeps on parallel devices:**
      Just pass sweep id which you want to run parallel
      ```
      trainer.hyperparameter_search(
          backend="wandb", 
          sweep_id = "my_project/sweep_id"
      )
      ```
      c74f3d4c
  9. 13 Jan, 2022 1 commit
  10. 11 Jan, 2022 1 commit
  11. 23 Dec, 2021 1 commit
  12. 16 Dec, 2021 1 commit
  13. 03 Dec, 2021 1 commit
  14. 01 Dec, 2021 1 commit
  15. 18 Nov, 2021 1 commit
  16. 16 Nov, 2021 1 commit
    • Valentin's avatar
      Avoid looping when data exhausted (#14413) · a33168aa
      Valentin authored
      * stop training when a finite IterableDataset is exhausted
      
      when using an iterable dataset num_epochs is set to
      sys.maxsize to make sure all data is consumed
      likewise we want to set max_steps high enough
      but still stop when all data is consumed
      
      (cherry picked from commit 6f0e1d6363153da9051e93acffe1cbab3a3f3b12)
      
      * fix typo flase -> false
      
      * add test for stopping training on exhausted finite iterable dataset
      
      * remove redundant gradient_accumulation_steps
      
      * run make style
      
      reformat training_args docstring
      a33168aa
  17. 02 Nov, 2021 1 commit
  18. 29 Oct, 2021 1 commit
    • Thomas Wang's avatar
      Remove n_ctx from configs (#14165) · 5b45422b
      Thomas Wang authored
      * Remove n_ctx from configs
      
      * Fix GPTJ and OpenAIGPT, both are acceptable breaking changes as there are no configs such that it breaks
      
      * Remove unecessary n_positions from TFOpenAIGPT
      5b45422b
  19. 23 Sep, 2021 1 commit
  20. 17 Sep, 2021 1 commit
  21. 14 Sep, 2021 1 commit
    • Sylvain Gugger's avatar
      Push to hub when saving checkpoints (#13503) · 3081d386
      Sylvain Gugger authored
      * Push to hub when saving checkpoints
      
      * Add model card
      
      * Revert partial model card
      
      * Small fix for checkpoint
      
      * Add tests
      
      * Add documentation
      
      * Fix tests
      
      * Bump huggingface_hub
      
      * Fix test
      3081d386
  22. 09 Sep, 2021 1 commit
  23. 03 Aug, 2021 1 commit
  24. 19 Jul, 2021 1 commit
  25. 23 Jun, 2021 1 commit
  26. 22 Jun, 2021 3 commits
  27. 21 Jun, 2021 1 commit
  28. 15 Jun, 2021 1 commit
  29. 14 Jun, 2021 2 commits
  30. 09 Jun, 2021 1 commit
  31. 01 Jun, 2021 1 commit
  32. 25 May, 2021 1 commit
  33. 24 May, 2021 1 commit
  34. 18 May, 2021 1 commit
  35. 11 May, 2021 1 commit
  36. 04 May, 2021 1 commit
    • Sylvain Gugger's avatar
      Reproducible checkpoint (#11582) · 6b241e0e
      Sylvain Gugger authored
      * Set generator in dataloader
      
      * Use generator in all random samplers
      
      * Checkpoint all RNG states
      
      * Final version
      
      * Quality
      
      * Test
      
      * Address review comments
      
      * Quality
      
      * Remove debug util
      
      * Add python and numpy RNGs
      
      * Split states in different files in distributed
      
      * Quality
      
      * local_rank for TPUs
      
      * Only use generator when accepted
      
      * Add test
      
      * Set seed to avoid flakiness
      
      * Make test less flaky
      
      * Quality
      6b241e0e
  37. 26 Apr, 2021 1 commit