1. 18 May, 2021 1 commit
  2. 13 May, 2021 1 commit
  3. 11 May, 2021 2 commits
  4. 10 May, 2021 1 commit
  5. 06 May, 2021 1 commit
  6. 04 May, 2021 1 commit
    • Sylvain Gugger's avatar
      Reproducible checkpoint (#11582) · 6b241e0e
      Sylvain Gugger authored
      * Set generator in dataloader
      
      * Use generator in all random samplers
      
      * Checkpoint all RNG states
      
      * Final version
      
      * Quality
      
      * Test
      
      * Address review comments
      
      * Quality
      
      * Remove debug util
      
      * Add python and numpy RNGs
      
      * Split states in different files in distributed
      
      * Quality
      
      * local_rank for TPUs
      
      * Only use generator when accepted
      
      * Add test
      
      * Set seed to avoid flakiness
      
      * Make test less flaky
      
      * Quality
      6b241e0e
  7. 03 May, 2021 1 commit
  8. 30 Apr, 2021 1 commit
  9. 26 Apr, 2021 4 commits
  10. 23 Apr, 2021 2 commits
  11. 22 Apr, 2021 1 commit
  12. 21 Apr, 2021 1 commit
  13. 20 Apr, 2021 2 commits
  14. 19 Apr, 2021 2 commits
  15. 16 Apr, 2021 1 commit
  16. 15 Apr, 2021 1 commit
  17. 14 Apr, 2021 1 commit
  18. 08 Apr, 2021 4 commits
  19. 31 Mar, 2021 2 commits
  20. 29 Mar, 2021 1 commit
    • pcuenca's avatar
      Allow use of pre-computed lengths when grouping by length. (#10953) · ae6b6963
      pcuenca authored
      A new argument `length_column_name` has been added to
      `TrainingArguments`, with default value `"length"`. If this column
      exists and `group_by_length` is `True`, the train sampler will use
      it for grouping rather than computing it before training starts.
      
      This is an optimization that allows the user to prepare data for fast
      processing, preventing sequential access to the dataset as described in
      issue #10909.
      ae6b6963
  21. 24 Mar, 2021 1 commit
  22. 23 Mar, 2021 1 commit
  23. 22 Mar, 2021 2 commits
  24. 18 Mar, 2021 1 commit
  25. 17 Mar, 2021 3 commits
    • Mansi Mane's avatar
      Smmp batch not divisible by microbatches fix (#10778) · 0282e24e
      Mansi Mane authored
      
      
      * Added debug prints
      
      * Added config
      
      * Added prints
      
      * Added prints
      
      * Added extra samples to SequentialDistributedSampler
      
      * Added extra samples to SequentialDistributedSampler
      
      Updated SequentialDistributedSampler call
      
      * Added deubg prints
      
      * Removed extra prints
      
      * Making predicitons and labels multiple of batchsize
      
      * updated number of microbatches
      
      * Removed extra prints
      
      * Made start_remainder similar to DistributedSamplerWithLoop
      
      * Minor spacing update
      
      * Added debug prints
      
      Added config
      
      Added prints
      
      Added prints
      
      * Added extra samples to SequentialDistributedSampler
      
      Updated SequentialDistributedSampler call
      
      Added extra samples to SequentialDistributedSampler
      
      Added deubg prints
      
      Removed extra prints
      
      Making predicitons and labels multiple of batchsize
      
      updated number of microbatches
      
      Removed extra prints
      
      Squashing redundant commits
      
      * Made start_remainder similar to DistributedSamplerWithLoop
      
      Minor spacing update
      
      Made start_remainder similar to DistributedSamplerWithLoop
      
      * Test and styling
      
      * Rename test
      Co-authored-by: default avatarSylvain Gugger <sylvain.gugger@gmail.com>
      0282e24e
    • Stas Bekman's avatar
    • Stas Bekman's avatar
      [DeepSpeed] improve checkpoint loading code plus tests (#10760) · cd8c93f7
      Stas Bekman authored
      * deepspeed checkpoint loading code plus tests
      
      * style
      
      * style
      cd8c93f7
  26. 16 Mar, 2021 1 commit