1. 14 Jun, 2021 2 commits
    • Nicholas Broad's avatar
      Use text_column_name variable instead of "text" (#12132) · cd7961b6
      Nicholas Broad authored
      
      
      * Use text_column_name variable instead of "text"
      
      `text_column_name` was already defined above where I made the changes and it was also used below where I made changes.
      
      This is a very minor change. If a dataset does not use "text" as the column name, then the `tokenize_function` will now use whatever column is assigned to `text_column_name`. `text_column_name` is just the first column name if "text" is not a column name. It makes the function a little more robust, though I would assume that 90% + of datasets use "text" anyway.
      
      * black formatting
      
      * make style
      Co-authored-by: default avatarNicholas Broad <nicholas@nmbroad.com>
      cd7961b6
    • Sylvain Gugger's avatar
      Don't log anything before logging is setup in examples (#12121) · b8ab5413
      Sylvain Gugger authored
      * Don't log anything before logging is setup in examples
      
      * Last example
      b8ab5413
  2. 10 Jun, 2021 4 commits
  3. 09 Jun, 2021 1 commit
  4. 08 Jun, 2021 3 commits
  5. 01 Jun, 2021 1 commit
  6. 31 May, 2021 1 commit
  7. 25 May, 2021 4 commits
  8. 20 May, 2021 1 commit
  9. 18 May, 2021 4 commits
  10. 17 May, 2021 1 commit
  11. 12 May, 2021 2 commits
  12. 11 May, 2021 2 commits
  13. 10 May, 2021 1 commit
  14. 09 May, 2021 1 commit
  15. 07 May, 2021 2 commits
  16. 06 May, 2021 1 commit
  17. 05 May, 2021 1 commit
  18. 04 May, 2021 1 commit
    • Sylvain Gugger's avatar
      Reproducible checkpoint (#11582) · 6b241e0e
      Sylvain Gugger authored
      * Set generator in dataloader
      
      * Use generator in all random samplers
      
      * Checkpoint all RNG states
      
      * Final version
      
      * Quality
      
      * Test
      
      * Address review comments
      
      * Quality
      
      * Remove debug util
      
      * Add python and numpy RNGs
      
      * Split states in different files in distributed
      
      * Quality
      
      * local_rank for TPUs
      
      * Only use generator when accepted
      
      * Add test
      
      * Set seed to avoid flakiness
      
      * Make test less flaky
      
      * Quality
      6b241e0e
  19. 03 May, 2021 1 commit
  20. 30 Apr, 2021 2 commits
  21. 29 Apr, 2021 1 commit
  22. 26 Apr, 2021 2 commits
  23. 23 Apr, 2021 1 commit