1. 23 Feb, 2022 1 commit
  2. 03 Feb, 2022 1 commit
  3. 07 Dec, 2021 1 commit
  4. 23 Nov, 2021 1 commit
  5. 11 Nov, 2021 1 commit
  6. 08 Nov, 2021 1 commit
  7. 30 Aug, 2021 1 commit
  8. 23 Jul, 2021 1 commit
  9. 14 Jul, 2021 1 commit
  10. 13 Jul, 2021 1 commit
  11. 22 Jun, 2021 1 commit
  12. 08 Jun, 2021 2 commits
  13. 04 Jun, 2021 1 commit
  14. 02 Jun, 2021 2 commits
  15. 01 Jun, 2021 1 commit
  16. 21 May, 2021 1 commit
  17. 06 May, 2021 1 commit
  18. 30 Apr, 2021 1 commit
    • Stas Bekman's avatar
      [DeepSpeed] fp32 support (#11499) · 4e7bf94e
      Stas Bekman authored
      * prep for deepspeed==0.3.16
      
      * new version
      
      * too soon
      
      * support and test fp32 mode
      
      * troubleshooting doc start
      
      * workaround no longer needed
      
      * add fp32 doc
      
      * style
      
      * cleanup, add tf32 note
      
      * clarify
      
      * release was made
      4e7bf94e
  19. 26 Apr, 2021 3 commits
  20. 21 Apr, 2021 1 commit
  21. 14 Apr, 2021 1 commit
  22. 13 Apr, 2021 1 commit
  23. 08 Apr, 2021 2 commits
    • Stas Bekman's avatar
      [tests] relocate core integration tests (#11146) · 66446909
      Stas Bekman authored
      * relocate core integration tests
      
      * add sys.path context manager
      
      * cleanup
      
      * try
      
      * try2
      
      * fix path
      
      * doc
      
      * style
      
      * add dep
      
      * add 2 more deps
      66446909
    • Stas Bekman's avatar
      [DeepSpeed] ZeRO Stage 3 (#10753) · c6d66484
      Stas Bekman authored
      
      
      * synced gpus
      
      * fix
      
      * fix
      
      * need to use t5-small for quality tests
      
      * notes
      
      * complete merge
      
      * fix a disappearing std stream problem
      
      * start zero3 tests
      
      * wip
      
      * tune params
      
      * sorting out the pre-trained model loading
      
      * reworking generate loop wip
      
      * wip
      
      * style
      
      * fix tests
      
      * split the tests
      
      * refactor tests
      
      * wip
      
      * parameterized
      
      * fix
      
      * workout the resume from non-ds checkpoint pass + test
      
      * cleanup
      
      * remove no longer needed code
      
      * split getter/setter functions
      
      * complete the docs
      
      * suggestions
      
      * gpus and their compute capabilities link
      
      * Apply suggestions from code review
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      
      * style
      
      * remove invalid paramgd
      
      * automatically configure zero3 params that rely on hidden size
      
      * make _get_resized_embeddings zero3-aware
      
      * add test exercising resize_token_embeddings()
      
      * add docstring
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      c6d66484
  24. 17 Mar, 2021 1 commit
  25. 16 Mar, 2021 1 commit
  26. 15 Mar, 2021 1 commit
  27. 24 Feb, 2021 1 commit
  28. 22 Feb, 2021 1 commit
  29. 18 Feb, 2021 1 commit
  30. 17 Feb, 2021 1 commit
  31. 15 Feb, 2021 1 commit
  32. 11 Feb, 2021 1 commit
  33. 10 Feb, 2021 1 commit
  34. 08 Feb, 2021 2 commits