1. 12 Mar, 2022 1 commit
    • Stas Bekman's avatar
      [Deepspeed] add support for bf16 mode (#14569) · 580dd87c
      Stas Bekman authored
      
      
      * [WIP] add support for bf16 mode
      
      * prep for bf16
      
      * prep for bf16
      
      * fix; zero2/bf16 is ok
      
      * check bf16 is available
      
      * test fixes
      
      * enable zero3_bf16
      
      * config files
      
      * docs
      
      * split stage_dtype; merge back to non-dtype-specific config file
      
      * fix doc
      
      * cleanup
      
      * cleanup
      
      * bfloat16 => bf16 to match the PR changes
      
      * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/
      
      * test fixes/skipping
      
      * move
      
      * fix
      
      * Update docs/source/main_classes/deepspeed.mdx
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * backticks
      
      * cleanup
      
      * cleanup
      
      * cleanup
      
      * new version
      
      * add note about grad accum in bf16
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      580dd87c
  2. 02 Mar, 2022 1 commit
  3. 23 Feb, 2022 1 commit
  4. 03 Feb, 2022 1 commit
  5. 13 Jan, 2022 1 commit
  6. 07 Dec, 2021 1 commit
  7. 23 Nov, 2021 1 commit
  8. 11 Nov, 2021 1 commit
  9. 08 Nov, 2021 1 commit
  10. 30 Aug, 2021 1 commit
  11. 23 Jul, 2021 1 commit
  12. 14 Jul, 2021 1 commit
  13. 13 Jul, 2021 1 commit
  14. 22 Jun, 2021 1 commit
  15. 08 Jun, 2021 2 commits
  16. 04 Jun, 2021 1 commit
  17. 02 Jun, 2021 2 commits
  18. 01 Jun, 2021 1 commit
  19. 21 May, 2021 1 commit
  20. 06 May, 2021 1 commit
  21. 30 Apr, 2021 1 commit
    • Stas Bekman's avatar
      [DeepSpeed] fp32 support (#11499) · 4e7bf94e
      Stas Bekman authored
      * prep for deepspeed==0.3.16
      
      * new version
      
      * too soon
      
      * support and test fp32 mode
      
      * troubleshooting doc start
      
      * workaround no longer needed
      
      * add fp32 doc
      
      * style
      
      * cleanup, add tf32 note
      
      * clarify
      
      * release was made
      4e7bf94e
  22. 26 Apr, 2021 3 commits
  23. 21 Apr, 2021 1 commit
  24. 14 Apr, 2021 1 commit
  25. 13 Apr, 2021 1 commit
  26. 08 Apr, 2021 1 commit