1. 24 Jan, 2024 1 commit
    • Steven Liu's avatar
      [docs] DeepSpeed (#28542) · 738ec75c
      Steven Liu authored
      * config
      
      * optim
      
      * pre deploy
      
      * deploy
      
      * save weights, memory, troubleshoot, non-Trainer
      
      * done
      738ec75c
  2. 12 Jan, 2024 1 commit
  3. 02 Jan, 2024 1 commit
  4. 27 Nov, 2023 1 commit
  5. 10 Oct, 2023 1 commit
  6. 29 Aug, 2023 1 commit
  7. 25 Aug, 2023 1 commit
  8. 02 Aug, 2023 1 commit
  9. 20 Jun, 2023 1 commit
  10. 12 Jun, 2023 1 commit
  11. 22 Mar, 2023 2 commits
  12. 13 Mar, 2023 1 commit
  13. 01 Mar, 2023 1 commit
  14. 27 Feb, 2023 1 commit
  15. 13 Feb, 2023 1 commit
  16. 10 Feb, 2023 1 commit
  17. 07 Nov, 2022 1 commit
  18. 14 Sep, 2022 1 commit
  19. 29 Aug, 2022 1 commit
  20. 04 Apr, 2022 1 commit
  21. 23 Mar, 2022 1 commit
  22. 17 Mar, 2022 1 commit
  23. 12 Mar, 2022 1 commit
    • Stas Bekman's avatar
      [Deepspeed] add support for bf16 mode (#14569) · 580dd87c
      Stas Bekman authored
      
      
      * [WIP] add support for bf16 mode
      
      * prep for bf16
      
      * prep for bf16
      
      * fix; zero2/bf16 is ok
      
      * check bf16 is available
      
      * test fixes
      
      * enable zero3_bf16
      
      * config files
      
      * docs
      
      * split stage_dtype; merge back to non-dtype-specific config file
      
      * fix doc
      
      * cleanup
      
      * cleanup
      
      * bfloat16 => bf16 to match the PR changes
      
      * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/
      
      * test fixes/skipping
      
      * move
      
      * fix
      
      * Update docs/source/main_classes/deepspeed.mdx
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * backticks
      
      * cleanup
      
      * cleanup
      
      * cleanup
      
      * new version
      
      * add note about grad accum in bf16
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      580dd87c
  24. 11 Feb, 2022 1 commit
  25. 10 Feb, 2022 1 commit
  26. 04 Feb, 2022 1 commit
  27. 03 Feb, 2022 1 commit
  28. 31 Jan, 2022 1 commit
  29. 26 Jan, 2022 2 commits
  30. 12 Jan, 2022 1 commit
  31. 28 Dec, 2021 1 commit
    • Sylvain Gugger's avatar
      Doc styler examples (#14953) · b5e2b183
      Sylvain Gugger authored
      * Fix bad examples
      
      * Add black formatting to style_doc
      
      * Use first nonempty line
      
      * Put it at the right place
      
      * Don't add spaces to empty lines
      
      * Better templates
      
      * Deal with triple quotes in docstrings
      
      * Result of style_doc
      
      * Enable mdx treatment and fix code examples in MDXs
      
      * Result of doc styler on doc source files
      
      * Last fixes
      
      * Break copy from
      b5e2b183
  32. 21 Dec, 2021 1 commit