1. 26 Apr, 2024 1 commit
  2. 25 Apr, 2024 1 commit
    • [shardformer] fix chatglm implementation (#5644) · bbb2c21f
      Hongxin Liu authored
      * [shardformer] fix chatglm policy
      
      * [shardformer] fix chatglm flash attn
      
      * [shardformer] update readme
      
      * [shardformer] fix chatglm init
      
      * [shardformer] fix chatglm test
      
      * [pipeline] fix chatglm merge batch
  3. 23 Apr, 2024 1 commit
  4. 08 Apr, 2024 1 commit
  5. 25 Mar, 2024 2 commits
  6. 22 Mar, 2024 1 commit
  7. 20 Mar, 2024 1 commit
  8. 18 Mar, 2024 1 commit
  9. 05 Mar, 2024 3 commits
  10. 29 Feb, 2024 1 commit
  11. 19 Feb, 2024 2 commits
  12. 25 Jan, 2024 1 commit
  13. 09 Jan, 2024 1 commit
  14. 08 Jan, 2024 1 commit
    • [doc] SwiftInfer release (#5236) · 7bc6969c
      binmakeswell authored
      * [doc] SwiftInfer release
  15. 07 Jan, 2024 1 commit
  16. 15 Dec, 2023 1 commit
  17. 28 Nov, 2023 2 commits
  18. 27 Nov, 2023 1 commit
  19. 24 Nov, 2023 1 commit
  20. 22 Nov, 2023 1 commit
  21. 21 Nov, 2023 1 commit
  22. 31 Oct, 2023 1 commit
  23. 18 Oct, 2023 1 commit
  24. 17 Oct, 2023 1 commit
    • [gemini] support gradient accumulation (#4869) · 21ba89ca
      Baizhou Zhang authored
      * add test
      
      * fix no_sync bug in low level zero plugin
      
      * fix test
      
      * add argument for grad accum
      
      * add grad accum in backward hook for gemini
      
      * finish implementation, rewrite tests
      
      * fix test
      
      * skip stuck model in low level zero test
      
      * update doc
      
      * optimize communication & fix gradient checkpoint
      
      * modify doc
      
      * cleaning codes
      
      * update cpu adam fp16 case
  25. 10 Oct, 2023 1 commit
    • [doc] update advanced tutorials, training gpt with hybrid parallelism (#4866) · 6a21f96a
      flybird11111 authored
      * [doc] update advanced tutorials, training gpt with hybrid parallelism
      
      * update vit tutorials
      
      * update en/train_vit_with_hybrid_parallel.py
      
      * fix
      
      * resolve comments
      
      * fix
  26. 05 Oct, 2023 1 commit
  27. 27 Sep, 2023 2 commits
  28. 26 Sep, 2023 2 commits
  29. 25 Sep, 2023 1 commit
  30. 21 Sep, 2023 2 commits
  31. 20 Sep, 2023 1 commit
  32. 19 Sep, 2023 1 commit