1. 31 Jul, 2023 2 commits
    • LuGY's avatar
      [zero]support no_sync method for zero1 plugin (#4138) · 79cf1b5f
      LuGY authored
      * support no sync for zero1 plugin
      
      * polish
      
      * polish
      79cf1b5f
    • LuGY's avatar
      [zero] refactor low level zero for shard evenly (#4030) · c6ab9698
      LuGY authored
      * refactor low level zero
      
      * fix zero2 and support cpu offload
      
      * avg gradient and modify unit test
      
      * refactor grad store, support layer drop
      
      * refactor bucket store, support grad accumulation
      
      * fix and update unit test of zero and ddp
      
      * compatible with tp, ga and unit test
      
      * fix memory leak and polish
      
      * add zero layer drop unittest
      
      * polish code
      
      * fix import err in unit test
      
      * support diffenert comm dtype, modify docstring style
      
      * polish code
      
      * test padding and fix
      
      * fix unit test of low level zero
      
      * fix pad recording in bucket store
      
      * support some models
      
      * polish
      c6ab9698
  2. 28 Jul, 2023 1 commit
  3. 26 Jul, 2023 21 commits
  4. 21 Jul, 2023 2 commits
  5. 19 Jul, 2023 1 commit
  6. 18 Jul, 2023 1 commit
    • Cuiqing Li's avatar
      [Kernels] added triton-implemented of self attention for colossal-ai (#4241) · 4b977541
      Cuiqing Li authored
      * added softmax kernel
      
      * added qkv_kernel
      
      * added ops
      
      * adding tests
      
      * upload tets
      
      * fix tests
      
      * debugging
      
      * debugging tests
      
      * debugging
      
      * added
      
      * fixed errors
      
      * added softmax kernel
      
      * clean codes
      
      * added tests
      
      * update tests
      
      * update tests
      
      * added attention
      
      * add
      
      * fixed pytest checking
      
      * add cuda check
      
      * fix cuda version
      
      * fix typo
      4b977541
  7. 17 Jul, 2023 2 commits
  8. 12 Jul, 2023 1 commit
  9. 10 Jul, 2023 1 commit
  10. 07 Jul, 2023 2 commits
  11. 04 Jul, 2023 6 commits