  1. 17 Oct, 2023 1 commit
      [gemini] support gradient accumulation (#4869) · 21ba89ca
      Baizhou Zhang authored
      * add test
      
      * fix no_sync bug in low level zero plugin
      
      * fix test
      
      * add argument for grad accum
      
      * add grad accum in backward hook for gemini
      
      * finish implementation, rewrite tests
      
      * fix test
      
      * skip stuck model in low level zero test
      
      * update doc
      
      * optimize communication & fix gradient checkpoint
      
      * modify doc
      
      * cleaning codes
      
      * update cpu adam fp16 case
  2. 21 Sep, 2023 1 commit
  3. 04 Aug, 2023 1 commit
  4. 23 May, 2023 2 commits
      [doc]fix · 278fcbc4
      jiangmingyan authored
      [doc] update gradient accumulation (#3771) · ef02d7ef
      jiangmingyan authored
      * [doc]update gradient accumulation
      
      * [doc]update gradient accumulation
      
      * [doc]update gradient accumulation
      
      * [doc]update gradient accumulation
      
      * [doc]update gradient accumulation, fix
      
      * [doc]update gradient accumulation, fix
      
      * [doc]update gradient accumulation, fix
      
      * [doc]update gradient accumulation, add sidebars
      
      * [doc]update gradient accumulation, fix
      
      * [doc]update gradient accumulation, fix
      
      * [doc]update gradient accumulation, fix
      
      * [doc]update gradient accumulation, resolve comments
      
      * [doc]update gradient accumulation, resolve comments
      
      * fix