1. 08 Apr, 2024 1 commit
  2. 25 Mar, 2024 1 commit
    • flybird11111's avatar
      [shardformer]Fix lm parallel. (#5480) · 0688d92e
      flybird11111 authored
      * fix
      
      * padding vocab_size when using pipeline parallellism
      
      padding vocab_size when using pipeline parallellism
      
      fix
      
      fix
      
      * fix
      
      * fix
      
      fix
      
      fix
      
      * fix gather output
      
      * fix
      
      * fix
      
      * fix
      
      fix resize embedding
      
      fix resize embedding
      
      * fix resize embedding
      
      fix
      
      * revert
      
      * revert
      
      * revert
      
      * fix lm forward distribution
      
      * fix
      
      * test ci
      
      * fix
      0688d92e
  3. 20 Oct, 2023 1 commit
  4. 19 Sep, 2023 1 commit
  5. 06 Apr, 2023 1 commit
  6. 26 Jul, 2022 1 commit
    • ver217's avatar
      [nvme] CPUAdam and HybridAdam support NVMe offload (#1360) · c415240d
      ver217 authored
      * impl nvme optimizer
      
      * update cpu adam
      
      * add unit test
      
      * update hybrid adam
      
      * update docstr
      
      * add TODOs
      
      * update CI
      
      * fix CI
      
      * fix CI
      
      * fix CI path
      
      * fix CI path
      
      * fix CI path
      
      * fix install tensornvme
      
      * fix CI
      
      * fix CI path
      
      * fix CI env variables
      
      * test CI
      
      * test CI
      
      * fix CI
      
      * fix nvme optim __del__
      
      * fix adam __del__
      
      * fix nvme optim
      
      * fix CI env variables
      
      * fix nvme optim import
      
      * test CI
      
      * test CI
      
      * fix CI
      c415240d