1. 05 Mar, 2024 2 commits
  2. 30 Jan, 2024 1 commit
  3. 09 Jan, 2024 1 commit
  4. 08 Jan, 2024 1 commit
    • Xuanlei Zhao's avatar
      [npu] use extension for op builder (#5172) · dd2c28a3
      Xuanlei Zhao authored
      * update extension
      
      * update cpu adam
      
      * update is
      
      * add doc for cpu adam
      
      * update kernel
      
      * update commit
      
      * update flash
      
      * update memory efficient
      
      * update flash attn
      
      * update flash attention loader
      
      * update api
      
      * fix
      
      * update doc
      
      * update example time limit
      
      * reverse change
      
      * fix doc
      
      * remove useless kernel
      
      * fix
      
      * not use warning
      
      * update
      
      * update
      dd2c28a3
  5. 28 Nov, 2023 1 commit
  6. 09 Nov, 2023 1 commit
    • Wenhao Chen's avatar
      [moe]: fix ep/tp tests, add hierarchical all2all (#4982) · 72444127
      Wenhao Chen authored
      * fix: add warning for EP different behavior
      
      * fix: use shard_data in ep & tp model
      
      * to: add used_capacity
      
      * fix: fix router test
      
      * feat: add create_ep_node_group
      
      * feat: add create_ep_hierarchical_group fn
      
      * feat: add HierarchicalAllToAll
      
      * test: add hierarchical all2all test
      
      * fix: fix test errors
      
      * fix: simplify create_ep_hierarchical_group
      
      * fix: add hierarchical_alltoall arg
      
      * fix: fix environ typo
      
      * revert: revert process mesh order
      
      * to: add todo mark
      
      * fix: skip hierarchical_comm if torch < 1.13.1
      72444127
  7. 08 Nov, 2023 1 commit
    • Xuanlei Zhao's avatar
      [moe] support optimizer checkpoint (#5015) · f71e63b0
      Xuanlei Zhao authored
      * Refactor MoE Manager setup method
      
      * unshard optim ckpt
      
      * optim io
      
      * update transformer version
      
      * update requirements
      
      * update ckpt
      
      * update ckpt
      
      * update ckpt
      
      * fix engine
      
      * fix engine
      f71e63b0
  8. 02 Nov, 2023 1 commit