1. 25 Jan, 2024 1 commit
  2. 09 Jan, 2024 1 commit
  3. 08 Jan, 2024 1 commit
    • Xuanlei Zhao's avatar
      [npu] use extension for op builder (#5172) · dd2c28a3
      Xuanlei Zhao authored
      * update extension
      
      * update cpu adam
      
      * update is
      
      * add doc for cpu adam
      
      * update kernel
      
      * update commit
      
      * update flash
      
      * update memory efficient
      
      * update flash attn
      
      * update flash attention loader
      
      * update api
      
      * fix
      
      * update doc
      
      * update example time limit
      
      * reverse change
      
      * fix doc
      
      * remove useless kernel
      
      * fix
      
      * not use warning
      
      * update
      
      * update
      dd2c28a3
  4. 16 Oct, 2023 1 commit
  5. 19 Sep, 2023 1 commit
  6. 05 Jun, 2023 1 commit
    • Hongxin Liu's avatar
      [bf16] add bf16 support (#3882) · ae02d4e4
      Hongxin Liu authored
      * [bf16] add bf16 support for fused adam (#3844)
      
      * [bf16] fused adam kernel support bf16
      
      * [test] update fused adam kernel test
      
      * [test] update fused adam test
      
      * [bf16] cpu adam and hybrid adam optimizers support bf16 (#3860)
      
      * [bf16] implement mixed precision mixin and add bf16 support for low level zero (#3869)
      
      * [bf16] add mixed precision mixin
      
      * [bf16] low level zero optim support bf16
      
      * [text] update low level zero test
      
      * [text] fix low level zero grad acc test
      
      * [bf16] add bf16 support for gemini (#3872)
      
      * [bf16] gemini support bf16
      
      * [test] update gemini bf16 test
      
      * [doc] update gemini docstring
      
      * [bf16] add bf16 support for plugins (#3877)
      
      * [bf16] add bf16 support for legacy zero (#3879)
      
      * [zero] init context support bf16
      
      * [zero] legacy zero support bf16
      
      * [test] add zero bf16 test
      
      * [doc] add bf16 related docstring for legacy zero
      ae02d4e4