1. 03 Dec, 2025 2 commits
  2. 02 Dec, 2025 1 commit
  3. 27 Nov, 2025 1 commit
  4. 26 Nov, 2025 1 commit
  5. 21 Nov, 2025 1 commit
  6. 19 Nov, 2025 1 commit
    • Mlu590 deployment (#453) · fcc2a411
      Kane authored
      Feature:
          1. Added MLU590 bfloat16 inference for single-GPU and multi-GPU setups.
          2. Added MLU590 int8 inference.
  7. 13 Nov, 2025 1 commit
  8. 24 Oct, 2025 2 commits
  9. 16 Oct, 2025 1 commit
  10. 29 Sep, 2025 2 commits
  11. 18 Sep, 2025 1 commit
  12. 03 Sep, 2025 1 commit
  13. 02 Sep, 2025 1 commit
  14. 28 Aug, 2025 1 commit
  15. 27 Aug, 2025 2 commits
  16. 26 Aug, 2025 1 commit
  17. 20 Aug, 2025 1 commit
  18. 14 Aug, 2025 1 commit
  19. 11 Aug, 2025 1 commit
  20. 09 Aug, 2025 1 commit
  21. 08 Aug, 2025 2 commits
  22. 05 Aug, 2025 2 commits
  23. 30 Jul, 2025 1 commit
  24. 14 Jul, 2025 1 commit
  25. 11 Jun, 2025 1 commit
  26. 22 May, 2025 2 commits
  27. 14 May, 2025 1 commit
  28. 29 Apr, 2025 1 commit
  29. 20 Apr, 2025 2 commits
  30. 08 Apr, 2025 3 commits
    • Support sync cpu offload. (#10) · 683aaa3a
      gushiqiao authored
      
      Co-authored-by: gushiqiao <gushiqiao@sensetime.com>
    • add lint feature and minor fix (#7) · a50bcc53
      Dongz authored
      * [minor]: optimize Dockerfile for fewer layers

      * [feature]: add pre-commit lint, update README with contribution guidance

      * [minor]: fix run shell script permissions

      * [auto]: first lint pass without rule F; fix rule E violations

      * [minor]: fix Dockerfile error
    • wan model cpu_offload (#3) · b93699b0
      TorynCurtis authored
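      * Modified main.py, the t5 model, the wan model, the three weights files, and the three infer files, and registered a new operator among the common conv3d operators

      * Modified the Conv3dWeightForceBF16 operator and updated its use in wan's pre_weights

      * Fixed a bug in the imports

      * Fixed the bug where WanPreWeights and WanTransformerWeights had no self.config

      * Fixed the bug where WanPreWeights and WanTransformerWeights had no self.config

      * Fixed a config bug; when cpu_offload is used, the vae stage currently still has a bug where tensors are not on the same device

      * Fixed the device-transfer bug in the vae stage

      * Fixed the bug where scale still had to be reassigned after mean and inv_std were moved to the new device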