1. 09 Dec, 2025 1 commit
  2. 03 Dec, 2025 3 commits
  3. 02 Dec, 2025 1 commit
  4. 27 Nov, 2025 1 commit
  5. 26 Nov, 2025 1 commit
  6. 21 Nov, 2025 1 commit
  7. 19 Nov, 2025 1 commit
    • Kane's avatar
      Mlu590 deployment (#453) · fcc2a411
      Kane authored
      Feature:
          1. added mlu590 bfloat16, single-gpu and multi-gpus inference.
          2. added mlu590 int8 inference.
      fcc2a411
  8. 13 Nov, 2025 1 commit
  9. 03 Nov, 2025 1 commit
  10. 28 Oct, 2025 1 commit
  11. 24 Oct, 2025 2 commits
  12. 22 Oct, 2025 1 commit
  13. 20 Oct, 2025 1 commit
  14. 16 Oct, 2025 1 commit
  15. 01 Oct, 2025 1 commit
  16. 29 Sep, 2025 2 commits
  17. 18 Sep, 2025 1 commit
  18. 15 Sep, 2025 1 commit
  19. 03 Sep, 2025 1 commit
  20. 02 Sep, 2025 2 commits
  21. 01 Sep, 2025 2 commits
  22. 28 Aug, 2025 1 commit
  23. 27 Aug, 2025 2 commits
  24. 26 Aug, 2025 1 commit
  25. 25 Aug, 2025 1 commit
    • sandy's avatar
      Fix/wan2 2 vae encode api (#244) · 87343386
      sandy authored
      * bugfix:adapt to  5B dit model, derive attention_head_dim from config[dim]
      
      * [Fix] Wan2.2 Vae Encode refactor: drop args parameter and use self.cpu_offload
      87343386
  26. 20 Aug, 2025 3 commits
  27. 18 Aug, 2025 2 commits
  28. 14 Aug, 2025 3 commits