1. 21 Nov, 2025 1 commit
  2. 19 Nov, 2025 1 commit
    • Kane's avatar
      Mlu590 deployment (#453) · fcc2a411
      Kane authored
      Feature:
          1. added mlu590 bfloat16, single-gpu and multi-gpus inference.
          2. added mlu590 int8 inference.
      fcc2a411
  3. 13 Nov, 2025 1 commit
  4. 03 Nov, 2025 1 commit
  5. 28 Oct, 2025 1 commit
  6. 24 Oct, 2025 2 commits
  7. 22 Oct, 2025 1 commit
  8. 20 Oct, 2025 1 commit
  9. 16 Oct, 2025 1 commit
  10. 01 Oct, 2025 1 commit
  11. 29 Sep, 2025 2 commits
  12. 18 Sep, 2025 1 commit
  13. 15 Sep, 2025 1 commit
  14. 03 Sep, 2025 1 commit
  15. 02 Sep, 2025 2 commits
  16. 01 Sep, 2025 2 commits
  17. 28 Aug, 2025 1 commit
  18. 27 Aug, 2025 2 commits
  19. 26 Aug, 2025 1 commit
  20. 25 Aug, 2025 1 commit
    • sandy's avatar
      Fix/wan2 2 vae encode api (#244) · 87343386
      sandy authored
      * bugfix:adapt to  5B dit model, derive attention_head_dim from config[dim]
      
      * [Fix] Wan2.2 Vae Encode refactor: drop args parameter and use self.cpu_offload
      87343386
  21. 20 Aug, 2025 3 commits
  22. 18 Aug, 2025 2 commits
  23. 14 Aug, 2025 4 commits
  24. 12 Aug, 2025 1 commit
  25. 11 Aug, 2025 1 commit
  26. 09 Aug, 2025 1 commit
  27. 08 Aug, 2025 2 commits
  28. 06 Aug, 2025 1 commit