"eigen-master/bench/btl/actions/action_hessenberg.hh" did not exist on "e7df86554156b36846008d8ddbcc4d8521a16554"
  1. 23 Sep, 2025 1 commit
    • Muyang Li's avatar
      chore: add qwen-image tests for v1, clean the requirements (#712) · de6a75b6
      Muyang Li authored
      * add the v1 tests
      
      * formalize the requirements
      
      * update
      
      * update
      
      * well format the dependency
      
      * update docs
      
      * update
      
      * update
      
      * update the dependency
      
      * fix the version in ci
      
      * well organize the dependency
      
      * update the pr list
      
      * update tests
      
      * add tests for qwen-image
      
      * add qwen-image lightning
      
      * add tests
      
      * update fp4 lpips
      
      * add tests for qwen-image-edit
      
      * update
      
      * updat
      
      * add 8 steps lora back
      
      * update nvfp4 results
      
      * add qwen-image-edit-lightning
      
      * style: make linter happy
      
      * do not use batch when batchsize is 1
      
      * update
      
      * finished all the tests
      
      * add cn test
      
      * style: make linter happy
      
      * update
      
      * add qwen-image controlnet
      
      * update
      
      * add fp4 metrics
      
      * fix the tests
      de6a75b6
  2. 10 Sep, 2025 1 commit
  3. 03 Sep, 2025 1 commit
    • Muyang Li's avatar
      feat: async CPU offloading for Python backend (#624) · eb901251
      Muyang Li authored
      * tmp
      
      * update
      
      * update
      
      * finished the offloading impl
      
      * the offloading is buggy
      
      * update utils
      
      * the offloading is still buggy
      
      * update
      
      * correctness and speedup done; need to check the vram overhead
      
      * done
      
      * final debugging
      
      * update
      
      * update
      
      * correct now
      
      * fix
      
      * update
      
      * use per-layer offloading
      
      * fix the offloading on 5090
      
      * support setting the num_blocks_on_gpu
      
      * change the import name
      eb901251
  4. 27 Aug, 2025 1 commit