1. 12 Jun, 2025 1 commit
  2. 09 Jun, 2025 2 commits
    • PengGao's avatar
      Implement distributed inference API server with FastAPI (#60) · a94695e5
      PengGao authored
      * Implement distributed inference API server with FastAPI
      
      - Added a new file `api_server_dist.py` to handle video generation tasks using distributed inference.
      - Introduced endpoints for task submission, status checking, and result retrieval.
      - Implemented image downloading and task management with error handling.
      - Enhanced `infer.py` to ensure proper initialization of distributed processes.
      - Created a shell script `start_api_with_dist_inference.sh` for easy server startup with environment setup.
      
      This commit establishes a robust framework for managing video generation tasks in a distributed manner.
      
      * Enhance file download endpoint with path traversal protection and error handling
      
      * style: format
      
      * refactor: Enhance video generation functionality with task interruption support
      
      * feat: Add image upload and video generation endpoint with unique task handling
      
      - Introduced a new endpoint `/v1/local/video/generate_form` for video generation that accepts image uploads.
      - Implemented unique filename generation for uploaded images to prevent conflicts.
      - Enhanced directory management for input and output paths.
      - Improved file download response with detailed status and size information.
      - Added error handling for distributed inference processes and graceful shutdown procedures.
      a94695e5
    • gushiqiao's avatar
      Support run load memory machine, fix some bugs and reconstruct quantizaton. (#61) · 5c241f86
      gushiqiao authored
      
      
      * reconstruct quantization and fix memory leak bug.
      
      * Support lazy load inference.
      
      * reconstruct quantization
      
      * Fix hunyuan bugs
      
      * deleted tmp file
      
      ---------
      Co-authored-by: default avatarroot <root@pt-c0b333b3a1834e81a0d4d5f412c6ffa1-worker-0.pt-c0b333b3a1834e81a0d4d5f412c6ffa1.ns-devsft-3460edd0.svc.cluster.local>
      Co-authored-by: default avatargushiqiao <gushqiaio@sensetime.com>
      Co-authored-by: default avatargushiqiao <gushiqiao@sensetime.com>
      5c241f86
  3. 30 May, 2025 1 commit
    • Zhuguanyu Wu's avatar
      support split server for dit module (#58) · b7d2d43f
      Zhuguanyu Wu authored
      * split dit server from default runner
      
      * split dit server from default runner
      
      * update loading functions
      
      * simplify loader functions and runner functions
      
      * simplify code && split dit service
      
      * simplify code && split dit service
      
      * support split server for cogvideox
      
      * clear code.
      b7d2d43f
  4. 27 May, 2025 1 commit
  5. 23 May, 2025 2 commits
  6. 14 May, 2025 1 commit
  7. 13 May, 2025 4 commits
  8. 12 May, 2025 2 commits
  9. 11 May, 2025 2 commits
  10. 09 May, 2025 2 commits
  11. 08 May, 2025 3 commits
  12. 07 May, 2025 1 commit
  13. 30 Apr, 2025 2 commits
    • helloyongyang's avatar
      update post.py examples · a81ad1e5
      helloyongyang authored
      a81ad1e5
    • Zhuguanyu Wu's avatar
      Add prompt enhancer (#29) · d725c154
      Zhuguanyu Wu authored
      * [bugs fixed] fixed bugs for cpu offload.
      
      * [rename] rename causal_model -> causvid_model
      
      * [feature] add prompt enhancer
      
      * [feature] add prompt enhancer
      
      * [rename] rename causal_model -> causvid_model
      d725c154
  14. 29 Apr, 2025 2 commits
  15. 28 Apr, 2025 1 commit
  16. 27 Apr, 2025 3 commits
  17. 24 Apr, 2025 1 commit
  18. 23 Apr, 2025 3 commits
  19. 22 Apr, 2025 1 commit
  20. 21 Apr, 2025 1 commit
  21. 20 Apr, 2025 1 commit
  22. 11 Apr, 2025 1 commit
  23. 08 Apr, 2025 2 commits