1. 11 Jul, 2025 1 commit
  2. 10 Jul, 2025 3 commits
  3. 09 Jul, 2025 1 commit
    • PengGao's avatar
      Refactor/api (#94) · 398b598a
      PengGao authored
      * fix: correct frequency computation in WanTransformerInfer
      
      * refactor: restructure API server and distributed inference services
      
      - Removed the old api_server_dist.py file and integrated its functionality into a new modular structure.
      - Created a new ApiServer class to handle API routes and services.
      - Introduced DistributedInferenceService and FileService for better separation of concerns.
      - Updated the main entry point to initialize and run the new API server with distributed inference capabilities.
      - Added schema definitions for task requests and responses to improve data handling.
      - Enhanced error handling and logging throughout the services.
      
      * refactor: enhance API structure and file handling in server
      
      - Introduced APIRouter for modular route management in the ApiServer class.
      - Updated task creation and file download endpoints to improve clarity and functionality.
      - Implemented a new method for streaming file responses with proper MIME type handling.
      - Refactored task request schema to auto-generate task IDs and handle optional video save paths.
      - Improved error handling and logging for better debugging and user feedback.
      
      * feat: add configurable parameters for video generation
      
      - Introduced new parameters: infer_steps, target_video_length, and seed to the API and task request schema.
      - Updated DefaultRunner and VideoGenerationService to handle the new parameters for enhanced video generation control.
      - Improved default values for parameters to ensure consistent behavior.
      
      * refactor: enhance profiling context for async support
      
      * refactor: improve signal handling in API server
      
      * feat: enhance video generation capabilities with audio support
      
      * refactor: improve subprocess call for audio-video merging in wan_audio_runner.py
      
      * refactor: enhance API server argument parsing and improve code readability
      
      * refactor: enhance logging and improve code comments for clarity
      
      * refactor: update response model for task listing endpoint to return a dictionary
      
      * docs: update API endpoints and improve documentation clarity
      
      * refactor: update API endpoints in scripts for task management and remove unused code
      
      * fix: pre-commit
      398b598a
  4. 04 Jul, 2025 2 commits
  5. 03 Jul, 2025 5 commits
  6. 02 Jul, 2025 1 commit
  7. 01 Jul, 2025 3 commits
  8. 30 Jun, 2025 2 commits
  9. 23 Jun, 2025 1 commit
  10. 16 Jun, 2025 2 commits
  11. 12 Jun, 2025 1 commit
  12. 09 Jun, 2025 2 commits
    • PengGao's avatar
      Implement distributed inference API server with FastAPI (#60) · a94695e5
      PengGao authored
      * Implement distributed inference API server with FastAPI
      
      - Added a new file `api_server_dist.py` to handle video generation tasks using distributed inference.
      - Introduced endpoints for task submission, status checking, and result retrieval.
      - Implemented image downloading and task management with error handling.
      - Enhanced `infer.py` to ensure proper initialization of distributed processes.
      - Created a shell script `start_api_with_dist_inference.sh` for easy server startup with environment setup.
      
      This commit establishes a robust framework for managing video generation tasks in a distributed manner.
      
      * Enhance file download endpoint with path traversal protection and error handling
      
      * style: format
      
      * refactor: Enhance video generation functionality with task interruption support
      
      * feat: Add image upload and video generation endpoint with unique task handling
      
      - Introduced a new endpoint `/v1/local/video/generate_form` for video generation that accepts image uploads.
      - Implemented unique filename generation for uploaded images to prevent conflicts.
      - Enhanced directory management for input and output paths.
      - Improved file download response with detailed status and size information.
      - Added error handling for distributed inference processes and graceful shutdown procedures.
      a94695e5
    • gushiqiao's avatar
      Support run load memory machine, fix some bugs and reconstruct quantizaton. (#61) · 5c241f86
      gushiqiao authored
      
      
      * reconstruct quantization and fix memory leak bug.
      
      * Support lazy load inference.
      
      * reconstruct quantization
      
      * Fix hunyuan bugs
      
      * deleted tmp file
      
      ---------
      Co-authored-by: default avatarroot <root@pt-c0b333b3a1834e81a0d4d5f412c6ffa1-worker-0.pt-c0b333b3a1834e81a0d4d5f412c6ffa1.ns-devsft-3460edd0.svc.cluster.local>
      Co-authored-by: default avatargushiqiao <gushqiaio@sensetime.com>
      Co-authored-by: default avatargushiqiao <gushiqiao@sensetime.com>
      5c241f86
  13. 30 May, 2025 1 commit
    • Zhuguanyu Wu's avatar
      support split server for dit module (#58) · b7d2d43f
      Zhuguanyu Wu authored
      * split dit server from default runner
      
      * split dit server from default runner
      
      * update loading functions
      
      * simplify loader functions and runner functions
      
      * simplify code && split dit service
      
      * simplify code && split dit service
      
      * support split server for cogvideox
      
      * clear code.
      b7d2d43f
  14. 27 May, 2025 1 commit
  15. 23 May, 2025 2 commits
  16. 14 May, 2025 1 commit
  17. 13 May, 2025 4 commits
  18. 12 May, 2025 2 commits
  19. 11 May, 2025 2 commits
  20. 09 May, 2025 2 commits
  21. 08 May, 2025 1 commit