- 09 Jul, 2025 11 commits
-
-
helloyongyang authored
-
helloyongyang authored
-
gushiqiao authored
Update gradio
-
gushiqiao authored
-
wangshankun authored
-
PengGao authored
* fix: correct frequency computation in WanTransformerInfer * refactor: restructure API server and distributed inference services - Removed the old api_server_dist.py file and integrated its functionality into a new modular structure. - Created a new ApiServer class to handle API routes and services. - Introduced DistributedInferenceService and FileService for better separation of concerns. - Updated the main entry point to initialize and run the new API server with distributed inference capabilities. - Added schema definitions for task requests and responses to improve data handling. - Enhanced error handling and logging throughout the services. * refactor: enhance API structure and file handling in server - Introduced APIRouter for modular route management in the ApiServer class. - Updated task creation and file download endpoints to improve clarity and functionality. - Implemented a new method for streaming file responses with proper MIME type handling. - Refactored task request schema to auto-generate task IDs and handle optional video save paths. - Improved error handling and logging for better debugging and user feedback. * feat: add configurable parameters for video generation - Introduced new parameters: infer_steps, target_video_length, and seed to the API and task request schema. - Updated DefaultRunner and VideoGenerationService to handle the new parameters for enhanced video generation control. - Improved default values for parameters to ensure consistent behavior. * refactor: enhance profiling context for async support * refactor: improve signal handling in API server * feat: enhance video generation capabilities with audio support * refactor: improve subprocess call for audio-video merging in wan_audio_runner.py * refactor: enhance API server argument parsing and improve code readability * refactor: enhance logging and improve code comments for clarity * refactor: update response model for task listing endpoint to return a dictionary * docs: update API endpoints and improve documentation clarity * refactor: update API endpoints in scripts for task management and remove unused code * fix: pre-commit
-
Xtra authored
* add mxfp8 quant kernel and some tests * Update .gitignore --------- Co-authored-by:Yang Yong(雍洋) <yongyang1030@163.com>
-
gushiqiao authored
Fix config bugs
-
gushiqiao authored
-
gushiqiao authored
Fix bugs
-
gushiqiao authored
-
- 08 Jul, 2025 5 commits
- 04 Jul, 2025 3 commits
-
-
Yang Rongjin authored
* add readme * modify readme
-
GoatWu authored
-
Zhuguanyu Wu authored
* update lora keys * update lora extractor/merger tools * rename lora config and script files * bug fixed for lora tools
-
- 03 Jul, 2025 10 commits
-
-
Zhuguanyu Wu authored
* update lora keys * update lora extractor/merger tools
-
wangshankun authored
-
wangshankun authored
-
sandy authored
feature: audio driven video gen
-
wangshankun authored
-
wangshankun authored
-
wangshankun authored
-
wangshankun authored
-
gushiqiao authored
Update gradio
-
gushiqiao authored
-
- 02 Jul, 2025 5 commits
-
-
helloyongyang authored
-
helloyongyang authored
-
gushiqiao authored
Enable 720p model inference on low-spec GPUs/CPUs and accelerate T5/C…
-
gushiqiao authored
Enable 720p model inference on low-spec GPUs/CPUs and accelerate T5/CLIP quantized models with vLLM operators
-
gushiqiao authored
update lora adapter
-
- 01 Jul, 2025 6 commits