Commits · 708ea9d7e06ee09df35f66150e21abc5ab40af5f · xuwx1 / LightX2V

11 Jul, 2025 1 commit
- Debug distill (#105) · 708ea9d7
  Zhuguanyu Wu authored Jul 11, 2025
```
* bugs fixed for distill_model server

* bug fixed for lora merge
```
  708ea9d7
10 Jul, 2025 3 commits
- update doc · 8db2daa3
  helloyongyang authored Jul 10, 2025
  
  8db2daa3
- update readme and docs · 8ec850b8
  helloyongyang authored Jul 10, 2025
  
  8ec850b8
- Support dynamic CFG distillation (#100) · 6ac3cee7
  Zhuguanyu Wu authored Jul 10, 2025
```
* support dynamic cfg for cfg_distill

* reformat files
```
  6ac3cee7
09 Jul, 2025 1 commit

PengGao authored Jul 09, 2025

* fix: correct frequency computation in WanTransformerInfer

* refactor: restructure API server and distributed inference services

- Removed the old api_server_dist.py file and integrated its functionality into a new modular structure.
- Created a new ApiServer class to handle API routes and services.
- Introduced DistributedInferenceService and FileService for better separation of concerns.
- Updated the main entry point to initialize and run the new API server with distributed inference capabilities.
- Added schema definitions for task requests and responses to improve data handling.
- Enhanced error handling and logging throughout the services.

* refactor: enhance API structure and file handling in server

- Introduced APIRouter for modular route management in the ApiServer class.
- Updated task creation and file download endpoints to improve clarity and functionality.
- Implemented a new method for streaming file responses with proper MIME type handling.
- Refactored task request schema to auto-generate task IDs and handle optional video save paths.
- Improved error handling and logging for better debugging and user feedback.

* feat: add configurable parameters for video generation

- Introduced new parameters: infer_steps, target_video_length, and seed to the API and task request schema.
- Updated DefaultRunner and VideoGenerationService to handle the new parameters for enhanced video generation control.
- Improved default values for parameters to ensure consistent behavior.

* refactor: enhance profiling context for async support

* refactor: improve signal handling in API server

* feat: enhance video generation capabilities with audio support

* refactor: improve subprocess call for audio-video merging in wan_audio_runner.py

* refactor: enhance API server argument parsing and improve code readability

* refactor: enhance logging and improve code comments for clarity

* refactor: update response model for task listing endpoint to return a dictionary

* docs: update API endpoints and improve documentation clarity

* refactor: update API endpoints in scripts for task management and remove unused code

* fix: pre-commit

398b598a

04 Jul, 2025 2 commits

Cache readme (#90) · 7a8951ba
Yang Rongjin authored Jul 05, 2025
```
* add readme

* modify readme
```
7a8951ba

bug fixed for lora tools (#89) · 2d823f25

Zhuguanyu Wu authored Jul 04, 2025

* update lora keys

* update lora extractor/merger tools

* rename lora config and script files

* bug fixed for lora tools

2d823f25

03 Jul, 2025 5 commits
- ♻️ Refactor: Move audio inference files to 'infer/audio' subdirectory · b8084e83
  wangshankun authored Jul 03, 2025
  
  b8084e83
- update config attention type · 8b230da5
  wangshankun authored Jul 03, 2025
  
  8b230da5
- Support:radial attention · 6060ff4f
  wangshankun authored Jul 03, 2025
  
  6060ff4f
- Apply: distll lora · b2147c40
  wangshankun authored Jun 27, 2025
  
  b2147c40
- audio驱动wan视频生成 · e58dd9fe
  wangshankun authored Jun 25, 2025
  
  e58dd9fe
02 Jul, 2025 1 commit
- Enable 720p model inference on low-spec GPUs/CPUs and accelerate T5/CLIP... · d66b98de
  gushiqiao authored Jul 02, 2025
```
Enable 720p model inference on low-spec GPUs/CPUs and accelerate T5/CLIP quantized models with vLLM operators
```
  d66b98de
01 Jul, 2025 3 commits
- update configs · 11fcc3fb
  GoatWu authored Jul 01, 2025
  
  11fcc3fb
- update lora adapter · 1f50bcb2
  GoatWu authored Jul 01, 2025
  
  1f50bcb2
- reconstruct config and scripts (#81) · 8bc0da34
  gushiqiao authored Jul 01, 2025
```
Co-authored-by: gushiqiao <gushiqiao@sensetime.com>
```
  8bc0da34
30 Jun, 2025 2 commits
- update i2v 4_step distillation scripts · fb69083e
  GoatWu authored Jun 30, 2025
  
  fb69083e
- update caching scripts and configs · d7d45faf
  helloyongyang authored Jun 30, 2025
  
  d7d45faf
23 Jun, 2025 1 commit
- Update demo · 941fa16f
  gushiqiao authored Jun 23, 2025
  
  941fa16f
16 Jun, 2025 2 commits
- Fixed the accuracy fluctuation bug · 8ba6e3b4
  gushiqiao authored Jun 16, 2025
  
  8ba6e3b4
- Dev distill (#69) · e9e33065
  Zhuguanyu Wu authored Jun 16, 2025
```
* add step & cfg distillation wan model
```
  e9e33065
12 Jun, 2025 1 commit
- add wan2.1 cfg & step distillation model (#67) · 793ec1db
  Zhuguanyu Wu authored Jun 12, 2025
```
* add step & cfg distillation wan model

* bug fixed
```
  793ec1db
09 Jun, 2025 2 commits

Implement distributed inference API server with FastAPI (#60) · a94695e5

PengGao authored Jun 09, 2025

* Implement distributed inference API server with FastAPI

- Added a new file `api_server_dist.py` to handle video generation tasks using distributed inference.
- Introduced endpoints for task submission, status checking, and result retrieval.
- Implemented image downloading and task management with error handling.
- Enhanced `infer.py` to ensure proper initialization of distributed processes.
- Created a shell script `start_api_with_dist_inference.sh` for easy server startup with environment setup.

This commit establishes a robust framework for managing video generation tasks in a distributed manner.

* Enhance file download endpoint with path traversal protection and error handling

* style: format

* refactor: Enhance video generation functionality with task interruption support

* feat: Add image upload and video generation endpoint with unique task handling

- Introduced a new endpoint `/v1/local/video/generate_form` for video generation that accepts image uploads.
- Implemented unique filename generation for uploaded images to prevent conflicts.
- Enhanced directory management for input and output paths.
- Improved file download response with detailed status and size information.
- Added error handling for distributed inference processes and graceful shutdown procedures.

a94695e5

Support run load memory machine, fix some bugs and reconstruct quantizaton. (#61) · 5c241f86

gushiqiao authored Jun 09, 2025



* reconstruct quantization and fix memory leak bug.

* Support lazy load inference.

* reconstruct quantization

* Fix hunyuan bugs

* deleted tmp file

---------
Co-authored-by: root <root@pt-c0b333b3a1834e81a0d4d5f412c6ffa1-worker-0.pt-c0b333b3a1834e81a0d4d5f412c6ffa1.ns-devsft-3460edd0.svc.cluster.local>
Co-authored-by: gushiqiao <gushqiaio@sensetime.com>
Co-authored-by: gushiqiao <gushiqiao@sensetime.com>

5c241f86

30 May, 2025 1 commit

support split server for dit module (#58) · b7d2d43f

Zhuguanyu Wu authored May 30, 2025

* split dit server from default runner

* split dit server from default runner

* update loading functions

* simplify loader functions and runner functions

* simplify code && split dit service

* simplify code && split dit service

* support split server for cogvideox

* clear code.

b7d2d43f

27 May, 2025 1 commit
- [feature]: add cogvideox t2v (#55) · 45467a6b
  Watebear authored May 27, 2025
  
  45467a6b
23 May, 2025 2 commits

support prompt enhancer with vllm (#53) · 429dcc45
Zhuguanyu Wu authored May 23, 2025
```
* support prompt enhancer server

* bugs fixed

* finished prompt enhancer service
```
429dcc45

[feature] support split server (#50) · a852f879

Zhuguanyu Wu authored May 23, 2025

* add load_transformer methods for split server

* add service utils

* [feature] support split servers

a852f879

14 May, 2025 1 commit
- remove unsed files · c8606815
  helloyongyang authored May 14, 2025
  
  c8606815
13 May, 2025 4 commits

[feature]: Add CausalVid I2V · 7ec70cbb
wangshankun authored May 13, 2025

7ec70cbb

update multi gpu servers (#41) · fc2468ce

Zhuguanyu Wu authored May 13, 2025

* [feature] add server for multi-gpus

* [update] update start parameters for multi-gpu servers

* [update] update start parameters for multi-gpu servers

* [update] update documents for multi-gpu services

fc2468ce

[feature] add server for multi-gpus (#40) · b4322e20
Zhuguanyu Wu authored May 13, 2025

b4322e20

function feature caching (#38) · cfd0423f

TorynCurtis authored May 13, 2025



* function hunyuan_t2v_tea, hunyuan_t2v_taylorseer, modify the fresh_threshold of taylorseer

* hunyuan i2v,t2v + tea,tay; wan i2v,t2v + tea function, add log files

* 删除了TeaCace Scheduler的多余属性

* 删除了多余目录

* 修复了TeaCaching部分的bug,目前t2v, i2v feature caching均可跑通

* Update attn_weight.py

---------
Co-authored-by: Yang Yong(雍洋) <yongyang1030@163.com>

cfd0423f

12 May, 2025 2 commits
- adding a padding strategy for SP (#37) · bd1e469c
  Xinchi Huang authored May 12, 2025
```
adding a padding strategy for SP

---------
Co-authored-by: “de1star” <“843414674@qq.com”>
```
  bd1e469c
- feat(server): Support stopping the running task · fe13f4db
  helloyongyang authored May 12, 2025
  
  fe13f4db
11 May, 2025 2 commits
- feat(server): Support multi servers · ea8da6fb
  helloyongyang authored May 12, 2025
  
  ea8da6fb
- feat(server): Support async server · af02604e
  helloyongyang authored May 12, 2025
  
  af02604e
09 May, 2025 2 commits

update configs for causvid (#34) · ad0237f9
Zhuguanyu Wu authored May 09, 2025

ad0237f9

Support load advance ptq model. (#33) · 165ec807

gushiqiao authored May 09, 2025



* Support load advance ptq model.

* Update run_wan_i2v_advanced_ptq.sh

---------
Co-authored-by: gushiqiao <gushiqiao@sensetime.com>
Co-authored-by: Yang Yong(雍洋) <yongyang1030@163.com>

165ec807

08 May, 2025 1 commit

[feature]: add Wan Sparge infer (#32) · 78640ad0

Dongz authored May 08, 2025



* [feature]: add Wan Sparge infer

* Update scripts/run_wan_t2v_sparge.sh
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* [minor]: fix typo and use config style

* [minor]: remove breakpoint

* [feature]: add all attn class

* [minor]: remove args

* [minor]: remove shared weights

---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

78640ad0