- 11 Jul, 2025 1 commit
-
-
gushiqiao authored
-
- 10 Jul, 2025 1 commit
-
-
gushiqiao authored
-
- 09 Jul, 2025 1 commit
-
-
gushiqiao authored
-
- 08 Jul, 2025 2 commits
- 03 Jul, 2025 1 commit
-
-
wangshankun authored
-
- 02 Jul, 2025 1 commit
-
-
gushiqiao authored
Enable 720p model inference on low-spec GPUs/CPUs and accelerate T5/CLIP quantized models with vLLM operators
-
- 29 Jun, 2025 2 commits
-
-
Yang Yong(雍洋) authored
Co-authored-by:Linboyan-trc <1584340372@qq.com>
-
helloyongyang authored
-
- 26 Jun, 2025 1 commit
-
-
gushiqiao authored
-
- 23 Jun, 2025 1 commit
-
-
gushiqiao authored
-
- 16 Jun, 2025 3 commits
-
-
gushiqiao authored
-
gushiqiao authored
-
Zhuguanyu Wu authored
* add step & cfg distillation wan model
-
- 12 Jun, 2025 1 commit
-
-
Zhuguanyu Wu authored
* add step & cfg distillation wan model * bug fixed
-
- 11 Jun, 2025 1 commit
-
-
gushiqiao authored
-
- 10 Jun, 2025 1 commit
-
-
gushiqiao authored
-
- 09 Jun, 2025 2 commits
-
-
gushiqiao authored
-
gushiqiao authored
* reconstruct quantization and fix memory leak bug. * Support lazy load inference. * reconstruct quantization * Fix hunyuan bugs * deleted tmp file --------- Co-authored-by:
root <root@pt-c0b333b3a1834e81a0d4d5f412c6ffa1-worker-0.pt-c0b333b3a1834e81a0d4d5f412c6ffa1.ns-devsft-3460edd0.svc.cluster.local> Co-authored-by:
gushiqiao <gushqiaio@sensetime.com> Co-authored-by:
gushiqiao <gushiqiao@sensetime.com>
-
- 30 May, 2025 1 commit
-
-
Zhuguanyu Wu authored
* split dit server from default runner * split dit server from default runner * update loading functions * simplify loader functions and runner functions * simplify code && split dit service * simplify code && split dit service * support split server for cogvideox * clear code.
-
- 28 May, 2025 1 commit
-
-
Xinchi Huang authored
* fix offload extra latency in the first step by pre-allocating pinned memory * pre-commit --------- Co-authored-by:“de1star” <“843414674@qq.com”>
-
- 27 May, 2025 1 commit
-
-
Watebear authored
-
- 23 May, 2025 2 commits
-
-
Zhuguanyu Wu authored
* support prompt enhancer server * bugs fixed * finished prompt enhancer service
-
Zhuguanyu Wu authored
* add load_transformer methods for split server * add service utils * [feature] support split servers
-
- 22 May, 2025 2 commits
-
-
Xinchi Huang authored
* async offload & context4debug * offload ratio * Merge branch 'main' into xinchi/fix_offload * adding offload ratio * pre-commit --------- Co-authored-by:“de1star” <“843414674@qq.com”>
-
root authored
-
- 14 May, 2025 1 commit
-
-
Xinchi Huang authored
* fix offload * fix offload --------- Co-authored-by:“de1star” <“843414674@qq.com”>
-
- 13 May, 2025 1 commit
-
-
TorynCurtis authored
* function hunyuan_t2v_tea, hunyuan_t2v_taylorseer, modify the fresh_threshold of taylorseer * hunyuan i2v,t2v + tea,tay; wan i2v,t2v + tea function, add log files * 删除了TeaCace Scheduler的多余属性 * 删除了多余目录 * 修复了TeaCaching部分的bug,目前t2v, i2v feature caching均可跑通 * Update attn_weight.py --------- Co-authored-by:Yang Yong(雍洋) <yongyang1030@163.com>
-
- 09 May, 2025 3 commits
-
-
gushiqiao authored
* Support load advance ptq model. * Update run_wan_i2v_advanced_ptq.sh --------- Co-authored-by:
gushiqiao <gushiqiao@sensetime.com> Co-authored-by:
Yang Yong(雍洋) <yongyang1030@163.com>
-
helloyongyang authored
-
helloyongyang authored
-
- 08 May, 2025 1 commit
-
-
Dongz authored
* [feature]: add Wan Sparge infer * Update scripts/run_wan_t2v_sparge.sh Co-authored-by:
Copilot <175728472+Copilot@users.noreply.github.com> * [minor]: fix typo and use config style * [minor]: remove breakpoint * [feature]: add all attn class * [minor]: remove args * [minor]: remove shared weights --------- Co-authored-by:
Copilot <175728472+Copilot@users.noreply.github.com>
-
- 07 May, 2025 2 commits
-
-
helloyongyang authored
-
helloyongyang authored
-
- 30 Apr, 2025 1 commit
-
-
Zhuguanyu Wu authored
* [bugs fixed] fixed bugs for cpu offload. * [rename] rename causal_model -> causvid_model * [feature] add prompt enhancer * [feature] add prompt enhancer * [rename] rename causal_model -> causvid_model
-
- 29 Apr, 2025 2 commits
-
-
helloyongyang authored
-
root authored
-
- 28 Apr, 2025 1 commit
-
-
helloyongyang authored
-
- 11 Apr, 2025 1 commit
-
-
gushiqiao authored
* Support hunyuan offload and teacache. * Fix * Fix --------- Co-authored-by:gushiqiao <gushiqiao@sensetime.com>
-
- 09 Apr, 2025 1 commit
-
-
helloyongyang authored
-