"benchmark/vscode:/vscode.git/clone" did not exist on "df97b31f378995a3cd611445047e9aab9d23841b"
- 29 Aug, 2025 1 commit
-
-
Watebear authored
* feature: qwen-image support cpu offload (block) and refactor transfomrer model * bugfix
-
- 26 Aug, 2025 1 commit
-
-
gushiqiao authored
* [Fix] Fix audio model offload bug. * Fix * Fix * Fix * Fix * Fix * Fix
-
- 08 Aug, 2025 1 commit
-
-
gushiqiao authored
-
- 01 Aug, 2025 1 commit
-
-
gushiqiao authored
-
- 22 Jul, 2025 1 commit
-
-
helloyongyang authored
-
- 12 Jul, 2025 1 commit
-
-
gushiqiao authored
-
- 11 Jul, 2025 3 commits
-
-
GoatWu authored
-
helloyongyang authored
-
helloyongyang authored
-
- 03 Jul, 2025 1 commit
-
-
wangshankun authored
-
- 02 Jul, 2025 1 commit
-
-
gushiqiao authored
Enable 720p model inference on low-spec GPUs/CPUs and accelerate T5/CLIP quantized models with vLLM operators
-
- 01 Jul, 2025 2 commits
-
-
gushiqiao authored
-
gushiqiao authored
Co-authored-by:gushiqiao <gushiqiao@sensetime.com>
-
- 10 Jun, 2025 1 commit
-
-
gushiqiao authored
-
- 09 Jun, 2025 1 commit
-
-
gushiqiao authored
* reconstruct quantization and fix memory leak bug. * Support lazy load inference. * reconstruct quantization * Fix hunyuan bugs * deleted tmp file --------- Co-authored-by:
root <root@pt-c0b333b3a1834e81a0d4d5f412c6ffa1-worker-0.pt-c0b333b3a1834e81a0d4d5f412c6ffa1.ns-devsft-3460edd0.svc.cluster.local> Co-authored-by:
gushiqiao <gushqiaio@sensetime.com> Co-authored-by:
gushiqiao <gushiqiao@sensetime.com>
-
- 22 May, 2025 1 commit
-
-
root authored
-