lightx2v/models/networks/wan/infer/pre_infer.py · 62d8881a9fc1f3b239b9386f56ce8e7c44b5be67 · xuwx1 / LightX2V · GitLab

"ollama/llm/llama.cpp/ggml/src/ggml-cuda/diagmask.cu" did not exist on "ff27a8172ae24bbcff76eec4220c3081852c201b"

Find file Blame History Permalink

Enable 720p model inference on low-spec GPUs/CPUs and accelerate T5/CLIP... · d66b98de
gushiqiao authored Jul 02, 2025
```
Enable 720p model inference on low-spec GPUs/CPUs and accelerate T5/CLIP quantized models with vLLM operators
```
d66b98de

pre_infer.py 4.82 KB