Feat: Clear cache during weight loading to prevent OOM on GPUs with <=8GB VRAM
This change explicitly clears the CUDA cache during weight loading to mitigate memory fragmentation, which is particularly beneficial for low-VRAM GPUs.
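As an illustration only, the idea can be sketched roughly as follows. This is not the code from this commit: the function name `load_weights_with_cache_clearing`, the `load_one` callback, and the `clear_every` parameter are hypothetical. The only real library calls assumed are `gc.collect()`, `torch.cuda.is_available()`, and `torch.cuda.empty_cache()`, and the sketch assumes the cache is cleared between individual weight tensors.

```python
import gc
from typing import Any, Callable, Iterable, Tuple


def _default_clear_cache() -> None:
    """Drop unreachable Python objects, then release cached CUDA blocks."""
    gc.collect()
    try:
        import torch  # optional here; the sketch is a no-op without PyTorch
    except ImportError:
        return
    if torch.cuda.is_available():
        # Returns unused cached blocks to the driver; live tensors are untouched.
        torch.cuda.empty_cache()


def load_weights_with_cache_clearing(
    named_weights: Iterable[Tuple[str, Any]],
    load_one: Callable[[str, Any], None],
    clear_every: int = 1,
    clear_cache: Callable[[], None] = _default_clear_cache,
) -> int:
    """Load weights one by one, clearing the cache every `clear_every` tensors.

    `load_one` is a hypothetical callback that copies a single tensor into
    the model. Returns the number of weights loaded.
    """
    if clear_every < 1:
        raise ValueError("clear_every must be >= 1")

    loaded = 0
    for name, tensor in named_weights:
        load_one(name, tensor)
        loaded += 1
        if loaded % clear_every == 0:
            clear_cache()

    # Clear once more if the last batch of weights was not followed by a clear.
    if loaded % clear_every != 0:
        clear_cache()
    return loaded
```

Clearing after every tensor keeps peak reserved memory lowest but adds overhead on each step, so a larger `clear_every` may be a reasonable trade-off on GPUs with more headroom.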