Enable the build flag for llama.cpp to use CPU copy for multi-GPU scenarios.
Attach a file by drag & drop or click to upload