Adjust mmap logic for CUDA on Windows for faster model load
On Windows, recent llama.cpp changes make mmap slower in most cases, so default it to off. This also implements a tri-state for use_mmap so that we can distinguish a user-provided value of true or false from an unspecified value.
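As a rough illustration of the tri-state idea, here is a minimal sketch in Go. It is not the code from this commit: `resolveUseMmap`, `boolPtr`, and the `goos`/`cuda` parameters are hypothetical names, and gating the default on CUDA specifically is an assumption drawn from the commit title. A nil pointer stands for "unspecified", while a non-nil pointer carries the user's explicit choice.

```go
package main

import "fmt"

// resolveUseMmap is a hypothetical helper, not the project's actual function.
// requested is nil when the user did not specify use_mmap (the third state).
func resolveUseMmap(requested *bool, goos string, cuda bool) bool {
	if requested != nil {
		// An explicit true/false from the user always wins.
		return *requested
	}
	// Unspecified: assume mmap defaults to off for CUDA on Windows
	// and stays on everywhere else.
	if goos == "windows" && cuda {
		return false
	}
	return true
}

// boolPtr returns a pointer to b, for building explicit settings.
func boolPtr(b bool) *bool { return &b }

func main() {
	fmt.Println(resolveUseMmap(nil, "windows", true))           // unspecified on Windows with CUDA
	fmt.Println(resolveUseMmap(boolPtr(true), "windows", true)) // user explicitly asked for mmap
	fmt.Println(resolveUseMmap(nil, "linux", true))             // unspecified elsewhere
}
```

A plain bool cannot express this, because its zero value (false) is indistinguishable from an explicit false; the pointer keeps the "not set" case separate so the platform default only applies when the user said nothing.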