improve api error handling (#781)
- remove new lines from llama.cpp error messages relayed to client
- check api option types and return error on wrong type
- change num layers from 95% VRAM to 92% VRAM
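
For illustration only, the first two changes might look roughly like the sketch below. This is not the code from this commit: `sanitizeError`, `checkOptions`, and the `expectedKinds` table are hypothetical names, and the option list is a small assumed sample. It assumes options arrive as decoded JSON, where numbers become `float64`.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// sanitizeError (hypothetical helper) puts a multi-line error message on a
// single line. It collapses every run of whitespace, including new lines,
// into one space and trims both ends.
func sanitizeError(msg string) string {
	return strings.Join(strings.Fields(msg), " ")
}

// expectedKinds is an assumed sample of option names and the JSON kind each
// one should have. It is not the real option table.
var expectedKinds = map[string]string{
	"num_ctx":     "number",
	"temperature": "number",
	"use_mmap":    "bool",
	"stop":        "list",
}

// checkOptions (hypothetical helper) returns an error for the first option it
// finds whose decoded JSON value has the wrong kind. Options missing from
// expectedKinds are ignored in this sketch.
func checkOptions(opts map[string]any) error {
	for name, val := range opts {
		want, known := expectedKinds[name]
		if !known {
			continue
		}
		var got string
		switch val.(type) {
		case float64:
			got = "number"
		case bool:
			got = "bool"
		case string:
			got = "string"
		case []any:
			got = "list"
		default:
			got = fmt.Sprintf("%T", val)
		}
		if got != want {
			return fmt.Errorf("option %q must be of type %s, got %s", name, want, got)
		}
	}
	return nil
}

func main() {
	// A request body where num_ctx was sent as a string instead of a number.
	var opts map[string]any
	if err := json.Unmarshal([]byte(`{"num_ctx": "4096"}`), &opts); err != nil {
		panic(err)
	}
	if err := checkOptions(opts); err != nil {
		fmt.Println("error:", err)
	}

	// A multi-line runner error relayed as a single line.
	fmt.Println(sanitizeError("out of memory\nfailed to load model\n"))
}
```

Returning on the first mismatch keeps the error short for the client; collecting every mismatch would be an equally reasonable choice.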