"vscode:/vscode.git/clone" did not exist on "58bf2682612bc29b7cdb8a10ba6eee28a024d6d3"
llm: Support KV cache quantization with gpt-oss
With the new version of GGML in #12245, KV cache quantization no longer causes a fallback to CPU.
Showing
Please register or sign in to comment