• Jesse Gross's avatar
    ggml: Report graph memory for failed allocations · 6db8a377
    Jesse Gross authored
    GGML has a function to report the allocated size of a backend buffer.
    However, this returns 0 if we tried to allocate a buffer and it failed.
    For memory management purposes, it's important to know how much we were
    trying to allocate. This extends the API to report attempted sizes for
    all buffers and whether it succeeeded.
    6db8a377
ggml-backend.cpp 76.6 KB