• Jesse Gross's avatar
    ggml: No-alloc mode · 79f6376f
    Jesse Gross authored
    Callers can set a backend buffer type to be no-alloc, meaning that
    it does not allocate memory for tensors or operations. This can
    be used for calculating memory requirements. Tensors and graphs
    must be recreated with no-alloc set to false before loading data.
    
    Defaults to false for newly created backend buffer types.
    79f6376f
0026-ggml-No-alloc-mode.patch 3.65 KB