• Jesse Gross's avatar
    ggml: Seperate tensor load from backend creation · 94ab428e
    Jesse Gross authored
    Currently, when the backend is created, the tensors are loaded at the
    same time, which is a slow operation. This separates them to be two
    steps:
     - Create backend, including enumerating tensors and memory allocation
     - Loading tensor data
    
    This allows more flexibility in managing model loading.
    94ab428e
convert_test.go 10.1 KB