"ggml/src/ggml-vulkan/vulkan-shaders/sum_rows.comp" did not exist on "4cc1a6143387f41e2466536abcd6a2620b63a35b"
  • Jesse Gross's avatar
    runner.go: Better abstract vision model integration · c826e574
    Jesse Gross authored
    
    
    -Update mllama to take the cross attention state as embeddings in
    a batch, more similar to how Llava handles it. This improves
    integration with the input cache.
    -Pass locations in a prompt for embeddings using tags similar to Llava.
    -Abstract interface to vision models so the main runner accesses Clip
    and Mllama similarly
    Co-authored-by: default avatarMichael Yang <mxyng@pm.me>
    c826e574
llava.cpp 23.3 KB