    ggml: Increase maximum graph size · ef549d51
    Jesse Gross authored
    The initial implementation of qwen3-vl:235b exceeded the maximum graph
    size based on the number of tensors. Although this was later fixed
    through the use of the mrope operation, we are close to the limit in
    some cases. This updates the limit to track the current llama.cpp usage of GGML.
ggml.go 44.5 KB