Blame · llm/llama.cpp/ggml-cuda/fattn-vec-f16.cuh · 97b02a8981110c76d901e8b5f96af514ee0326f3 · OpenDAS / ollama · GitLab

Switch branch/tag

ollama

llm

llama.cpp

ggml-cuda

fattn-vec-f16.cuh
Find file
Normal viewHistoryPermalink

fattn-vec-f16.cuh

213 Bytes

Newer

Older

v1

mashun1
committed
Jun 26, 2024

#include "common.cuh"

void ggml_cuda_flash_attn_ext_vec_f16(ggml_backend_cuda_context & ctx, ggml_tensor * dst);

void ggml_cuda_flash_attn_ext_vec_f16_no_mma(ggml_backend_cuda_context & ctx, ggml_tensor * dst);