Blame · ggml/src/ggml-cuda/softmax.cuh · 4cc1a6143387f41e2466536abcd6a2620b63a35b · OpenDAS / llama.cpp · GitLab

softmax.cuh

142 Bytes

init (xuxzh1 committed Nov 11, 2024)

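#include "common.cuh"

// Upper bound on the number of threads per block used when launching the soft max kernels.
#define CUDA_SOFT_MAX_BLOCK_SIZE 1024

// Host-side entry point for the soft max operation on the CUDA backend.
// The inputs are read from dst's source tensors and the result is written to dst.
void ggml_cuda_op_soft_max(ggml_backend_cuda_context & ctx, ggml_tensor * dst);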
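
The header only declares the entry point, so the computation itself is not visible here. As a rough illustration of what a soft max operation of this kind typically computes, the following is a minimal CPU sketch, assuming a row-wise soft max with an optional scale factor and an optional additive mask. The function name `soft_max_row_ref` and its parameters are hypothetical and are not part of ggml; the actual CUDA kernels may differ in detail.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference (not part of ggml): numerically stable soft max over one row.
// Assumption: each input is multiplied by `scale`, and `mask` (may be nullptr) is added
// element-wise before exponentiation.
std::vector<float> soft_max_row_ref(const std::vector<float> & x,
                                    float scale = 1.0f,
                                    const float * mask = nullptr) {
    std::vector<float> y(x.size());
    if (x.empty()) {
        return y;
    }

    // Pass 1: apply scale and mask, and track the row maximum.
    float max_val = -INFINITY;
    for (size_t i = 0; i < x.size(); ++i) {
        y[i] = x[i] * scale + (mask ? mask[i] : 0.0f);
        max_val = std::max(max_val, y[i]);
    }

    // Pass 2: exponentiate relative to the maximum so exp() cannot overflow.
    float sum = 0.0f;
    for (float & v : y) {
        v = std::exp(v - max_val);
        sum += v;
    }

    // Pass 3: normalize so the row sums to 1.
    for (float & v : y) {
        v /= sum;
    }
    return y;
}
```

Subtracting the row maximum before calling `exp` leaves the result mathematically unchanged but keeps large inputs from overflowing. A GPU version would usually replace the two sequential reductions (maximum and sum) with per-block parallel reductions, which is where a block size cap such as `CUDA_SOFT_MAX_BLOCK_SIZE` would come into play.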