Blame · llm/llama.cpp/ggml-cuda/softmax.cuh · ecd2f176277db4f074e25a2c3646b04b51cec119 · OpenDAS / ollama · GitLab

softmax.cuh

142 Bytes

v1 · mashun1 committed Jun 26, 2024

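#include "common.cuh"

// Upper bound on the number of threads per block for the soft max kernel launch.
#define CUDA_SOFT_MAX_BLOCK_SIZE 1024

// Runs the soft max op for dst on the CUDA backend; inputs are read from dst->src.
void ggml_cuda_op_soft_max(ggml_backend_cuda_context & ctx, ggml_tensor * dst);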