You need to sign in or sign up before continuing.
Commit 3f1166ab authored by zhuwenwen's avatar zhuwenwen
Browse files

use half atomicAdd of dtk24.04

parent f0647bb8
......@@ -11,7 +11,9 @@ Adapted from https://github.com/turboderp/exllamav2 and https://github.com/qwopq
#include <cuda_runtime.h>
#include <cuda_fp16.h>
#ifndef USE_ROCM
#include "compat.cuh"
#endif
#include "matrix_view.cuh"
#include "qdq_2.cuh"
#include "qdq_3.cuh"
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment