Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
norm
vllm
Commits
3f1166ab
Commit
3f1166ab
authored
Mar 23, 2024
by
zhuwenwen
Browse files
use half atomicAdd of dtk24.04
parent
f0647bb8
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
0 deletions
+2
-0
csrc/quantization/gptq/q_gemm.cu
csrc/quantization/gptq/q_gemm.cu
+2
-0
No files found.
csrc/quantization/gptq/q_gemm.cu
View file @
3f1166ab
...
@@ -11,7 +11,9 @@ Adapted from https://github.com/turboderp/exllamav2 and https://github.com/qwopq
...
@@ -11,7 +11,9 @@ Adapted from https://github.com/turboderp/exllamav2 and https://github.com/qwopq
#include <cuda_runtime.h>
#include <cuda_runtime.h>
#include <cuda_fp16.h>
#include <cuda_fp16.h>
#ifndef USE_ROCM
#include "compat.cuh"
#include "compat.cuh"
#endif
#include "matrix_view.cuh"
#include "matrix_view.cuh"
#include "qdq_2.cuh"
#include "qdq_2.cuh"
#include "qdq_3.cuh"
#include "qdq_3.cuh"
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment