Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ox696c
ktransformers
Commits
ae76a729
Commit
ae76a729
authored
Feb 13, 2025
by
fxzjshm
Browse files
gptq_marlin: temporarily disable on AMD ROCm
Signed-off-by:
fxzjshm
<
fxzjshm@163.com
>
parent
4cda4543
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
7 additions
and
2 deletions
+7
-2
ktransformers/ktransformers_ext/cuda/gptq_marlin/gptq_marlin.cu
...formers/ktransformers_ext/cuda/gptq_marlin/gptq_marlin.cu
+1
-1
ktransformers/ktransformers_ext/cuda/gptq_marlin/gptq_marlin.cuh
...ormers/ktransformers_ext/cuda/gptq_marlin/gptq_marlin.cuh
+1
-1
ktransformers/ktransformers_ext/cuda/gptq_marlin/gptq_marlin_dtypes.cuh
...ktransformers_ext/cuda/gptq_marlin/gptq_marlin_dtypes.cuh
+5
-0
No files found.
ktransformers/ktransformers_ext/cuda/gptq_marlin/gptq_marlin.cu
View file @
ae76a729
...
...
@@ -36,7 +36,7 @@ inline std::string str(T x) {
namespace
gptq_marlin
{
#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ < 800
#if
(
defined(__CUDA_ARCH__) && __CUDA_ARCH__ < 800
) || defined(__HIP_PLATFORM_AMD__)
__global__
void
permute_cols_kernel
(
int4
const
*
__restrict__
a_int4_ptr
,
int
const
*
__restrict__
perm_int_ptr
,
...
...
ktransformers/ktransformers_ext/cuda/gptq_marlin/gptq_marlin.cuh
View file @
ae76a729
...
...
@@ -39,7 +39,7 @@ using I4 = Vec<int, 4>;
constexpr
int
div_ceil
(
int
a
,
int
b
)
{
return
(
a
+
b
-
1
)
/
b
;
}
#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ < 800
#if
(
defined(__CUDA_ARCH__) && __CUDA_ARCH__ < 800
) || defined (__HIP_PLATFORM_AMD__)
// No support for async
#else
...
...
ktransformers/ktransformers_ext/cuda/gptq_marlin/gptq_marlin_dtypes.cuh
View file @
ae76a729
...
...
@@ -8,6 +8,11 @@
#include <cuda_fp16.h>
#include <cuda_bf16.h>
#ifdef __HIP_PLATFORM_AMD__
typedef
__hip_bfloat16
nv_bfloat16
;
typedef
__hip_bfloat162
nv_bfloat162
;
#endif
namespace
gptq_marlin
{
template
<
typename
scalar_t
>
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment