Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
230b131b
Unverified
Commit
230b131b
authored
Apr 05, 2025
by
Isotr0py
Committed by
GitHub
Apr 04, 2025
Browse files
[Bugfix][kernels] Fix half2float conversion in gguf kernels (#15995)
Signed-off-by:
Isotr0py
<
2037008807@qq.com
>
parent
0812d8dd
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
5 additions
and
0 deletions
+5
-0
csrc/quantization/gguf/ggml-common.h
csrc/quantization/gguf/ggml-common.h
+5
-0
No files found.
csrc/quantization/gguf/ggml-common.h
View file @
230b131b
...
@@ -1090,6 +1090,11 @@ __device__ __forceinline__ c10::BFloat16 convert_from_half<c10::BFloat16>(half v
...
@@ -1090,6 +1090,11 @@ __device__ __forceinline__ c10::BFloat16 convert_from_half<c10::BFloat16>(half v
#endif // defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 800
#endif // defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 800
}
}
template
<
>
__device__
__forceinline__
float
convert_from_half
<
float
>
(
half
val
)
{
return
__half2float
(
val
);
}
#if defined(USE_ROCM)
#if defined(USE_ROCM)
#ifndef __has_builtin
#ifndef __has_builtin
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment