OpenDAS / vllm_cscc
Commit ac04dd37 (Unverified)
Authored Feb 06, 2026 by R3hankhan
Committed by GitHub, Feb 06, 2026

[CPU] Add BF16 Kernel type for s390x (#33788)

Signed-off-by: Rehan Khan <Rehan.Khan7@ibm.com>

Parent: 035a6cb0
Showing 1 changed file with 9 additions and 0 deletions
csrc/cpu/mla_decode.cpp
...
@@ -38,6 +38,15 @@ struct KernelVecType<c10::BFloat16> {
  using qk_vec_type = vec_op::BF16Vec32;
  using v_load_vec_type = vec_op::BF16Vec16;
};
#elif defined(__s390x__)
template <>
struct KernelVecType<c10::BFloat16> {
  using qk_load_vec_type = vec_op::BF16Vec16;
  using qk_vec_type = vec_op::FP32Vec16;
  using v_load_vec_type = vec_op::BF16Vec16;
};
#elif defined(__aarch64__)
template <>
struct KernelVecType<c10::BFloat16> {
...
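
The hunk above adds one more architecture-specific specialization of a traits struct. Going by the type names, on s390x the QK operands are loaded as 16-lane BF16 vectors and the QK computation uses a 16-lane FP32 vector type. The following is a minimal, self-contained sketch of that selection pattern, not the vLLM code itself: `vec_op_sketch`, `BFloat16Tag`, `KernelVecTypeSketch`, `kElems`, and `qk_lanes` are hypothetical stand-ins invented for illustration, and the non-s390x branch is an arbitrary fallback chosen only so the sketch compiles on any host.

```cpp
// Hypothetical stand-ins for the vec_op vector wrappers. The real types
// live in vLLM's CPU headers and are not reproduced here.
namespace vec_op_sketch {
struct BF16Vec16 { static constexpr int kElems = 16; };
struct BF16Vec32 { static constexpr int kElems = 32; };
struct FP32Vec16 { static constexpr int kElems = 16; };
}  // namespace vec_op_sketch

// Stand-in for c10::BFloat16 (assumption: only used as a tag here).
struct BFloat16Tag {};

// Primary template is declared but not defined, so using an unsupported
// scalar type fails at compile time instead of silently picking a default.
template <typename scalar_t>
struct KernelVecTypeSketch;

#if defined(__s390x__)
// Mirrors the aliases added in the hunk: BF16 loads, FP32 compute type.
template <>
struct KernelVecTypeSketch<BFloat16Tag> {
  using qk_load_vec_type = vec_op_sketch::BF16Vec16;
  using qk_vec_type = vec_op_sketch::FP32Vec16;
  using v_load_vec_type = vec_op_sketch::BF16Vec16;
};
#else
// Sketch-only fallback for other hosts; not taken from the source.
template <>
struct KernelVecTypeSketch<BFloat16Tag> {
  using qk_load_vec_type = vec_op_sketch::BF16Vec32;
  using qk_vec_type = vec_op_sketch::BF16Vec32;
  using v_load_vec_type = vec_op_sketch::BF16Vec16;
};
#endif

// Kernel code can stay architecture-agnostic by asking the traits struct
// for its types; this helper reports the lane count of the QK compute type.
template <typename scalar_t>
constexpr int qk_lanes() {
  return KernelVecTypeSketch<scalar_t>::qk_vec_type::kElems;
}

// In this sketch, the load and compute vectors cover the same number of
// lanes on every branch, so one load feeds exactly one compute vector.
static_assert(
    qk_lanes<BFloat16Tag>() ==
        KernelVecTypeSketch<BFloat16Tag>::qk_load_vec_type::kElems,
    "QK load and compute vectors should have matching lane counts");
```

A kernel templated on `scalar_t` would then refer only to `KernelVecTypeSketch<scalar_t>::qk_vec_type` and friends, so supporting another architecture would mean adding one more `#elif` branch rather than touching the kernel body.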