Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
fengzch-das
nunchaku
Commits
63913f29
Commit
63913f29
authored
Mar 28, 2025
by
LeeDongYeun
Committed by
Zhekai Zhang
Apr 01, 2025
Browse files
fix shape in GEMM W8A8
parent
af6b1a3c
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
3 deletions
+3
-3
src/Linear.cpp
src/Linear.cpp
+3
-3
No files found.
src/Linear.cpp
View file @
63913f29
...
...
@@ -451,9 +451,9 @@ GEMM_W8A8::QuantizedActivation GEMM_W8A8::quantize(Tensor x, bool fuse_glu) {
}
Tensor
GEMM_W8A8
::
forward_quant
(
QuantizedActivation
qact
)
{
auto
o
shape
=
qact
.
act
.
shape
;
o
shape
[
-
1
]
=
out_features
;
Tensor
out
=
Tensor
::
allocate
(
o
shape
,
this
->
dtype
,
qact
.
act
.
device
());
auto
shape
=
TensorShape
(
qact
.
act
.
shape
.
dataExtent
)
;
shape
[
-
1
]
=
out_features
;
Tensor
out
=
Tensor
::
allocate
(
shape
,
this
->
dtype
,
qact
.
act
.
device
());
kernels
::
gemm_w8a8
(
qact
.
act
,
this
->
qweight
,
out
,
qact
.
ascales
,
this
->
wscales
,
this
->
bias
);
debug
(
"gemm.out"
,
out
);
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment