change / sglang
Commit 0f04a5f4

Optimize cutlass int8 gemm kernel for large M on SM89 Ada GPU (#10714)

Authored Sep 22, 2025 by Qi Yuhang
Committed by GitHub, Sep 21, 2025
Parent: 2f18602f
Showing 1 changed file with 2 additions and 2 deletions

sgl-kernel/csrc/gemm/int8_gemm_kernel.cu (+2 -2)
@@ -409,8 +409,8 @@ void sm89_dispatch_shape(
       cutlass_int8_scaled_mm<
           ElementOutput,
           ArchTag,
-          cutlass::gemm::GemmShape<32, 64, 128>,
-          cutlass::gemm::GemmShape<16, 64, 64>,
+          cutlass::gemm::GemmShape<128, 128, 64>,
+          cutlass::gemm::GemmShape<64, 64, 64>,
           InstructionShape,
           5>(out, mat_a, mat_b, scales_a, scales_b, bias);
     }
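
To make the effect of the two changed template arguments easier to see, here is a rough sketch of the tile arithmetic. It assumes, following the usual CUTLASS 2.x argument order, that the first GemmShape is the threadblock tile and the second is the warp tile, and that the trailing 5 is the number of pipeline stages; the diff itself does not name these parameters. TileShape, warps_per_block and operand_smem_bytes are hypothetical helpers written for this illustration, not part of CUTLASS or sgl-kernel. The shared-memory figure is only an estimate for the int8 A and B operand tiles (1 byte per element) and ignores padding and epilogue storage.

```cpp
#include <cstddef>

// Hypothetical stand-in for cutlass::gemm::GemmShape<M, N, K>.
// It is NOT the CUTLASS type; it only carries the three tile extents.
template <int M, int N, int K>
struct TileShape {
  static constexpr int kM = M;
  static constexpr int kN = N;
  static constexpr int kK = K;
};

// Number of warp tiles that fit into one threadblock tile
// (product of the per-dimension ratios).
template <typename Threadblock, typename Warp>
constexpr int warps_per_block() {
  static_assert(Threadblock::kM % Warp::kM == 0 &&
                Threadblock::kN % Warp::kN == 0 &&
                Threadblock::kK % Warp::kK == 0,
                "warp tile must evenly divide the threadblock tile");
  return (Threadblock::kM / Warp::kM) *
         (Threadblock::kN / Warp::kN) *
         (Threadblock::kK / Warp::kK);
}

// Estimated bytes of shared memory for the A (M x K) and B (K x N)
// operand tiles over all pipeline stages, assuming 1-byte int8 elements.
template <typename Threadblock>
constexpr std::size_t operand_smem_bytes(int stages) {
  return static_cast<std::size_t>(stages) *
         (Threadblock::kM * Threadblock::kK +
          Threadblock::kN * Threadblock::kK);
}

// Output elements (M x N) computed by one threadblock.
template <typename Threadblock>
constexpr int output_tile_elems() {
  return Threadblock::kM * Threadblock::kN;
}

// Shapes taken from the removed (-) and added (+) lines of the diff.
using OldThreadblock = TileShape<32, 64, 128>;
using OldWarp        = TileShape<16, 64, 64>;
using NewThreadblock = TileShape<128, 128, 64>;
using NewWarp        = TileShape<64, 64, 64>;
```

Under these assumptions both configurations use four warps per threadblock, but the new threadblock tile covers 128 x 128 output elements instead of 32 x 64, so each block produces eight times as much output per launch of its main loop. The estimated operand footprint across five stages grows from 61440 to 81920 bytes.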