Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
zhaoyu6
sglang
Commits
50f28f65
"test/vscode:/vscode.git/clone" did not exist on "be6ecff6f4ce5e2905055cf05ffe620758510748"
Unverified
Commit
50f28f65
authored
Mar 02, 2025
by
Xiaoyu Zhang
Committed by
GitHub
Mar 02, 2025
Browse files
fix typo in deep gemm benchmarking(#3991)
parent
90a55e25
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
0 additions
and
1 deletion
+0
-1
benchmark/kernels/deepseek/benchmark_deepgemm_fp8_gemm.py
benchmark/kernels/deepseek/benchmark_deepgemm_fp8_gemm.py
+0
-1
No files found.
benchmark/kernels/deepseek/benchmark_deepgemm_fp8_gemm.py
View file @
50f28f65
...
...
@@ -211,7 +211,6 @@ def get_benchmark(tp_size):
x
=
torch
.
randn
((
m
,
k
),
device
=
"cuda"
,
dtype
=
torch
.
bfloat16
)
y
=
torch
.
randn
((
n
,
k
),
device
=
"cuda"
,
dtype
=
torch
.
bfloat16
)
# 预处理数据,在计时之前完成
x_fp8
,
x_scale
=
per_token_cast_to_fp8
(
x
)
y_fp8
,
y_scale
=
per_block_cast_to_fp8
(
y
)
x_scale_col_major
=
get_col_major_tma_aligned_tensor
(
x_scale
.
clone
())
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment