Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
composable_kernel
Commits
f18ac016
Commit
f18ac016
authored
Jul 01, 2022
by
Chao Liu
Browse files
refactor
parent
0dcb3496
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
6 additions
and
6 deletions
+6
-6
example/03_gemm_bias_relu/CMakeLists.txt
example/03_gemm_bias_relu/CMakeLists.txt
+1
-1
example/03_gemm_bias_relu/README.md
example/03_gemm_bias_relu/README.md
+5
-5
example/03_gemm_bias_relu/gemm_bias_relu_xdl_fp16.cpp
example/03_gemm_bias_relu/gemm_bias_relu_xdl_fp16.cpp
+0
-0
No files found.
example/03_gemm_bias_relu/CMakeLists.txt
View file @
f18ac016
add_example_executable
(
example_gemm_
xdl_
bias_relu gemm_xdl_bias_relu.cpp
)
add_example_executable
(
example_gemm_bias_relu
_xdl_fp16
gemm_xdl_bias_relu
_xdl_fp16
.cpp
)
example/03_gemm_bias_relu/README.md
View file @
f18ac016
# Instructions for ```example_gemm_
xdl_
bias_relu_
add
```
# Instructions for ```example_gemm_bias_relu_
xdl_fp16
```
## Run ```example_gemm_
xdl_
bias_relu_
add
```
## Run ```example_gemm_bias_relu_
xdl_fp16
```
```
bash
```
bash
#arg1: verification (0=no, 1=yes)
#arg1: verification (0=no, 1=yes)
#arg2: initialization (0=no init, 1=integer value, 2=decimal value)
#arg2: initialization (0=no init, 1=integer value, 2=decimal value)
#arg3:
run
kernel
# of times (>1
)
#arg3:
time
kernel
(0=no, 1=yes
)
#arg4 to 9: M (256x), N(128x), K(32x), StrideA, StrideB, Stride
C
#arg4 to 9: M (256x), N(128x), K(32x), StrideA, StrideB, Stride
E
./bin/example_gemm_
xdl_
bias_relu_
add 0
1
5
3840 4096 4096 4096 4096 4096
./bin/example_gemm_bias_relu_
xdl_fp16 1
1
1
3840 4096 4096 4096 4096 4096
```
```
Result (MI100 @ 1087Mhz, 133.5TFlops peak FP16)
Result (MI100 @ 1087Mhz, 133.5TFlops peak FP16)
...
...
example/03_gemm_bias_relu/gemm_
xdl_
bias_relu.cpp
→
example/03_gemm_bias_relu/gemm_bias_relu
_xdl_fp16
.cpp
View file @
f18ac016
File moved
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment