yangql
composable_kernel-1
Commits
87d8740bf5d8030f9e4e54c9b7e64f353a6f944e
composable_kernel-1
src
include
blockwise_batched_gemm.hip.hpp
16 Apr, 2019
1 commit
refactor ConstantTensorDescriptor and functional
· 17f3d2d4
Chao Liu
authored
Apr 16, 2019
17f3d2d4
13 Apr, 2019
1 commit
implicit gemm v1r2: only load 1d filter
· 00899f19
Chao Liu
authored
Apr 13, 2019
00899f19
10 Apr, 2019
3 commits
tuned implicit gemm v1 for 3x3 on AMD to 82%. Fixed a bug in 4d tensor blockwise copy.
· 96ee9571
Chao Liu
authored
Apr 10, 2019
96ee9571
update flops calculation
· edc89778
Chao Liu
authored
Apr 10, 2019
edc89778
simplify blockwise batched GEMM
· 5696c81f
Chao Liu
authored
Apr 10, 2019
5696c81f
09 Apr, 2019
1 commit
refactor
· 1bd880a6
Chao Liu
authored
Apr 09, 2019
1bd880a6
08 Apr, 2019
3 commits
add more assertion
· c075d3f7
Chao Liu
authored
Apr 08, 2019
c075d3f7
tidy up
· 268d1c71
Chao Liu
authored
Apr 08, 2019
268d1c71
debugging implicit gemm v1: use 10d tensor output
· c9fa46af
Chao Liu
authored
Apr 08, 2019
c9fa46af
07 Apr, 2019
1 commit
refactor
· b57d60c0
Chao Liu
authored
Apr 06, 2019
b57d60c0
02 Apr, 2019
1 commit
cleaning up dead code
· bdbc0eaa
Chao Liu
authored
Apr 02, 2019
bdbc0eaa
24 Mar, 2019
2 commits
experimenting
· 766b0a9e
Chao Liu
authored
Mar 24, 2019
766b0a9e
experimenting
· f35c64eb
Chao Liu
authored
Mar 23, 2019
f35c64eb