Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
composable_kernel
Repository
Branches
Overview
Active
Stale
All
grouped_gemm_multi_abd_fixed_nk_example
b92fe98f
·
Update include/ck/tensor_operation/gpu/device/device_grouped_gemm_multi_abd_fixed_nk.hpp
·
Nov 21, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
bwroblew/direct_loads
820fc546
·
Review: small fix
·
Nov 21, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
bhalf-dotproduct
a0d36aea
·
delegate bhalf2 case to bhalf inner product
·
Nov 18, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
lwpck-1038
ec2fbe1f
·
Merge branch 'develop' into lwpck-1038
·
Nov 18, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
docs/6.0.0
e8cddfdc
·
Improve 4k gemm perf (#1047)
·
Nov 17, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
revert_984
f6b8f18f
·
Revert "Transpose 3d (#984)"
·
Nov 17, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
improve_4k_gemm
34bda446
·
format
·
Nov 17, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
fix_build_231115
d36a895d
·
remove unsed profile_transpose.cpp
·
Nov 16, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
lwpck-1033
41772004
·
Merge branch 'develop' into lwpck-1033
·
Nov 16, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
mha-train-develop-d0param
635ea6a0
·
Merge branch 'mha-train-develop' into mha-train-develop-d0param
·
Nov 15, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
barkocot/fix-Filter1x1Pad0-check
ad6109c0
·
Fix check for conv Fwd Filter1x1Pad0
·
Nov 15, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
barkocot/fix-instance-fwd-xdl-log
1e63ac61
·
Log CDEBlockTransferScalarPerVector_NPerBlock in conv fwd multiD xdl
·
Nov 15, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
barkocot/depracate-multid
9c2589da
·
Change doxygen deprecated to note to avoid warnings
·
Nov 12, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
bf16_more_instance
59f026f7
·
reduce the gemm input values to prevent round-off errors
·
Nov 11, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
host-lib
c4865a1d
·
fixed
·
Nov 10, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
lwpck-987
524143e4
·
Merge branch 'develop' into lwpck-987
·
Nov 09, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
barkocot/grouped-conv-fwd-multiple-ab
2ca963c9
·
Improve multi_ab interface test
·
Nov 09, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
release/rocm-rel-6.0
52b0bffe
·
Support fp64 contraction on gfx94x. (#1029)
·
Nov 09, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
release/rocm-rel-6.0-staging
52b0bffe
·
Support fp64 contraction on gfx94x. (#1029)
·
Nov 09, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
lwpck-1030
6ba844e3
·
add linker script to QA builds
·
Nov 09, 2023
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
Prev
1
2
3
4
5
6
7
…
31
Next