Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
composable_kernel_ROCM
Repository
Branches
"git@developer.sourcefind.cn:modelzoo/resnet50_tensorflow.git" did not exist on "c059e0b29b45b4ee22f11253306345a3b831bdc6"
Overview
Active
Stale
All
merge_from_internal
ef82ed14
·
Merge branch 'develop' into merge_from_internal
·
Feb 05, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
ck_tile/vectorize_transpose
c53edaad
·
Code Cleanup
·
Feb 04, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
codegen_hiprtc
db9e5d6e
·
Merge branch 'develop' into codegen_hiprtc
·
Jan 31, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
bharriso/revert-off-load-compress
251dd9ce
·
Disable offload-compress
·
Jan 28, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
ck_tile/agmem_bgmem_creg_v2
49316982
·
get tback the restrict variable name, need to switch out to solve the transpose issue
·
Jan 28, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
docs/6.3.2
35a3f2eb
·
generate requirements.txt from .in file
·
Jan 27, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
mock_moesorting_disable
3335cd2f
·
disable mock moe sorting
·
Jan 27, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
aosewski/ck_tile_gemm_policy
43f07fcf
·
Merge branch 'develop' into aosewski/ck_tile_gemm_policy
·
Jan 27, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
amd/dev/jruan/rms_norm_welford
018e939f
·
Modify tests and bug fix
·
Jan 26, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
ck_tile/improve_async_pipeline_2
1f15fbea
·
Re-format
·
Jan 26, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
ck_tile/gemm_opt_pr
e5068768
·
Fix the perf issue with the better perf than compute pipeline
·
Jan 25, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
warp_specialized_scheduling
80e89ebd
·
minimum reproducable example for warpspecialized scheduling
·
Jan 24, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
device_gemm_multiple_d_xdl_cshuffle_v3_b_preshuffle
merged
64d5c4d6
·
Implement fp8 quant for layernorm and rmsnorm (#1814)
·
Jan 24, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
ck_tile/fa_bwd_v3_dp
92494a8a
·
separate hd pad/unpad kernels
·
Jan 24, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
cka8w8_uc_newpipe
add0b222
·
clean the code
·
Jan 23, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
flexible-fmha-functors
91e1a796
·
move elementwise functors
·
Jan 23, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
vpietila/ggemm-profiling
a932d77f
·
Build fixes.
·
Jan 21, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
lwpck-2719
e34e6308
·
fix compiler error
·
Jan 21, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
muozturk_bf16_sk_padding_fix_debug
42e14c7c
·
adding instance which has CVector=2
·
Jan 20, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
refactor_file_structure
f8107760
·
[CK_TILE] Update the file structure part2
·
Jan 19, 2025
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
Prev
1
2
3
4
5
6
7
8
…
14
Next