Skip to content

GitLab

  • Menu
Projects Groups Snippets
    • Loading...
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • C composable_kernel_ROCM
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 0
    • Issues 0
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Packages & Registries
    • Packages & Registries
    • Package Registry
    • Infrastructure Registry
  • Analytics
    • Analytics
    • CI/CD
    • Repository
    • Value stream
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • gaoqiong
  • composable_kernel_ROCM
  • Repository
  • Branches

  • Overview
  • Active
  • Stale
  • All
  • vpietila/ggemm-profiling
    a932d77f · Build fixes. · Jan 21, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • flexible-fmha-functors
    91e1a796 · move elementwise functors · Jan 23, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • cka8w8_uc_newpipe
    add0b222 · clean the code · Jan 23, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • ck_tile/fa_bwd_v3_dp
    92494a8a · separate hd pad/unpad kernels · Jan 24, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • device_gemm_multiple_d_xdl_cshuffle_v3_b_preshuffle merged
    64d5c4d6 · Implement fp8 quant for layernorm and rmsnorm (#1814) · Jan 24, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • warp_specialized_scheduling
    80e89ebd · minimum reproducable example for warpspecialized scheduling · Jan 24, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • ck_tile/gemm_opt_pr
    e5068768 · Fix the perf issue with the better perf than compute pipeline · Jan 25, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • ck_tile/improve_async_pipeline_2
    1f15fbea · Re-format · Jan 26, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • amd/dev/jruan/rms_norm_welford
    018e939f · Modify tests and bug fix · Jan 26, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • aosewski/ck_tile_gemm_policy
    43f07fcf · Merge branch 'develop' into aosewski/ck_tile_gemm_policy · Jan 27, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • mock_moesorting_disable
    3335cd2f · disable mock moe sorting · Jan 27, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • docs/6.3.2
    35a3f2eb · generate requirements.txt from .in file · Jan 27, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • ck_tile/agmem_bgmem_creg_v2
    49316982 · get tback the restrict variable name, need to switch out to solve the transpose issue · Jan 28, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • bharriso/revert-off-load-compress
    251dd9ce · Disable offload-compress · Jan 28, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • codegen_hiprtc
    db9e5d6e · Merge branch 'develop' into codegen_hiprtc · Jan 31, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • ck_tile/vectorize_transpose
    c53edaad · Code Cleanup · Feb 04, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • merge_from_internal
    ef82ed14 · Merge branch 'develop' into merge_from_internal · Feb 05, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • lwpck-2809
    fbef1cfc · restore the root access for CI dockers · Feb 05, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • rmsnorm_fp8
    c1a0a8c9 · clang format · Feb 05, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • lwpck-2824
    ad0d4991 · restore cron trigger · Feb 06, 2025
    Compare
    Download source code
    zip tar.gz tar.bz2 tar
  • Prev
  • 1
  • …
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • Next