- 23 Dec, 2024 7 commits
-
-
Po Yen Chen authored
-
Po Yen Chen authored
Merge branch 'feature/fmha-fwd-async-splitkv' into feature/support-vllm-kcache-layout-add-splitkv-instance
-
Po Yen Chen authored
-
Po Yen Chen authored
Merge branch 'feature/add-splitkv-instance' into feature/support-vllm-kcache-layout-add-splitkv-instance
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
- 20 Dec, 2024 2 commits
-
-
Po Yen Chen authored
-
Po Yen Chen authored
Merge branch 'feature/fmha-fwd-async-splitkv' into feature/support-vllm-kcache-layout-add-splitkv-instance
-
- 19 Dec, 2024 8 commits
-
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
Merge branch 'feature/add-splitkv-instance' into feature/support-vllm-kcache-layout-add-splitkv-instance
-
- 18 Dec, 2024 4 commits
-
-
aledudek authored
* Gemm Kernel Refactor part1 * Gemm Kernel Refactor common gemm pipeline part2 * [CK TILE] Refactor batched gemm to reuse GemmKernel * [CK TILE] Refactor GemmKernel - review changes part1 * [CK TILE] Refactor GemmKernel - references fix * [CK TILE] Refactor GemmKernel - naming changes, add problem * [CK_TILE] Refactor GemmKernel - update tests * [CK_TILE] Refactor GemmKernel - review changes * [CK_TILE] Refactor GemmKernel - update test * [CK_TILE] Refactor GemmKernel - constness fixes * [CK_TILE] Refactor GemmKernel - update tests
-
Xiaodong Wang authored
Adding namespace to disambiguate with std::bit_cast Co-authored-by:Po Yen Chen <PoYen.Chen@amd.com>
-
aledudek authored
* [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm * [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm - review changes * [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm - review fix
-
Po Yen Chen authored
-
- 17 Dec, 2024 19 commits
-
-
Harisankar Sadasivan authored
* updated fp16 instances to be on parity with universal gemm instances * corrected instance name to streamk instance
-
Illia Silin authored
* pass the build flags to config.h * fix clang format
-
Max Podkorytov authored
-
Po Yen Chen authored
-
dependabot[bot] authored
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.11.0 to 1.12.0. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.11.0...v1.12.0 ) --- updated-dependencies: - dependency-name: rocm-docs-core dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Po Yen Chen authored
-
Po Yen Chen authored
-
jakpiase authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-