- 31 Dec, 2024 1 commit
-
-
Po Yen, Chen authored
-
- 29 Dec, 2024 11 commits
-
-
Po Yen Chen authored
-
Po Yen Chen authored
This reverts commit 658350b3.
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
This reverts commit 09486ebf.
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
- 24 Dec, 2024 3 commits
-
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
- 23 Dec, 2024 7 commits
-
-
Po Yen Chen authored
-
Po Yen Chen authored
Merge branch 'feature/fmha-fwd-async-splitkv' into feature/support-vllm-kcache-layout-add-splitkv-instance
-
Po Yen Chen authored
-
Po Yen Chen authored
Merge branch 'feature/add-splitkv-instance' into feature/support-vllm-kcache-layout-add-splitkv-instance
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
- 20 Dec, 2024 2 commits
-
-
Po Yen Chen authored
-
Po Yen Chen authored
Merge branch 'feature/fmha-fwd-async-splitkv' into feature/support-vllm-kcache-layout-add-splitkv-instance
-
- 19 Dec, 2024 8 commits
-
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
-
Po Yen Chen authored
Merge branch 'feature/add-splitkv-instance' into feature/support-vllm-kcache-layout-add-splitkv-instance
-
- 18 Dec, 2024 4 commits
-
-
aledudek authored
* Gemm Kernel Refactor part1 * Gemm Kernel Refactor common gemm pipeline part2 * [CK TILE] Refactor batched gemm to reuse GemmKernel * [CK TILE] Refactor GemmKernel - review changes part1 * [CK TILE] Refactor GemmKernel - references fix * [CK TILE] Refactor GemmKernel - naming changes, add problem * [CK_TILE] Refactor GemmKernel - update tests * [CK_TILE] Refactor GemmKernel - review changes * [CK_TILE] Refactor GemmKernel - update test * [CK_TILE] Refactor GemmKernel - constness fixes * [CK_TILE] Refactor GemmKernel - update tests
-
Xiaodong Wang authored
Adding namespace to disambiguate with std::bit_cast Co-authored-by:Po Yen Chen <PoYen.Chen@amd.com>
-
aledudek authored
* [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm * [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm - review changes * [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm - review fix
-
Po Yen Chen authored
-
- 17 Dec, 2024 4 commits
-
-
Harisankar Sadasivan authored
* updated fp16 instances to be on parity with universal gemm instances * corrected instance name to streamk instance
-
Illia Silin authored
* pass the build flags to config.h * fix clang format
-
Max Podkorytov authored
-
Po Yen Chen authored
-