- 06 Feb, 2025 1 commit
-
-
Aviral Goel authored
-
- 04 Feb, 2025 9 commits
-
-
Aviral Goel authored
-
Illia Silin authored
* enable batched_gemm_softmax_gemm_perm_wmma for gfx12
* disable instances with blocksize=256 in attention examples
* debugging
* debug
* fixed lds_enabled
* debugging
* Fix and add limit to skiplds feature
* Enable skipLds feature and fix compilation bugs
* add ck_tile definitions for gfx12
* fix clang format and test/wmma_op
* update instances cmake for gfx12
* disable the test_wmma_op on gfx12
* fix the builds for gfx950
* add gfx12 and gfx950 to default target list
* clean-up cmake file
* Initial introduction of OFP8 data types.
* Renamed FP8 and BF8 tests into FP8_FNUZ and BF8_FNUZ.
* Implementation of ConvertFP32Nearest in test_fp8_ocp.
* Remove dependence on possibly undeclared alias.
* Implement FP8OCP test for stochastic rounding mode.
* Implement FP8OCP tests for half_t type conversions.
* enable bf16 atomic add on gfx950
* Implement ConvertFP32Nearest test.
* Implement ConvertFP32Stochastic test.
...
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
- 03 Feb, 2025 5 commits
-
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
- 31 Jan, 2025 1 commit
-
-
Illia Silin authored
* turn on the ck_tile gemm tests by default
* enable ck_tile gemms CI build by default
-
- 30 Jan, 2025 1 commit
-
-
Illia Silin authored
-
- 15 Jan, 2025 1 commit
-
-
Illia Silin authored
-
- 13 Jan, 2025 1 commit
-
-
Max Podkorytov authored
* add unit test for gen instances for gemms
* add unit tests for conv and batched gemms
* add unit test for preselected gemm instances
* apply ruff lint
* add license header for the unit test
* add inductor pytest to CI
* verbose pip install
* switch the directory before installing python packages
* move the inductor codegen test
* try yet another workdir
* Update Jenkinsfile
* The directory looks right, fixing pip module not found by invoking pip directly
* Update Jenkinsfile
* invoke pytest directly since the module is not found
* Update Dockerfile
* Install setuptools
* update package structure
* bump setuptools
* maybe fix data path for library sources
* fix library search path for conv instances
* fix path in pyproject definition
* compare path used in gen_instances with one in pyproject.toml; fix the difference
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
-
- 10 Jan, 2025 1 commit
-
-
Thomas Ning authored
* Finished adding the performance benchmark for ck tile gemm
* Fix the executable rename problem
* fix the executable name error
* delete the unsupported layout combinations
* Update run_full_test.sh
* Update benchmark_mem_pipeline.sh
* Update benchmark_basic.sh
* change the executable of gemm_universal
* change ck_tile_gemm script permissions
* Addressed the comment
* Addressed the comment
* Fixed the comments
* Fixed Comment
* roll back the malfunctioned change
* Fix the Typo
* finalize the tile_gemm_fp16 performance monitoring
* fix the stash names for ck_tile gemm logs
* change the stashing logic
* change stashing syntax
---------
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
Co-authored-by: illsilin <Illia.Silin@amd.com>
-
- 16 Dec, 2024 1 commit
-
-
Illia Silin authored
* upgrade sqlalchemy version
* replace the connection with engine in to_sql call
* change the hipTensor ctest syntax
-
- 10 Dec, 2024 1 commit
-
-
Illia Silin authored
* upgrade to ubuntu 22.04
* try adding -u root docker options for ubuntu 22
-
- 09 Dec, 2024 2 commits
-
-
Illia Silin authored
-
Illia Silin authored
-
- 06 Dec, 2024 2 commits
-
-
Illia Silin authored
* merge the build and performance tests CI stages together
* add gemm performance test on gfx11/gfx12
* add suffixes to distinguish gemm performance logs from different archs
* use smaller gemm set in CI for gfx10/gfx11/gfx12
* disable performance tests on gfx1030
* fix the stashing logic
* fix finding python3 for mha instances
-
Illia Silin authored
* upgrade to rocm6.3 compiler
* Proposed solution to convnd test failures in ROCm 6.3
---------
Co-authored-by: Andriy Roshchenko <andriy.roshchenko@amd.com>
-
- 28 Nov, 2024 1 commit
-
-
Illia Silin authored
* refactor docker build in CI
* add Dockerfile.compiler
* add input args to Dockerfile.compiler
* rearrange the docker args
-
- 26 Nov, 2024 1 commit
-
-
Illia Silin authored
-
- 23 Nov, 2024 1 commit
-
-
Illia Silin authored
-
- 05 Nov, 2024 1 commit
-
-
Illia Silin authored
-
- 15 Oct, 2024 1 commit
-
-
Paul Fultz II authored
* Build codegen as standalone
* Add exception for device tests
* Use local filesystem header
* add a codegen test CI stage and daily build
---------
Co-authored-by: illsilin <Illia.Silin@amd.com>
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
-
- 11 Oct, 2024 1 commit
-
-
Illia Silin authored
-
- 09 Oct, 2024 1 commit
-
-
Illia Silin authored
-
- 07 Oct, 2024 1 commit
-
-
Illia Silin authored
* update build logic with GPU_ARCHS
* fix the GPU_ARCHS build for codegen
* unset GPU_TARGETS when GPU_ARCHS are set
-
- 01 Oct, 2024 1 commit
-
-
Illia Silin authored
-
- 25 Sep, 2024 1 commit
-
-
Illia Silin authored
-
- 23 Sep, 2024 1 commit
-
-
Illia Silin authored
* add an option to build CK with legacy dockers
* change the custom docker settings
* add environment variable for custom docker
* use a new variable for legacy docker name
* new way to pass docker names for legacy OS
* add legacy docker check in the Build_CK function
* change groovy syntax
* add a check for legacy docker in getDockerImage
* make sure the legacy docker name is not empty
* remove the dumb-init call
* disable the tests in legacy OS dockers
* disable tests in legacy dockers
* use a different way to disable tests in legacy dockers
* rearrange the CI stages for legacy OS
* use different way to disable tests in legacy dockers
* update LD_LIBRARY_PATH for legacy dockers and add cron job
* update LD_LIBRARY_PATH at docker launch
* change the syntax for setting LD_LIBRARY_PATH
-
- 17 Sep, 2024 1 commit
-
-
Illia Silin authored
* add image for rocm6.3_rc1
* fix deb package url
-
- 14 Sep, 2024 1 commit
-
-
Thomas Ning authored
* Finished the feature of gpu verification
* Add the ck_tile_gemm test in the CI/CD
* add the include of tensor_layout in reference_gemm
* Comment Addressed
* split ck_tile fmha and gemm tests into separate stages
* restructure the reference gemm
* restructure a new reference_gemm api that could read the device mem
---------
Co-authored-by: carlushuang <carlus.huang@amd.com>
Co-authored-by: illsilin <Illia.Silin@amd.com>
-
- 12 Sep, 2024 1 commit
-
-
Illia Silin authored
-