- 04 Feb, 2025 11 commits
-
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
to fix accuracy errors
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Max Podkorytov authored
-
Max Podkorytov authored
-
- 03 Feb, 2025 11 commits
-
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Aviral Goel authored
-
Max Podkorytov authored
-
Hashem Hashemi authored
* Add pre_softmax fnctor * remove stray define:wq * Move op out of pipeline, adds it to refnc --------- Co-authored-by:
root <root@splinter-126-wr-d1.aus.dcgpu> Co-authored-by:
Max Podkorytov <4273004+tenpercent@users.noreply.github.com>
-
- 01 Feb, 2025 1 commit
-
-
Ben Richard authored
* Honor BUILD_SHARED_LIBS * Add .so versioning when building shared libraries
-
- 31 Jan, 2025 9 commits
-
-
Max Podkorytov authored
-
Max Podkorytov authored
-
arai713 authored
* updating codegen build for MIOpen access: adding .cmake for codegen component * updating CMake * adding in header guards for some headers due to issues with hiprtc compilation in MIOpen * some more header guards * putting env file in header guard * cleaning up some includes * updated types file for hiprtc purposes * fixed types file: bit-wise/memcpy issue * updating multiple utility files to deal with standard header inclusion for hiprtc * added some more header guards in the utility files, replacing some standard header functionality * added some more header guards * fixing some conflicts in utility files, another round of header guards * fixing errors in data type file * resolved conflict errors in a few utility files * added header guards/replicated functionality in device files * resolved issues with standard headers in device files: device_base and device_grouped_conv_fwd_multiple_abd * resolved issues with standard ...
-
Illia Silin authored
-
Max Podkorytov authored
-
Max Podkorytov authored
-
Max Podkorytov authored
-
Max Podkorytov authored
-
Illia Silin authored
* turn on the ck_tile gemm tests by default * enable ck_tile gemms CI build by default
-
- 30 Jan, 2025 5 commits
-
-
Max Podkorytov authored
-
Adam Osewski authored
* Add spatially local tile partitioner * Use 1D Grid size & create partitioner object. * Docs & use 1D partitioner in example. * Clang format. * Change kernel grid size Now: X is the # of output C-tiles, Y is the batch count Z is the splitK * Formatting & more doc. * Clang format. * Fix batched gemm test. Use 1d partitioner. * Move condition. * FIx ctor. * clang-format. -
dependabot[bot] authored
Bumps [rocm-docs-core](https://github.com/ROCm/rocm-docs-core) from 1.14.1 to 1.15.0. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.14.1...v1.15.0 ) --- updated-dependencies: - dependency-name: rocm-docs-core dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Illia Silin authored
-
Bartłomiej Kocot authored
* [CK TILE] Implement cschuflle algorithm * Rebase * Vector store size fixes * fixes * Fixes * fixes * fmha fix * fixes * fixes of fixes
-
- 29 Jan, 2025 3 commits
-
-
Max Podkorytov authored
-
Max Podkorytov authored
-
Max Podkorytov authored
-