1. 07 Aug, 2024 2 commits
    • Illia Silin's avatar
      5df10432
    • Illia Silin's avatar
      Run CK_TILE FMHA benchmarks and collect the performance data. (#1447) · 12c1f68d
      Illia Silin authored
      * run ck_tile benchmarks after the smoke tests and store logs
      
      * change the path of fmha benchmark logs
      
      * change the way of stashig ck_tile fmha logs
      
      * prevent the errors in stages where no logs are generated
      
      * fix the ck_tile fmha log names and headers
      
      * generate the fmha performance logs in the root folder
      
      * change jenkins scrip arguments format
      
      * use exact file names for stashing
      
      * modify scripts to process FMHA performance results
      
      * unstash FMHA logs before parsing them
      12c1f68d
  2. 05 Aug, 2024 1 commit
  3. 01 Aug, 2024 1 commit
  4. 11 Jul, 2024 2 commits
    • Illia Silin's avatar
      Add CK_TILE tests to daily CI builds. (#1381) · 98a01bbc
      Illia Silin authored
      * add ck_tile tests to CI
      
      * build and run ck_tile tests on gfx90a and gfx942 in parallel
      
      * fix groovy syntax
      
      * turn ck_tile tests OFF by default
      
      * skip creating the build folder
      
      * build ck_tile examples with 64 threads
      
      * build ck_tile examples with cmake-ck-dev.sh script
      
      * add video group to docker on mi300
      
      * do not retry to rebuild the early CI stages
      
      * help prevent jenkins false failure
      
      * restore cron trigger
      98a01bbc
    • Illia Silin's avatar
      [Jenkins] restore cron jobs (#1380) · f914c228
      Illia Silin authored
      * test the cron trigger
      
      * fix the cron jobs
      
      * restore the list of cron jobs
      f914c228
  5. 04 Jul, 2024 1 commit
  6. 27 Jun, 2024 1 commit
  7. 03 Jun, 2024 1 commit
  8. 28 May, 2024 1 commit
  9. 10 May, 2024 1 commit
  10. 01 May, 2024 1 commit
  11. 30 Apr, 2024 1 commit
  12. 18 Apr, 2024 2 commits
  13. 14 Apr, 2024 1 commit
    • Haocong WANG's avatar
      [GEMM] Gemm universal device operation (#1154) · f83e9701
      Haocong WANG authored
      
      
      * Optimize GEMM on MI200/300:
      1. Add new blockwise gemm pipeline
      2. Add irregular splitk intances
      
      * clang format + typo fix
      
      * Fix a bug
      
      * initial commit
      
      * Add more instances to irregular splitk
      
      * blkgemm pipeline v1~4 prototype
      
      * Sanity Checked. Known issue:
      1. Poor performance of splitk
      2. Register spill on blkgemmpipeline v3
      
      * Sanity and Performance fix:
      1. fix a bug related to sanity in grouped b2c mapping
      2. fix a bug related to sanity and performance in splitk offset
      
      * Sanity and API update:
      1. Remove prefetch stage
      2. Fix valid check bug
      3, Add first gemm_universal instance into ckProfiler
      
      * Add NN instances for gemm universal
      
      * 1. Add NT instances for gemm_universal
      2. Fix a bug about Kpadding in gemm_universal
      
      * Fix a bug regarding padding Odd K number
      
      * remove kernel print
      
      * Fix KPadding bug...
      
      * Update safety check
      
      * another try to fix kpadding..
      
      * Sanity checked
      
      * new instances..
      
      * clang format+typo fix
      
      * remove clang format script's change
      
      * Add non-hotloop compile option
      
      * 1. Add fp16xfp8 example
      2. pull packed convert f8 from pr1150
      
      * Some miscs.. opt and fix
      
      * Add pipeline description docs
      
      * Split universal gemm instance library to cut profiler compiling time
      
      * uncomment cmakefile
      
      * Fix a bug caused by blockwise_gemm_pipe_v2
      
      * reduce default splitk to 1
      
      * Add 224x256x64 tile size
      
      * update, including:
      1. Experiment pipeline 5~7
      2. Optimization for pipeline 4
      3. Organized instance library
      
      * temp save
      
      * temp save
      
      * Permuted lds layout, sanity and function checked
      
      * clang format
      
      * Move OOB check from RunRead to RunWrite, for better software pipeline.
      TODO: agpr spill when NN layout
      
      * clangformat
      
      * A/B splitpipe scheduler for v3
      
      * Fix two bugs
      
      * bug fix
      
      * fix a bug in oob check
      
      * Example for mixed fp16_fp8 gemm
      
      * Clean experimental code blocks
      
      * Add mixed precision gemm into profiler
      
      * tempsave
      
      * optimize m/n major lds layout
      
      * Add RRR GEMM  mixed precision instances
      
      * Optimize f8 matrix transpose
      
      * Add test_gemm_universal
      
      * A/B spilt schedule for blkpip v5
      
      * Take ds_read2 into iglp scheduling scheme
      
      * format
      
      * fixed cmake
      
      * Add llvm-option into CI cmake flag
      
      ---------
      Co-authored-by: default avatarJing Zhang <jizhan@amd.com>
      f83e9701
  14. 22 Mar, 2024 1 commit
  15. 19 Mar, 2024 1 commit
  16. 18 Mar, 2024 1 commit
    • Illia Silin's avatar
      Re-enable the performance tracking in CI. (#1203) · bdcd0374
      Illia Silin authored
      * test CK with rocm6.1 RC2
      
      * add docker credentials for pull
      
      * update the performance db name
      
      * use environment variable for db name
      
      * add rocm-llvm-dev package to ck docker
      
      * turn off verification for daily performance runs
      
      * do not stash ckProfiler on MI300 node
      
      * add processing of mixed gemms to qa, fix parsing of splitk gemm logs
      
      * fix the splitk gemm log file name
      
      * turn the timing on for splitk gemm performance
      bdcd0374
  17. 06 Mar, 2024 1 commit
    • Paul Fultz II's avatar
      Add host lib (#1134) · 8eff4d62
      Paul Fultz II authored
      
      
      * Format
      
      * Format
      
      * Format
      
      * Remove const
      
      * Use the right template
      
      * Format
      
      * Format
      
      * add row/col instances
      
      * Add missing file
      
      * fixed
      
      * Format
      
      * Updates
      
      * Format
      
      * fixed rrr layout
      
      * Format
      
      * Update test and embed modules
      
      * Restore older version
      
      * Update year
      
      * Set -fPIC
      
      * Format
      
      * Use double for isnan
      
      * rename host folder to codegen + minor fix
      
      * add codegen CI test
      
      * add option to build components without building CK
      
      * fix the groovy syntax
      
      * fix typo
      
      * use the correct function for the codegen stage
      
      ---------
      Co-authored-by: default avatarJing Zhang <jizha@amd.com>
      Co-authored-by: default avatarIllia Silin <98187287+illsilin@users.noreply.github.com>
      Co-authored-by: default avatarillsilin <Illia.Silin@amd.com>
      8eff4d62
  18. 05 Mar, 2024 1 commit
  19. 13 Feb, 2024 1 commit
  20. 05 Feb, 2024 1 commit
  21. 30 Jan, 2024 2 commits
  22. 26 Jan, 2024 1 commit
  23. 24 Jan, 2024 1 commit
    • Illia Silin's avatar
      Fixing most of the cppcheck errors. (#1142) · 180e5720
      Illia Silin authored
      * fix cppcheck errors, first pass
      
      * fix format
      
      * fix returned value in examples
      
      * add macro definitions for cppcheck
      
      * fix the profile_gemm logic
      
      * update the gemm profiler logic
      
      * add more difinitions to cppcheck, fix couple more errors
      
      * replace runtime error with message in device function
      
      * fix a couple of int4 issues
      
      * no return for fill function
      
      * fix errors in data_types.hpp
      
      * fix format
      
      * fix few remaining errors
      
      * fix errors in data_types.hpp
      
      * fix last couple of errors in datat_types.hpp
      180e5720
  24. 15 Jan, 2024 1 commit
    • Illia Silin's avatar
      Add cppcheck to CK CI. (#1125) · e6d099c8
      Illia Silin authored
      * add cppcheck to the CK CI
      
      * fix the path to CK source for cppcheck
      
      * fix the path to CK source for cppcheck one more time
      
      * fix the path to CK source for cppcheck third time
      
      * change the path to ck_cppcheck.log
      
      * install latest cppcheck from source
      
      * fix bug in ck.hpp and use 20 threads for cppcheck
      
      * create a switch to turn cppckeck on and off in CI
      e6d099c8
  25. 05 Jan, 2024 1 commit
  26. 16 Dec, 2023 1 commit
  27. 07 Dec, 2023 1 commit
  28. 06 Dec, 2023 1 commit
  29. 05 Dec, 2023 1 commit
  30. 30 Nov, 2023 1 commit
  31. 09 Nov, 2023 1 commit
  32. 03 Nov, 2023 1 commit
  33. 01 Nov, 2023 1 commit
  34. 30 Oct, 2023 1 commit
    • Illia Silin's avatar
      Enable sccache in the default docker and CI. (#1009) · 4e44a9e8
      Illia Silin authored
      
      
      * replace ccache with sccache, pin package versions
      
      * put ccache back temporarily to avoid breaking other CI jobs
      
      * add sccashe_wrapper.sh script
      
      * fix the package version syntax
      
      * fix the pymysql package issue
      
      * run sccache_wrapper before build if ccache server found
      
      * set the paths before calling the sccache_wrapper
      
      * use /tmp instead of /usr/local for cache
      
      * try using sccache --start-server instead of wrapper
      
      * try using redis server with sccache
      
      * define SCCACHE_REDIS
      
      * add redis and ping packages, and redis port
      
      * use the new sccache redis server
      
      * do not use sccache with staging compiler
      
      * fix the condition syntax
      
      * add stunnel to redis
      
      * add tunnel verification
      
      * separate caches for different architectures
      
      * fix syntax for the cache tag
      
      * quse double brackets for conditions
      
      * add bash line to the script
      
      * add a switch for sccache and only use it in build stage
      
      * run check_host function when enabling sccache
      
      * fix the invocation tags for sccache
      
      * fix groovy syntax
      
      * set the invocation tag in groovy
      
      * disable sccache in clang-format stage
      
      * try another syntax for invocation tags
      
      * use local sccache server if can't connect to redis
      
      * fix script syntax
      
      * update README
      
      * refresh readme
      
      * readme updates
      
      * remove the timing and verification caveat from readme
      
      ---------
      Co-authored-by: default avatarLisa Delaney <lisa.delaney@amd.com>
      4e44a9e8
  35. 19 Oct, 2023 1 commit
  36. 16 Oct, 2023 1 commit
    • Illia Silin's avatar
      Add hipTensor build and test to CK CI. (#990) · 707ad002
      Illia Silin authored
      * add a hipTensor test to CI
      
      * use jenkins git plugin
      
      * change hipTensor folder location in CI
      
      * change the git method for hipTensor
      
      * run tests usign ctest
      
      * check the hipTensor contents
      
      * only build hipTensor on MI100/200
      
      * pull hipTensor as zip archive
      
      * fix jenkins syntax
      
      * add path to the CK installation
      
      * combine build commands into one shell
      
      * change jenkins syntax for CK installer path
      
      * try different syntax
      
      * allow unzip overwrite
      
      * fix jenkins file syntax
      
      * remove any old versions of hipTensor before building
      
      * add option to select hipTensor branch for testing
      707ad002