1. 01 Nov, 2023 1 commit
  2. 30 Oct, 2023 1 commit
  3. 14 Oct, 2023 1 commit
  4. 13 Oct, 2023 1 commit
  5. 06 Oct, 2023 1 commit
  6. 29 Sep, 2023 2 commits
  7. 12 Aug, 2023 1 commit
  8. 11 Aug, 2023 1 commit
  9. 24 Jul, 2023 1 commit
  10. 02 Jul, 2023 1 commit
    • Improvement to ck integration (#1859) · 3c9df3b4
      Paul Fultz II authored
      Add a CI job to test CK
      Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK
      Continue tuning even when there are invalid configs
      Fix a bug with parallel compilation not using all available threads
      Add additional test for gemms using half types
      Removed int32 as supported type since it doesn't pass our test suite
  11. 17 Jun, 2023 1 commit
    • Update CK commit hash and add gfx940 to supported archs (#1842) · b8898d7e
      turneram authored
      * Add initial ck_gemm code
      
      * Format
      
      * Add additional src files
      
      * Format
      
      * Add include
      
      * Simplify fuse_ck
      
      * Format
      
      * Rename var
      
      * Enable pass
      
      * Update ck version
      
      * Fix include
      
      * Add group stride
      
      * Disable warnings for ck headers
      
      * Format
      
      * Add unpack array
      
      * Add interface to enable tuning
      
      * Format
      
      * Update compile_ops to handle tuning config
      
      * Format
      
      * Add some comments
      
      * Move time_op to migraphx_gpu
      
      * Add benchmarking
      
      * Refactor
      
      * Format
      
      * Add lift class macro
      
      * Use device name
      
      * Format
      
      * Generate configs
      
      * Format
      
      * Pass tuning parameter
      
      * Move data type to is_ck_gemm matcher
      
      * Format
      
      * Add problem_cache to avoid retuning same configs
      
      * Format
      
      * Format
      
      * Mark the problems
      
      * Format
      
      * Use is_null
      
      * Format
      
      * Resize vector
      
      * Only tune with exhaustive tuning
      
      * Format
      
      * Use assert
      
      * Format
      
      * Tidy fixes
      
      * More tidy fixes
      
      * Format
      
      * Add license to missing files
      
      * Format
      
      * Use transform
      
      * Format
      
      * Fix tidy
      
      * Format
      
      * Fix cppcheck issues
      
      * Format
      
      * Add static_assert
      
      * Add ops header
      
      * Add assertion in batcher
      
      * Format
      
      * Improve the batch fold check
      
      * Format
      
      * Add where op workaround for CK
      
      * Skip if any input is not a supported ck type
      
      * Format
      
      * Check batch is standard
      
      * Format
      
      * Remove redundant static keyword
      
      * Update commit hash
      
      * Fix error when running without --exhaustive-tune
      
      * Formatting
      
      * Formatting
      
      * Remove fuse_ck_gemm_softmax_gemm
      
      * Update ck hash
      
      * Correct spelling mistake
      
      * Remove commented out logic from fuse_ck
      
      * Remove unused include and add comment
      
      * Formatting
      
      * Remove redundant get_shape and remove ck_gemm from names
      
      * Formatting
      
      * Allow for mixed types with int8 gemms
      
      * Formatting
      
      * Add back find_package from merge
      
      * Update CK commit hash and add gfx940 to fuse_ops supported archs
      
      * Formatting
      
      * Update CK hash
  12. 08 Jun, 2023 1 commit
  13. 04 Mar, 2023 1 commit
  14. 21 Oct, 2022 1 commit
  15. 02 Aug, 2022 1 commit
  16. 12 Jul, 2022 1 commit
  17. 22 Jun, 2022 1 commit
  18. 26 May, 2021 1 commit
  19. 11 May, 2021 1 commit
  20. 22 Apr, 2021 1 commit
  21. 09 Apr, 2021 1 commit
    • Upgrade docker to rocm 4.1 and drop hcc (#795) · 6d937d80
      Paul Fultz II authored
      * Fix tidy warnings for 4.1
      
      * Formatting
      
      * Upgrade to 4.1 in docker
      
      * Remove hcc build and enable ubsan on clang debug
      
      * Add missing openmp package
      
      * Construct directly
      
      * Construct directly
      
      * Upgrade rocm-cmake version
  22. 17 Mar, 2021 1 commit
  23. 29 Jan, 2021 1 commit
  24. 08 Jan, 2021 1 commit
    • Revamp CI infrastructure (#706) · ceb4ca09
      Paul Fultz II authored
      
      
      * Add build and test github workflow
      
      * Fix cget command
      
      * Remove def-requirements.txt
      
      * Add tmate session to debug workflow
      
      * Run tmate session after installing dependencies
      
      * Print date periodically
      
      * Add clang tidy action
      
      * Separate build and run container in two different jobs
      
      * Run bash script
      
      * Remove interactive flag
      
      * Try to mount the files
      
      * Try to use the github workspace
      
      * Without double braces
      
      * Use env variable
      
      * Pipe bash script in
      
      * Run using hip-clang
      
      * Use correct path
      
      * Add verbose
      
      * Remove j flag
      
      * Only run for onnx file to debug
      
      * Manually run clang-tidy
      
      * Remove quiet flag
      
      * Print header file
      
      * Printout environment
      
      * Remove extra defines
      
      * Remove fixits and config flag
      
      * Show ldd
      
      * Add tmate session
      
      * Run onnx protobuf first
      
      * Generate proto for tensorflow
      
      * Update cppcheck version
      
      * Fix some cppcheck issues
      
      * Add const
      
      * Cppcheck fixes
      
      * Formatting
      
      * Fix more cppcheck issues
      
      * Run two jobs
      
      * Cache analysis and run format checking
      
      * Fix yaml issues
      
      * Fix yaml issues
      
      * Fix indentation
      
      * Switch to hip-clang for main docker file
      
      * Use hip-clang in the readme
      
      * Fixes for jenkins
      
      * Use ccache to build
      
      * Combine file
      
      * Set restore keys
      
      * Change stage name
      
      * Build with ccache
      
      * Add missing dependency for ccache
      
      * Build debug with codecov
      
      * Fix workflow syntax
      
      * Fix list
      
      * Use quotes
      
      * Go to correct build path
      
      * Install lcov
      
      * Use sudo
      
      * Echo all commands
      
      * Setup tmate
      
      * Add verbose output
      
      * Build with cmake directly
      
      * Add pthread flag
      
      * Remove python config
      
      * Continue on error
      
      * Use on or off for cmake flag
      
      * Always upload cache
      
      * Verbose output
      
      * Verbose output from build
      
      * Build one target
      
      * Reduce debug symbols
      
      * Increase garbage collection
      
      * Remove dmesg
      
      * Increase it to 20
      
      * Update rocm cmake version
      
      * Remove jobs from jenkins
      
      * Run on all 3 ubuntus
      
      * Remove gcc 5 jobs
      
      * Don't add flag on 16.04
      
      * Only upload coverage on 18.04
      
      * Don't build for ubuntu 20.04
      
      * Use matrix.os
      
      * Use O2 for hip-clang since lower optimizations are broken
      
      * Use rocm 3.0
      
      * Pass ccache as cmake variable instead of env variable
      
      * Build miopen from source
      
      * Show ccache statistics
      
      * Print log information
      
      * Set compression level
      
      * Use hash dir
      
      * Set hashdir
      
      * Install clang ocl from system
      
      * Up compression level
      
      * Add locale
      
      * Increase cache size to 1G
      
      * Lower compression level to 9
      
      * Remove split dwarf
      
      * Remove Og
      
      * Add back Og
      
      * Separate debug and codecov
      
      * Add missing backslash
      
      * Garbage collect more often
      
      * Add missing locales package
      
      * Use Os
      
      * Install onednn in docker and run tests
      
      * Include target headers in tests
      
      * Increase timeout
      
      * Remove if condition
      
      * Make flag public
      
      * Suppress memory leaks in onednn
      
      * Use equal
      
      * Add gh annotations
      
      * Update rocm-cmake version
      
      * Add ldconfig
      Co-authored-by: Shucai Xiao <shucai@gmail.com>
  25. 22 Oct, 2020 1 commit
  26. 04 Oct, 2020 1 commit
  27. 13 Aug, 2020 1 commit
  28. 10 Aug, 2020 1 commit
    • Convert value to json (#592) · a5648d9c
      Shucai Xiao authored
      
      
      * Add initial serialization
      
      * Formatting
      
      * Add unit tests
      
      * Formatting
      
      * Add tests for serialization
      
      * Formatting
      
      * Use or not and
      
      * Add value test
      
      * Formatting
      
      * Add more tests
      
      * Add shape serialization
      
      * Formatting
      
      * Add serialization for literal and argument
      
      * Formatting
      
      * Serialize empty types
      
      * Formatting
      
      * Tidy fixes
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Reformat value type macro
      
      * Formatting
      
      * Handle enum types
      
      * Formatting
      
      * Add float_equal
      
      * Fix tidy issue
      
      * Use declval for better sfinae
      
      * Formatting
      
      * backup code change
      
      * Fix macro name
      
      * Add more test cases to improve coverage
      
      * Formatting
      
      * Add more tests
      
      * Formatting
      
      * Fix assertion
      
      * code backup
      
      * clang format
      
      * Fix bug with keyless assignment
      
      * Formatting
      
      * add implementation of converting json to migraphx::value
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * code refactor
      
      * add unit tests
      
      * separate json conversion from serialization
      
      * clang format
      
      * add the missing files
      
      * refine code
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * code backup
      
      * fix cppcheck error
      
      * refine an implementation
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments and add more unit tests
      
      * clang format
      
      * code backup to try
      
      * fix review comments and add more unit tests
      
      * clang format
      
      * some refinement
      
      * clang format
      
      * fix review comments
      
      * remove unnecessary code
      
      * add assignment of null
      Co-authored-by: Paul <pfultz2@yahoo.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  29. 08 Jul, 2020 1 commit
    • Nd pooling gpu (#551) · d1258e80
      kahmed10 authored
      
      
      * initial progress
      
      * formatting
      
      * add pooling changes
      
      * formatting
      
      * change eliminate_pad
      
      * formatting
      
      * rename var
      
      * formatting
      
      * update op shape test and compute
      
      * formatting
      
      * revert conv constructor
      
      * formatting
      
      * change initializer
      
      * formatting
      
      * fix tidy
      
      * change quant conv and shape check
      
      * add tests and fixes
      
      * formatting
      
      * fix type
      
      * fix conv test
      
      * formatting
      
      * add pooling and bn tests
      
      * formatting
      
      * add inconsistent attr tests
      
      * fix padding issue
      
      * formatting
      
      * progress on 1d to 2d
      
      * formatting
      
      * change compute and compile functions
      
      * formatting
      
      * fix duplicate
      
      * fix conflict
      
      * fix issue with 1d conv
      
      * formatting
      
      * add check for 3d limit
      
      * rename function
      
      * formatting
      
      * update to MIOpen 2.3
      
      * add support for nd pooling
      
      * formatting
      
      * test miopen 2.4
      
      * change function name
      
      * rename functions
      
      * formatting
      
      * add op_shape test
      
      * add gpu ops tests
      
      * formatting
      
      * add pkg-config
      
      * change functions
      
      * formatting
      
      * change to copy_backward
      
      * formatting
      
      * test diff miopen version
      
      * add pooling shape tests
      
      * temp disable test
      
      * revert to miopen 2.4
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  30. 06 Apr, 2020 1 commit
    • Upgrade to hcc 3.0 (#487) · e184395c
      Paul Fultz II authored
      * Upgrade compiler to hcc
      
      * Add missing perl package for hcc
      
      * Update hip and rocblas version
      
      * Add flag to speedup build
      
      * Remove symlink
      
      * Add no-gpu-rdc to linking
      
      * Add missing define
      
      * Disable no unused argument
      
      * Suppress warning
      
      * Add definition to all targets
      
      * Fix compiler flags
      
      * Refer to rocblas
  31. 24 Feb, 2020 1 commit
  32. 05 Nov, 2019 1 commit
  33. 04 Sep, 2019 1 commit
  34. 12 Jul, 2019 1 commit
  35. 11 Jul, 2019 2 commits
  36. 07 Jun, 2019 1 commit
  37. 05 Jun, 2019 2 commits