1. 01 Jun, 2023 3 commits
  2. 31 May, 2023 3 commits
  3. 29 May, 2023 1 commit
  4. 25 May, 2023 1 commit
  5. 19 May, 2023 1 commit
  6. 22 Mar, 2023 1 commit
  7. 13 Mar, 2023 1 commit
  8. 16 Feb, 2023 1 commit
  9. 31 Jan, 2023 1 commit
    • Umang Yadav's avatar
      hipRTC fixes (#1531) · 91cc7242
      Umang Yadav authored
      Added CMakeFlag for hipRTC. MIGRAPHX_USE_HIPRTC.
      Added stages in Jenkins for hipRTC.
      Fixes for some of the pending issues from hipRTC.
      91cc7242
  10. 06 Jan, 2023 1 commit
  11. 26 Sep, 2022 1 commit
    • Charlie Lin's avatar
      Rewrite ONNX parse batch norm (#1362) · c00f8202
      Charlie Lin authored
      Rewrites the BatchNormalization ONNX operator into other MIGX operators
      - Added handling of 1D input tensor case (edge case in ONNX spec)
      Removes the spatial and per_activation functionality (not in the ONNX spec)
      - Did not remove the batch_norm_inference related code as the TensorFlow parser still uses it
      - Can remove that code when the TF version is updated
      c00f8202
  12. 12 Jul, 2022 1 commit
    • Paul Fultz II's avatar
      Add tests for C API (#1266) · a7a32a9e
      Paul Fultz II authored
      This will ensure that migraphx.h can be included from a C compiler, and check that the C API can be called. This includes stdbool.h which is needed when using bool from C.
      a7a32a9e
  13. 16 Jun, 2022 1 commit
  14. 29 Mar, 2022 1 commit
  15. 05 Nov, 2021 1 commit
  16. 28 Sep, 2021 1 commit
  17. 26 Jul, 2021 1 commit
  18. 25 Jul, 2021 1 commit
  19. 29 Apr, 2021 1 commit
    • SJW's avatar
      MLIR MIOpen Dialect integration (phase 1) (#768) (#769) · 56584fa2
      SJW authored
      
      
      * MLIR MIOpen Dialect integration (phase 1) (#768)
      
      * Added Findmlir.cmake (using environment variables to import)
      
      * Added mlir_conv pass to GPU target
      
        * Apply to any gpu::convolution if supported by MLIR
      
        * Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution
      
        * Capture binary in dictionary for matching convolutions
      
        * Build a code_object_op with the binary and execution dimensions
      
        * Substitute for the gpu::convolution
      
      * Changed the parameters for the code_object to reflect the generated MLIR kernel
      
      * Expanded out MemRefDescriptor fields in param list
      
      * Also updated for MLIR C-API changes
      
      * * fixed global_size calculation
      
      * MLIR MIOpen Dialect integration (phase 1) (#768)
      
      * Added Findmlir.cmake (using environment variables to import)
      
      * Added mlir_conv pass to GPU target
      
        * Apply to any gpu::convolution if supported by MLIR
      
        * Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution
      
        * Capture binary in dictionary for matching convolutions
      
        * Build a code_object_op with the binary and execution dimensions
      
        * Substitute for the gpu::convolution
      
      * Changed the parameters for the code_object to reflect the generated MLIR kernel
      
      * Expanded out MemRefDescriptor fields in param list
      
      * Also updated for MLIR C-API changes
      
      * * Added command line option: --enable_mlir
      
      * * fixed command line switch
      
      * updated for new MLIR API changes
      
      * * Added cget llvm-project-mlir to import MIIR API libraries into Dockerfile
        * removed cmake Findmlir
      
      * updated for changes in MIIR C-API
      
      * * updated CMakeLists.txt to allow disable of MLIR import
      
      * fixed memory leaks and removed copies
      
      * updated for 5D memrefs
      
      * * formatting
      
      * * fixed review comments
      
      * * fixed merge issues
      
      * hip gcnDeviceName now includes specifiers at the end
        * use major/minor values instead
      
      * * disable MLIR by default
      
      * * removed command-line switch --enable-mlir
      
      * * fix unused when MLIR disabled
      
      * * enable jenkins enable/test MLIR
      
      * * format
      
      * * fixed clang-tidy
      
      * * added new type
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      56584fa2
  20. 09 Apr, 2021 1 commit
    • Paul Fultz II's avatar
      Upgrade docker to rocm 4.1 and drop hcc (#795) · 6d937d80
      Paul Fultz II authored
      * Fix tidy warnings for 4.1
      
      * Formatting
      
      * Upgrade to 4.1 in docker
      
      * Remove hcc build and enable ubsan on clang debug
      
      * Add missing openmp package
      
      * Construct directly
      
      * Construct directly
      
      * Upgrade rocm-cmake version
      6d937d80
  21. 08 Jan, 2021 1 commit
    • Paul Fultz II's avatar
      Revamp CI infrastucture (#706) · ceb4ca09
      Paul Fultz II authored
      
      
      * Add build and test github workflow
      
      * Fix cget command
      
      * Remove def-requirements.txt
      
      * Add tmate session to debug workflow
      
      * Run tmate session after installing dependencies
      
      * Print date periodically
      
      * Add clang tidy action
      
      * Seperate build and run container in two different jobs
      
      * Run bash script
      
      * Remove interactive flag
      
      * Try to mount the files
      
      * Try to use the github workspace
      
      * WIthout double braces
      
      * Use env variable
      
      * Pipe bash script in
      
      * Run using hip-clang
      
      * Use correct path
      
      * Add verbose
      
      * Remove j flag
      
      * Only run for onnx file to debug
      
      * Manually run clang-tidy
      
      * Remove quiet flag
      
      * Print header file
      
      * Printout environment
      
      * Remove extra defines
      
      * Remove fixits and config flag
      
      * Show ldd
      
      * Add tmate session
      
      * Run onnx protobuf first
      
      * Generate proto for tensorflow
      
      * Update cppcheck version
      
      * Fix some cppcheck issues
      
      * Add const
      
      * Cppcheck fixes
      
      * Formatting
      
      * Fix more cppcheck issues
      
      * Run two jobs
      
      * Cache analysis and run format checking
      
      * Fix yaml issues
      
      * Fix yaml issues
      
      * Fix indentation
      
      * Switch to hip-clang for main docker file
      
      * Use hip-clang in the readme
      
      * Fixes for jenkins
      
      * Use ccache to build
      
      * Combine file
      
      * Set restore keys
      
      * Change stage name
      
      * Build with ccache
      
      * Add missing dependency for ccache
      
      * Build debug with codecov
      
      * Fix workflow syntax
      
      * Fix list
      
      * Use quotes
      
      * Got to correct build path
      
      * Install lcov
      
      * Use sudo
      
      * Echo all commands
      
      * Setup tmate
      
      * Add verbose output
      
      * Build with cmake directly
      
      * Add pthread flag
      
      * Remove python config
      
      * Continue on error
      
      * Use on or off for cmake flag
      
      * Use always upload cache
      
      * Verbose output
      
      * Verbose output from build
      
      * Build one target
      
      * Reduce debug symbols
      
      * Increase garbage collection
      
      * Remove dmesg
      
      * Increase it to 20
      
      * Update rocm cmake version
      
      * Remove jobs from jenkins
      
      * Run on all 3 ubuntus
      
      * Remove gcc 5 jobs
      
      * Dont add flag on 16.04
      
      * Only upload coverage on 18.04
      
      * Dont build for ubuntu 20.04
      
      * Use matrix.os
      
      * Use O2 for hip-clang since lower optimizations are broken
      
      * Use rocm 3.0
      
      * Pass ccache as cmake variable instead of env variable
      
      * Build miopen from source
      
      * Show ccache statistics
      
      * Print log information
      
      * Set compression level
      
      * Use hash dir
      
      * Set hashdir
      
      * Install clang ocl from system
      
      * Up compression level
      
      * Add locale
      
      * Increase cache size to 1G
      
      * Lower compression level to 9
      
      * Remove split dwarf
      
      * Remove Og
      
      * Add back Og
      
      * Seperate debug and codecov
      
      * Add missing backlash
      
      * Garbage collect more often
      
      * Add missing locales package
      
      * Use Os
      
      * Install onednn in docker and run tests
      
      * Include target headers in tests
      
      * Increase timeout
      
      * Remove if condtion
      
      * Make flag public
      
      * Suppress memory leaks in onednn
      
      * Use equal
      
      * Add gh annotations
      
      * Update rocm-cmake version
      
      * Add ldconfig
      Co-authored-by: default avatarShucai Xiao <shucai@gmail.com>
      ceb4ca09
  22. 14 Dec, 2020 1 commit
    • Paul Fultz II's avatar
      Use dnnl for cpu backend (#688) · 406afeb8
      Paul Fultz II authored
      
      
      * Add flag to enable cpu backend
      
      * Make buffers shared
      
      * Enable optimizations
      
      * Add onednn
      
      * Formatting
      
      * Formatting
      
      * Add dnnl header
      
      * Formatting
      
      * Rewrite rnn first
      
      * Formatting
      
      * Call reference implementation
      
      * Formatting
      
      * Make literal data shared
      
      * Formatting
      
      * Add convolution
      
      * Formatting
      
      * Compensate for dilation
      
      * Formatting
      
      * Use name/make_op instead
      
      * Formatting
      
      * Rename gemm header
      
      * Formatting
      
      * Add dnnl convolution/gemm operators
      
      * Formatting
      
      * Add eliminate_contiguous
      
      * Add faster pointwise operators
      
      * Formatting
      
      * Formatting
      
      * Formatting
      
      * Add dnnl op class
      
      * Formatting
      
      * Add add op
      
      * Formatting
      
      * Add concat operator
      
      * Formatting
      
      * Add more ops
      
      * Create descriptor during finalization
      
      * Formatting
      
      * Dont rewrite pooling
      
      * Enable memory coloring
      
      * Formatting
      
      * Add output aliases
      
      * Formatting
      
      * Fix errors
      
      * Formatting
      
      * Convert literals
      
      * Add missing file
      
      * Remove batch_norm
      
      * Formatting
      
      * Use strides
      
      * Formatting
      
      * Add some debug checks
      
      * Formatting
      
      * Fix big in adjusting shape for gemm
      
      * Formatting
      
      * Fix fallback dot operator
      
      * Zero initialize buffers
      
      * Add suport for group convolutions
      
      * Formatting
      
      * Make adjust allocation target independent
      
      * Formatting
      
      * Enable adjust_allocation for gpu/cpu
      
      * Formatting
      
      * Add copy to allocation model
      
      * Formatting
      
      * Add copy operator
      
      * Formatting
      
      * Better handling of output parameters in adjust_allocation
      
      * Formatting
      
      * Build with dnnl
      
      * Make dnnl required
      
      * Fix compile error
      
      * Tidy fixes
      
      * Formatting
      
      * Tidy fixes
      
      * Formatting
      
      * Fix more tidy issues
      
      * Formatting
      
      * Add mul op
      
      * Add mul op
      
      * Set c compiler to clang as well
      
      * Compensate for normalized compute shape
      
      * Formatting
      
      * Fix cppcheck errors
      
      * Formatting
      
      * Add onednn library to hcc
      
      * Guard clang pragmas
      
      * Disable cpu mode for gcc for now
      
      * Leave it enabled it for gcc 7
      
      * Fix cppcheck suppresion
      
      * Fix compile error on gcc 5
      
      * Remove unused code
      Co-authored-by: default avatarShucai Xiao <shucai.xiao@amd.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      406afeb8
  23. 25 Nov, 2020 1 commit
    • Paul Fultz II's avatar
      Fix issues with python 'ImportError' (#690) · 1c417ae7
      Paul Fultz II authored
      * Fix issue with module wrapper
      
      * Formatting
      
      * Remove custom build of python 3.6
      
      * Fix print function in tests
      
      * Formatting
      
      * Install onnx
      
      * Download the models
      
      * Add gcc 5
      
      * Boost time limit for now
      
      * Try to use oauth instead of app
      
      * Disable all warnings
      1c417ae7
  24. 18 Nov, 2020 2 commits
  25. 10 Nov, 2020 1 commit
  26. 22 Oct, 2020 1 commit
  27. 07 Oct, 2020 1 commit
  28. 03 Oct, 2020 2 commits
  29. 02 Oct, 2020 1 commit
  30. 01 Oct, 2020 1 commit
  31. 30 Sep, 2020 1 commit
    • Paul Fultz II's avatar
      Add hip clang builds to jenkins (#651) · f28a62ea
      Paul Fultz II authored
      * Make global variables const
      
      * Tidy fixes
      
      * Disable some lints
      
      * Formatting
      
      * Fix tidy const
      
      * Formatting
      
      * Add missing const keywords
      
      * Formatting
      
      * More fixes
      
      * Fix remaining tidy issues
      
      * Formatting
      
      * Fix rocblas function call
      
      * Formatting
      
      * Fix nodiscard warnings
      
      * Formatting
      
      * Use named parameters
      
      * Remove overload
      
      * Add overload
      
      * Remove noncps
      
      * Use named param for node
      
      * Add auto register header
      
      * Use named parameters
      
      * Refactor jenkinsfile
      
      * Fix shadow
      
      * Add missing body variable
      
      * Add more const methods
      
      * Add hip-clang docker builds
      
      * Remove comments
      
      * Add clang-format
      
      * Add more const
      
      * Formatting
      
      * Rename stage
      
      * Disable check
      
      * Add another const
      
      * Add python 2 dev packages
      
      * Add sphinx to dockerfile
      f28a62ea
  32. 09 Jul, 2020 1 commit
  33. 18 Jun, 2020 1 commit
    • Paul Fultz II's avatar
      Test with onnx runtime (#552) · 59e36b72
      Paul Fultz II authored
      * Build and test onnxrt
      
      * Add sudo command
      
      * Add sudo
      
      * Add pkgconfig
      
      * Make root user
      
      * Move unstash out
      
      * Remove noncps
      
      * Add NonCPS back
      
      * Remove all noncps
      
      * Use each method
      
      * Move unstash command
      
      * Unstash before
      
      * Move stash command up
      
      * Move unstash to noncps function
      
      * Remove noncps
      
      * Use a function to unstash
      
      * Remove call to unused function
      
      * Change order of args
      
      * Add another rocmtestnode overload
      
      * List files
      
      * Use capital R
      
      * Search in build directory
      
      * Use force
      
      * Use newer cmake with onnx
      
      * Install requirements
      
      * Print out pip list
      
      * Install pip3
      
      * Add cxxflags for hip
      
      * Generate locale
      
      * Install wheel with pip3
      
      * Disable pip installation
      
      * Disable build wheel
      59e36b72
  34. 16 Apr, 2020 1 commit