1. 30 May, 2023 1 commit
  2. 26 May, 2023 2 commits
  3. 29 Sep, 2022 1 commit
  4. 22 Jun, 2022 1 commit
  5. 08 Apr, 2022 1 commit
  6. 25 Mar, 2022 1 commit
  7. 02 Mar, 2022 1 commit
  8. 05 Aug, 2021 1 commit
    • Add gpu driver and improvements to pointwise codegen (#851) · 29fa2666
      Paul Fultz II authored
      
      
      * Add method to compile pointwise
      
      * Formatting
      
      * Add lambda
      
      * Add semicolon
      
      * Rename variable
      
      * Add driver to run jit kernels
      
      * Formatting
      
      * Add context
      
      * Formatting
      
      * Make separate driver folder
      
      * Add more general gpu driver
      
      * Formatting
      
      * Print out wall time
      
      * Formatting
      
      * Run multiple times and skip first run
      
      * Formatting
      
      * Separate time_op
      
      * Run an op for comparison
      
      * Formatting
      
      * Add debug asserts
      
      * Formatting
      
      * Change parameter name
      
      * Formatting
      
      * Fix argument order
      
      * Formatting
      
      * Add preloading
      
      * Formatting
      
      * Allow a different data type
      
      * Formatting
      
      * Pipeline transformations
      
      * Formatting
      
      * Add vectorization
      
      * Formatting
      
      * Reduce dims
      
      * Formatting
      
      * Compile with launch params as constant
      
      * Formatting
      
      * Make sure buffer can be vectorized
      
      * Formatting
      
      * Enable vectorization and preloading
      
      * Formatting
      
      * Add print header
      
      * Formatting
      
      * Avoid allocating too large an LDS
      
      * Formatting
      
      * Add some vec functions to a separate header
      
      * Formatting
      
      * Add stride loops
      
      * Formatting
      
      * Improve the transform pipeline
      
      * Formatting
      
      * Add const
      
      * Fix shape check
      
      * Formatting
      
      * Just check stride axis is zero
      
      * Remove extra find_vector_axis overload
      
      * Simplify some more functions
      
      * Formatting
      
      * Remove some more extra functions
      
      * Formatting
      
      * Simplify more decltypes
      
      * Add another const
      
      * Fix test
      
      * Get buffer pointer different for older compilers
      Co-authored-by: Shucai Xiao <shucai@gmail.com>
      Co-authored-by: Chris Austen <causten@users.noreply.github.com>
  9. 29 Apr, 2021 1 commit
    • MLIR MIOpen Dialect integration (phase 1) (#768) (#769) · 56584fa2
      SJW authored
      
      
      * MLIR MIOpen Dialect integration (phase 1) (#768)
      
      * Added Findmlir.cmake (using environment variables to import)
      
      * Added mlir_conv pass to GPU target
      
        * Apply to any gpu::convolution if supported by MLIR
      
        * Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution
      
        * Capture binary in dictionary for matching convolutions
      
        * Build a code_object_op with the binary and execution dimensions
      
        * Substitute for the gpu::convolution
      
      * Changed the parameters for the code_object to reflect the generated MLIR kernel
      
      * Expanded out MemRefDescriptor fields in param list
      
      * Also updated for MLIR C-API changes
      
      * * fixed global_size calculation
      
      * MLIR MIOpen Dialect integration (phase 1) (#768)
      
      * Added Findmlir.cmake (using environment variables to import)
      
      * Added mlir_conv pass to GPU target
      
        * Apply to any gpu::convolution if supported by MLIR
      
        * Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution
      
        * Capture binary in dictionary for matching convolutions
      
        * Build a code_object_op with the binary and execution dimensions
      
        * Substitute for the gpu::convolution
      
      * Changed the parameters for the code_object to reflect the generated MLIR kernel
      
      * Expanded out MemRefDescriptor fields in param list
      
      * Also updated for MLIR C-API changes
      
      * * Added command line option: --enable_mlir
      
      * * fixed command line switch
      
      * updated for new MLIR API changes
      
      * * Added cget llvm-project-mlir to import MIIR API libraries into Dockerfile
        * removed cmake Findmlir
      
      * updated for changes in MIIR C-API
      
      * * updated CMakeLists.txt to allow disable of MLIR import
      
      * fixed memory leaks and removed copies
      
      * updated for 5D memrefs
      
      * * formatting
      
      * * fixed review comments
      
      * * fixed merge issues
      
      * hip gcnDeviceName now includes specifiers at the end
        * use major/minor values instead
      
      * * disable MLIR by default
      
      * * removed command-line switch --enable-mlir
      
      * * fix unused when MLIR disabled
      
      * * enable jenkins enable/test MLIR
      
      * * format
      
      * * fixed clang-tidy
      
      * * added new type
      Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  10. 22 Apr, 2021 1 commit
    • Cpu fusions using post_ops (#781) · f7befe50
      Paul Fultz II authored
      
      
      * Add eliminate_data_type pass
      
      * Formatting
      
      * Auto convert quant ops
      
      * Formatting
      
      * Flip the order of decompose
      
      * Compute max size differently
      
      * Formatting
      
      * Clamp values in convert
      
      * Formatting
      
      * Fix loss of precision in reduce
      
      * Formatting
      
      * Fix bugs in reduction
      
      * Fix accumulator type in reference softmax implementation
      
      * Formatting
      
      * Update convert test
      
      * Remove unused variables
      
      * Remove unnecessary quant_dot check
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * Remove unused code
      
      * Remove duplicate ops
      
      * Remove blaze dependency
      
      * Use set since shape::type_t is not hashable on gcc 5
      
      * Formatting
      
      * Add dnnl binary op
      
      * Formatting
      
      * Add binary and eltwise
      
      * Formatting
      
      * Add softmax
      
      * Formatting
      
      * Remove unused operators
      
      * Add missing files
      
      * Formatting
      
      * Add lrn
      
      * Formatting
      
      * Add deconvolution
      
      * Formatting
      
      * Change allocate default
      
      * Add reorder
      
      * Formatting
      
      * Add reductions
      
      * Formatting
      
      * Sort lines
      
      * Change literals in another loop
      
      * Add pow operator
      
      * Formatting
      
      * Add pow operator
      
      * Formatting
      
      * Make sure shapes are packed
      
      * Allow broadcasted inputs
      
      * Remove unused operators
      
      * Simplify functions
      
      * Remove softmax
      
      * Add sub and erf functions
      
      * Formatting
      
      * Fix bug
      
      * Formatting
      
      * Improve parallelism
      
      * Formatting
      
      * Allow multiple batch dimensions
      
      * Formatting
      
      * Move literal transforms out of lowering
      
      * Formatting
      
      * Add gather operator
      
      * Sort lines
      
      * Add early exit for carry
      
      * Formatting
      
      * Add missing concat
      
      * Rename macro
      
      * Fix deep nesting
      
      * Formatting
      
      * Fix cppcheck issues
      
      * Remove else
      
      * Move attribute to typedef
      
      * Formatting
      
      * Disable maybe-uninitialized warning since it's broken on gcc
      
      * Add constexpr default constructor
      
      * Formatting
      
      * Fix compiler warnings
      
      * Fix adjust_allocation test
      
      * Add layernorm matcher
      
      * Add gelu_erf matcher
      
      * Formatting
      
      * Add gelu_tanh matcher
      
      * Formatting
      
      * Remove match namespace
      
      * Formatting
      
      * Use matcher instead of string
      
      * Formatting
      
      * Add fusions
      
      * Formatting
      
      * Add post op field
      
      * Formatting
      
      * Make post_ops serializable
      
      * Formatting
      
      * Add eltwise fusions
      
      * Formatting
      
      * Fix null conversions
      
      * Formatting
      
      * Add fuse_ops source files
      
      * Formatting
      
      * Set binary post op index correctly
      
      * Formatting
      
      * Fix serialization bugs
      
      * Check if used once
      
      * Formatting
      
      * Fix error in get_primitive_attr
      
      * Formatting
      
      * Add compile function
      
      * Formatting
      
      * Limit fusions
      
      * Formatting
      
      * Disable with env variable instead of using compile arg
      
      * Formatting
      
      * Fix implicit conversion to bool
      
      * Declare on separate lines
      
      * Formatting
      
      * Fix cppcheck issues
      
      * Fix ICE in pack_join
      
      * Formatting
      
      * Use const ref
      
      * Make enum hashable
      
      * Formatting
      
      * Add explicit this
      
      * Fix merge issues
      
      * Fix dangling ref
      
      * Formatting
      
      * Add test for compile
      
      * Formatting
      
      * Add more value tests
      
      * Formatting
      Co-authored-by: Shucai Xiao <shucai@gmail.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  11. 28 Jan, 2021 1 commit
  12. 16 Nov, 2020 1 commit
    • Normalize ops (#667) · 8443ecd1
      Shucai Xiao authored
      
      
      * add a pass to normalize ops
      
      * clang format
      
      * add unit tests
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * add support for slice in the normalize_op function
      
      * clang format
      
      * add operation method api for whether we need to call normalize_op
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * rename a function name
      
      * clang format
      
      * change compute_shape to normalize_compute_shape for corresponding operators
      
      * clang format
      
      * remove unnecessary code
      
      * fix various issues
      
      * clang format
      
      * add attributes to operators having axis attributes
      
      * clang format
      
      * fixed jenkins build error
      
      * clang format
      
      * fix a bug related to slice
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * rename a file
      
      * fix cppcheck error
      
      * some code refinement
      
      * clang format
      
      * change attributes to enum
      
      * clang format
      
      * refine the enum
      
      * clang format
      
      * remove unnecessary code
      
      * add unit tests for more code coverage and fixed a bug
      
      * clang format
      
      * remove unnecessary changes
      
      * change normalize_axes to normalize
      
      * clang format
      
      * revert back the changes in broadcast.hpp
      
      * rename normalize_axes to normalize
      
      * fix review comments
      
      * clang format
      
      * Add flag to enable cpu backend
      
      * Make buffers shared
      
      * Enable optimizations
      
      * Formatting
      
      * Try to avoid ambiguous assign in value class
      
      * fixed a build error
      
      * clang format
      
      * add the normalize_ops pass to the ref target
      
      * refactor program to module to normalize_ops pass
      Co-authored-by: Paul <pfultz2@yahoo.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  13. 11 Sep, 2020 1 commit
  14. 26 Aug, 2020 1 commit
    • Add C++ ability to construct operators by name (#616) · 002eb4e2
      Paul Fultz II authored
      
      
      * Add make_op function
      
      * Formatting
      
      * Add more values
      
      * Formatting
      
      * Remove template parse_conv functions
      
      * Formatting
      
      * Remove mat_mul template
      
      * Formatting
      
      * Reduce header includes
      
      * Fix compiling for gpu
      
      * Formatting
      
      * Use make_op in lowering
      
      * Formatting
      
      * Sort lines
      
      * Formatting
      
      * Add more tests
      
      * Formatting
      
      * Fix tidy error
      
      * Formatting
      
      * Add const refs
      
      * Add explicit this
      
      * Add more const refs
      
      * Sort the program
      
      * Remove commented out code
      
      * Formatting
      
      * Infer gpu prefix
      
      * Formatting
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  15. 13 Aug, 2020 1 commit
  16. 10 Aug, 2020 1 commit
    • Convert value to json (#592) · a5648d9c
      Shucai Xiao authored
      
      
      * Add initial serialization
      
      * Formatting
      
      * Add unit tests
      
      * Formatting
      
      * Add tests for serialization
      
      * Formatting
      
      * Use or not and
      
      * Add value test
      
      * Formatting
      
      * Add more tests
      
      * Add shape serialization
      
      * Formatting
      
      * Add serialization for literal and argument
      
      * Formatting
      
      * Serialize empty types
      
      * Formatting
      
      * Tidy fixes
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Reformat value type macro
      
      * Formatting
      
      * Handle enum types
      
      * Formatting
      
      * Add float_equal
      
      * Fix tidy issue
      
      * Use declval for better sfinae
      
      * Formatting
      
      * backup code change
      
      * Fix macro name
      
      * Add more test cases to improve coverage
      
      * Formatting
      
      * Add more tests
      
      * Formatting
      
      * Fix assertion
      
      * code backup
      
      * clang format
      
      * Fix bug with keyless assignment
      
      * Formatting
      
      * add implementation of converting json to migraphx::value
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * code refactor
      
      * add unit tests
      
      * separate json conversion from serialization
      
      * clang format
      
      * add the missing files
      
      * refine code
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * code backup
      
      * fix cppcheck error
      
      * refine an implementation
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments and add more unit tests
      
      * clang format
      
      * code backup to try
      
      * fix review comments and add more unit tests
      
      * clang format
      
      * some refinement
      
      * clang format
      
      * fix review comments
      
      * remove unnecessary code
      
      * add assignment of null
      Co-authored-by: Paul <pfultz2@yahoo.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  17. 29 Jul, 2020 1 commit
    • Add serialization framework (#577) · 9c3c3c5f
      Paul Fultz II authored
      
      
      * Add initial serialization
      
      * Formatting
      
      * Add unit tests
      
      * Formatting
      
      * Add tests for serialization
      
      * Formatting
      
      * Use or not and
      
      * Add value test
      
      * Formatting
      
      * Add more tests
      
      * Add shape serialization
      
      * Formatting
      
      * Add serialization for literal and argument
      
      * Formatting
      
      * Serialize empty types
      
      * Formatting
      
      * Tidy fixes
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Reformat value type macro
      
      * Formatting
      
      * Handle enum types
      
      * Formatting
      
      * Add float_equal
      
      * Fix tidy issue
      
      * Use declval for better sfinae
      
      * Formatting
      
      * Fix macro name
      
      * Add more test cases to improve coverage
      
      * Formatting
      
      * Add more tests
      
      * Formatting
      
      * Fix assertion
      
      * Fix bug with keyless assignment
      
      * Formatting
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>