1. 30 Jul, 2023 1 commit
    • Paul Fultz II's avatar
      Enable tuning for MLIR (#1965) · be6ecff6
      Paul Fultz II authored
      * Add initial tuning support
      
      * Format
      
      * Add extra param
      
      * Format
      
      * Use exauhstive flag
      
      * Format
      
      * Set expected shapes
      
      * Format
      
      * Format
      
      * Fix missing symbol
      
      * Format
      
      * Add missing license header
      
      * Format
      
      * Update src/targets/gpu/include/migraphx/gpu/mlir.hpp
      be6ecff6
  2. 29 Jul, 2023 1 commit
  3. 28 Jul, 2023 3 commits
  4. 27 Jul, 2023 1 commit
  5. 26 Jul, 2023 1 commit
  6. 25 Jul, 2023 3 commits
  7. 23 Jul, 2023 1 commit
  8. 22 Jul, 2023 3 commits
  9. 21 Jul, 2023 3 commits
    • Umang Yadav's avatar
      Add back clamping and add tests (#1969) · 6957243c
      Umang Yadav authored
      Fixes #1957
      
      Clamping was removed in #1853.
      
      Turns out clamping as necessary to handle overflow/underflow cases. during downcasting, if it overflowed then without clamping it returned infinity.
      6957243c
    • Umang Yadav's avatar
      Use `optimize_module` pass for the quantization to fp16 (#1974) · 6f1f4b59
      Umang Yadav authored
      Fixes #1746
      
      BatchNorm only has x as the runtime input parameter for the following equation. All the other parameters are compile-time constants and related operations can be const-folded before quantizing to fp16 to preserve precision.
      6f1f4b59
    • Umang Yadav's avatar
      Make global workitems multiple of local workitems (#1976) · 3216fe52
      Umang Yadav authored
      HIP requires global work items in multiple of local work items. If it is not it is not guaranteed to generate correct results all the time.
      Fixes #1977
      Fixes #1644
      MIGraphX CI has moved to rocm-5.6 which doesn't require hipRTC workarounds
      3216fe52
  10. 19 Jul, 2023 3 commits
  11. 18 Jul, 2023 1 commit
  12. 17 Jul, 2023 3 commits
  13. 16 Jul, 2023 1 commit
  14. 13 Jul, 2023 2 commits
    • Krzysztof Drewniak's avatar
      [NFC] Update MLIR usage to account for upstream merge (#1924) · c4765a6d
      Krzysztof Drewniak authored
      Allows the rocMLIR CI (which builds rocMLIR tip against MIGraphX tip) to pass.
      c4765a6d
    • Charlie Lin's avatar
      Update deconvolution -> convolution_backwards and Dynamic Shape Support (#1801) · 4edf1195
      Charlie Lin authored
      Renames deconvolution -> convolution_backwards to be more consistent with the literature
      Note: this is not the cross-correlation operator (which is the adjoint of convolution). This is technically a standard convolution operator combined with an upsampling operator rather than a downsampling operator.
      Adds unit tests for the padding, strides, dilations, and other op attributes.
      Throws on auto_pad attribute since it has not been implemented
      Previously it read the attribute and set it but then did nothing with it
      Extended for dynamic shapes
      Does not support using asymmetric padding (padding_L != padding_R) and output_shape with dynamic shapes.
      4edf1195
  15. 11 Jul, 2023 1 commit
  16. 10 Jul, 2023 2 commits
  17. 09 Jul, 2023 1 commit
  18. 08 Jul, 2023 2 commits
  19. 06 Jul, 2023 3 commits
    • Artur Wojcik's avatar
    • Paul Fultz II's avatar
      Use MIGRAPHX_GLOBAL (#1918) · c45b34c3
      Paul Fultz II authored
      This will also annotate the function with the block size so the compiler can do a better job of optimizing.
      c45b34c3
    • Paul Fultz II's avatar
      Enable eval to handle multiple contexts (#1751) · 072fd5cc
      Paul Fultz II authored
      This is to help enable multi-target execution. We store a vector of targets and contexts. Currently this will only compile a single target, the PR #1672 is needed to enable multiple targets.
      
      This will also serialize the targets and contexts.
      
      When using the execution_environment or prog.get_context() it will always use the context from the first target assuming this is the "primary" target. Although, its unlikely a user would use execution_environment with a multi-target environment.
      072fd5cc
  20. 05 Jul, 2023 2 commits
  21. 02 Jul, 2023 2 commits
    • Charlie Lin's avatar
      Dynamic shape ref `clip` operator (#1862) · 3f566882
      Charlie Lin authored
      Updates ref version of clip to work with dynamic shapes
      Encountered in agentmodel
      3f566882
    • Paul Fultz II's avatar
      Improvement to ck integration (#1859) · 3c9df3b4
      Paul Fultz II authored
      Add a CI job to test CK
      Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK
      Continue tuning even when there is invalid configs
      Fix a bug with parallel compilation not using all available threads
      Add additional test for gemms using half types
      Removed int32 as supported type since it doesnt pass our test suite
      3c9df3b4