1. 16 Sep, 2023 1 commit
  2. 13 Sep, 2023 1 commit
  3. 12 Sep, 2023 1 commit
  4. 10 Aug, 2023 1 commit
  5. 09 Aug, 2023 1 commit
  6. 08 Aug, 2023 1 commit
  7. 30 Jul, 2023 1 commit
    • Enable tuning for MLIR (#1965) · be6ecff6
      Paul Fultz II authored
      * Add initial tuning support
      
      * Format
      
      * Add extra param
      
      * Format
      
      * Use exhaustive flag
      
      * Format
      
      * Set expected shapes
      
      * Format
      
      * Format
      
      * Fix missing symbol
      
      * Format
      
      * Add missing license header
      
      * Format
      
      * Update src/targets/gpu/include/migraphx/gpu/mlir.hpp
  8. 19 Jul, 2023 1 commit
  9. 18 Jul, 2023 1 commit
  10. 13 Jul, 2023 1 commit
    • Update deconvolution -> convolution_backwards and Dynamic Shape Support (#1801) · 4edf1195
      Charlie Lin authored
      Renames deconvolution -> convolution_backwards to be more consistent with the literature
      Note: this is not the cross-correlation operator (which is the adjoint of convolution). This is technically a standard convolution operator combined with an upsampling operator rather than a downsampling operator.
      Adds unit tests for the padding, strides, dilations, and other op attributes.
      Throws on auto_pad attribute since it has not been implemented
      Previously it read the attribute and set it but then did nothing with it
      Extended for dynamic shapes
      Does not support using asymmetric padding (padding_L != padding_R) and output_shape with dynamic shapes.
  11. 08 Jul, 2023 1 commit
    • export API symbols from dynamic libraries (#1892) · c04fbc92
      Artur Wojcik authored
      Export API symbols for migraphx, migraphx_ref, migraphx_cpu, migraphx_gpu, migraphx_device, migraphx_tf, and migraphx_onnx. There is a separate PR for migraphx_c.
      
      API symbol exporting affects only Windows. It is transparent on Linux.
  12. 05 Jul, 2023 1 commit
  13. 02 Jul, 2023 1 commit
    • Improvement to ck integration (#1859) · 3c9df3b4
      Paul Fultz II authored
      Add a CI job to test CK
      Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK
      Continue tuning even when there are invalid configs
      Fix a bug with parallel compilation not using all available threads
      Add additional test for gemms using half types
      Removed int32 as a supported type since it doesn't pass our test suite
  14. 08 Jun, 2023 1 commit
  15. 24 May, 2023 1 commit
  16. 17 May, 2023 1 commit
  17. 04 May, 2023 1 commit
    • [mlir] Adding quant convolution fusion as anchor op (#1683) · 7f105952
      Zhuoran Yin authored
      Exposed the mlir_enabled() call to decide whether the lowering pipeline is enabled
      Disabled the rewrite quantization pipeline in MLIR compilation
      Added quant convolution as an anchor op
      Fixed the return type expectations
      Added the fallback HIP implementation for quantizelinear and dequantizelinear
      Will need advice on improving the implementation for quantizelinear
  18. 24 Apr, 2023 1 commit
    • Dynamic shape hip::copy_to_gpu and hip::copy_from_gpu (#1694) · 84acaea0
      Charlie Lin authored
      Updates the hip::copy_to_gpu and hip::copy_from_gpu operators to work with dynamic shapes
      
      Allows for offload_copy to be used with dynamic batch
      
      Changed the assert in select_module because the argument might now be smaller, given how offload_copy works with dynamic batch (the maximum buffer size is used).
  19. 06 Apr, 2023 1 commit
  20. 05 Apr, 2023 1 commit
  21. 30 Mar, 2023 1 commit
  22. 28 Mar, 2023 1 commit
  23. 21 Mar, 2023 1 commit
  24. 18 Mar, 2023 1 commit
  25. 01 Mar, 2023 1 commit
  26. 16 Feb, 2023 1 commit
  27. 14 Feb, 2023 1 commit
  28. 06 Feb, 2023 1 commit
  29. 07 Dec, 2022 1 commit
  30. 07 Nov, 2022 1 commit
  31. 02 Nov, 2022 1 commit
  32. 19 Oct, 2022 2 commits
  33. 18 Oct, 2022 1 commit
  34. 13 Oct, 2022 1 commit
  35. 04 Oct, 2022 1 commit
  36. 29 Sep, 2022 1 commit
  37. 28 Sep, 2022 1 commit
    • Add compute_fp32 flag for quant_gemm tests (#1360) · 70e63960
      Umang Yadav authored
      test_gpu_pack_int8_args fails on gfx908 machines because it doesn't set the compute_fp32 flag correctly. This PR fixes the test so that it checks the device name and rocBLAS version and sets the flag accordingly.
  38. 26 Sep, 2022 1 commit
  39. 23 Sep, 2022 1 commit
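
The note in the convolution_backwards commit (#1801) says the operator is a standard convolution combined with an upsampling step, not a cross-correlation. The sketch below illustrates that claim in 1-D only. It is not MIGraphX code: the function names are hypothetical, and it assumes symmetric padding, no dilation, and no output padding.

```python
def upsample_zeros(x, stride):
    """Insert (stride - 1) zeros between consecutive samples of x."""
    if not x:
        return []
    out = [0.0] * ((len(x) - 1) * stride + 1)
    for i, v in enumerate(x):
        out[i * stride] = v
    return out


def full_convolve(x, w):
    """True (kernel-flipping) full convolution of two 1-D sequences."""
    out = [0.0] * (len(x) + len(w) - 1)
    for i, xv in enumerate(x):
        for j, wv in enumerate(w):
            out[i + j] += xv * wv
    return out


def conv_backwards_1d(x, w, stride=1, padding=0):
    """Hypothetical 1-D illustration: upsample, convolve, then crop the padding."""
    y = full_convolve(upsample_zeros(x, stride), w)
    return y[padding:len(y) - padding]


def scatter_reference(x, w, stride=1, padding=0):
    """Direct definition: each input sample scatters a scaled copy of w."""
    n = (len(x) - 1) * stride + len(w)
    out = [0.0] * n
    for i, xv in enumerate(x):
        for j, wv in enumerate(w):
            out[i * stride + j] += xv * wv
    return out[padding:n - padding]


if __name__ == "__main__":
    x, w = [1.0, 2.0, 3.0], [1.0, 1.0]
    print(conv_backwards_1d(x, w, stride=2))  # [1.0, 1.0, 2.0, 2.0, 3.0, 3.0]
    print(scatter_reference(x, w, stride=2))  # same values
```

Under these assumptions the output length is (len(x) - 1) * stride + len(w) - 2 * padding, which is why the operator grows the spatial size instead of shrinking it.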