1. 30 Nov, 2023 1 commit
  2. 28 Nov, 2023 1 commit
  3. 17 Nov, 2023 1 commit
  4. 08 Nov, 2023 2 commits
  5. 30 Oct, 2023 2 commits
  6. 16 Oct, 2023 1 commit
    • Paul Fultz II's avatar
      Enable MLIR by default for more cases (#2274) · 650ba45f
      Paul Fultz II authored
      This will enable MLIR by default for these cases:
      
      Any convolution fusion
      Any int8 gemm fusion
      All Navi3 standalone convolutions
      With a flag(ie MIGRAPHX_ENABLE_MLIR) to enable MLIR for floating-point gemm fusions
      Except:
      
      3x3 winnograd convolutions fusions (except on Navi)
      K > 2048 on gemm (as CK)
      Also there is MIGRAPHX_DISABLE_MLIR to disable MLIR completely.
      650ba45f
  7. 06 Oct, 2023 1 commit
  8. 28 Sep, 2023 1 commit
    • Umang Yadav's avatar
      Add options to set tolerances inside MIGraphX driver (#2213) · 69d8d789
      Umang Yadav authored
      MIGraphX verification by default uses normalized RMS error as the basis for the verification.  This change adds some logic to allow migraphx to do "np.allclose" type of elementwise verification using atol and rtol.
      
      Commit also includes changes to consistently pass "gold" or "expected" results as the second argument for "verify_range()" calls.  Default RMS tolerance inside driver is set to 0.001 which IMO is high for FP32 compared to what we had earlier. Need better defaults
      69d8d789
  9. 27 Sep, 2023 1 commit
  10. 13 Sep, 2023 1 commit
  11. 18 Aug, 2023 1 commit
  12. 10 Aug, 2023 1 commit
  13. 06 Aug, 2023 1 commit
  14. 30 Jul, 2023 1 commit
    • Paul Fultz II's avatar
      Enable tuning for MLIR (#1965) · be6ecff6
      Paul Fultz II authored
      * Add initial tuning support
      
      * Format
      
      * Add extra param
      
      * Format
      
      * Use exauhstive flag
      
      * Format
      
      * Set expected shapes
      
      * Format
      
      * Format
      
      * Fix missing symbol
      
      * Format
      
      * Add missing license header
      
      * Format
      
      * Update src/targets/gpu/include/migraphx/gpu/mlir.hpp
      be6ecff6
  15. 16 Jul, 2023 1 commit
  16. 05 Jul, 2023 1 commit
  17. 29 Jun, 2023 1 commit
  18. 22 Jun, 2023 1 commit
  19. 19 May, 2023 1 commit
  20. 17 May, 2023 1 commit
  21. 04 May, 2023 1 commit
    • Zhuoran Yin's avatar
      [mlir] Adding quant convolution fusion as anchor op (#1683) · 7f105952
      Zhuoran Yin authored
      Exposed the mlir_enabled() call the decide for lowering pipeline's enablement
      Disabled the rewrite quantization pipeline in mlir compilation
      Added quant convolution as anchor ops
      Fixed the return type expectations
      Added the fall back hip implementation for quantizelinear and dequantizelinear
      Will need advises to improve the implementation for quantizelinear
      7f105952
  22. 13 Apr, 2023 1 commit
  23. 06 Apr, 2023 1 commit
    • Charlie Lin's avatar
      Driver dynamic batch update (#1652) · adccec52
      Charlie Lin authored
      Examples..
      
      bin/driver verify /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --batch 3 --dyn-input-dim @data "[{min:1, max:4}, 3, 224, 224]"
      
      bin/driver compile /codes/onnx_models/resnet50-v1-7/resnet50-v1-7.onnx --split-single-dyn-dim --default-dyn-dim "{min:1, max:10}" --output resnet50_batch1-10.mxr
      
      bin/driver perf resnet50_batch1-10.mxr --batch 4
      adccec52
  24. 27 Mar, 2023 1 commit
  25. 18 Mar, 2023 1 commit
  26. 31 Jan, 2023 1 commit
    • Umang Yadav's avatar
      hipRTC fixes (#1531) · 91cc7242
      Umang Yadav authored
      Added CMakeFlag for hipRTC. MIGRAPHX_USE_HIPRTC.
      Added stages in Jenkins for hipRTC.
      Fixes for some of the pending issues from hipRTC.
      91cc7242
  27. 06 Dec, 2022 2 commits
  28. 27 Oct, 2022 1 commit
  29. 18 Oct, 2022 1 commit
  30. 13 Oct, 2022 1 commit
    • Charlie Lin's avatar
      Refactor dynamic padding mode (#1387) · 32f6388c
      Charlie Lin authored
      Removes use_dynamic_same_auto_pad
      Change padding_mode to be used for dynamic padding
      Move compute_padded_shape to pad_calc.cpp as it will be used in other dynamic padding cases
      Fix same_lower compute_padded_shape bug and add a test.
      32f6388c
  31. 04 Oct, 2022 1 commit
  32. 29 Sep, 2022 1 commit
  33. 28 Sep, 2022 1 commit
    • Umang Yadav's avatar
      Add compute_fp32 flag for quant_gemm tests (#1360) · 70e63960
      Umang Yadav authored
      test_gpu_pack_int8_args fails on gfx908 machine, because it doesn't set compute_fp32 flag correctly. This PR fixes the test such that it checks for the device-name, and rocblas-versions and sets this flag accordingly.
      70e63960
  34. 27 Sep, 2022 1 commit
  35. 23 Sep, 2022 1 commit
  36. 16 Sep, 2022 1 commit
  37. 15 Sep, 2022 1 commit