"doc/vscode:/vscode.git/clone" did not exist on "ea75849d03f904102c76deddc649507c0ce02c14"
  1. 24 Oct, 2023 1 commit
  2. 16 Oct, 2023 1 commit
    • Paul Fultz II's avatar
      Enable MLIR by default for more cases (#2274) · 650ba45f
      Paul Fultz II authored
      This will enable MLIR by default for these cases:
      
      Any convolution fusion
      Any int8 gemm fusion
      All Navi3 standalone convolutions
      With a flag(ie MIGRAPHX_ENABLE_MLIR) to enable MLIR for floating-point gemm fusions
      Except:
      
      3x3 winnograd convolutions fusions (except on Navi)
      K > 2048 on gemm (as CK)
      Also there is MIGRAPHX_DISABLE_MLIR to disable MLIR completely.
      650ba45f
  3. 12 Oct, 2023 1 commit
  4. 28 Sep, 2023 1 commit
  5. 27 Sep, 2023 1 commit
    • Krzysztof Drewniak's avatar
      [mlir] Apply is_mlir_conv predicate in standalone MLIr offloading (#2249) · a761ffaa
      Krzysztof Drewniak authored
      Currently, the is_mlir_conv predicate wasn't being used when
      offloading standalone convolutions to MLIR on Navi3x, which caused
      failures relating to being unable to construct the MLIR program when a
      3D convlolution was passed in.
      
      This commit amends the standalone lowering to use said predicate, as
      well as to include quant_convolution and quant_dot into the set of
      operations that get a standalone lowering.
      a761ffaa
  6. 16 Sep, 2023 1 commit
  7. 15 Sep, 2023 1 commit
  8. 13 Sep, 2023 1 commit
  9. 11 Sep, 2023 1 commit
  10. 11 Aug, 2023 1 commit
  11. 08 Aug, 2023 1 commit
  12. 22 Jul, 2023 1 commit
  13. 29 Jun, 2023 1 commit
  14. 22 Jun, 2023 1 commit
  15. 05 May, 2023 1 commit
  16. 04 May, 2023 1 commit
    • Zhuoran Yin's avatar
      [mlir] Adding quant convolution fusion as anchor op (#1683) · 7f105952
      Zhuoran Yin authored
      Exposed the mlir_enabled() call the decide for lowering pipeline's enablement
      Disabled the rewrite quantization pipeline in mlir compilation
      Added quant convolution as anchor ops
      Fixed the return type expectations
      Added the fall back hip implementation for quantizelinear and dequantizelinear
      Will need advises to improve the implementation for quantizelinear
      7f105952
  17. 27 Mar, 2023 1 commit
  18. 13 Mar, 2023 1 commit
  19. 18 Oct, 2022 1 commit
  20. 19 Sep, 2022 1 commit
    • Paul Fultz II's avatar
      Improve layernorm and reductions performance (#1348) · 97a1ed2d
      Paul Fultz II authored
      Compute mean and variance in same reduction
      Set block size to numbers divisible by 32 instead powers of 2
      Global is also set exactly instead of being divisible by block size
      More exact matching of global/local can help get rid of branching/loops
      Reduce vectors first before doing dpp_reduce
      Explicitly vectorize array operators since the compiler doesnt always vectorize them
      Still uses old for loop when its computing at compile-time since the reinterpret_cast nor the all the vector types is supported
      97a1ed2d
  21. 03 Jul, 2022 1 commit
    • Paul Fultz II's avatar
      Add mlir fusion (#1251) · ca8a54fe
      Paul Fultz II authored
      * Add mlir c api
      
      * Formatting
      
      * Create a type attribute
      
      * Formatting
      
      * Parse module
      
      * Formatting
      
      * Add mlir dump function
      
      * Add test case
      
      * Formatting
      
      * Fix tidy issues
      
      * Update mlit version
      
      * Update to newer mlir
      
      * Format
      
      * Move mlir to the gpu and update the test
      
      * Formatting
      
      * Fix bug when appending module
      
      * Format
      
      * Remove old cmake flag
      
      * Update message
      
      * Add return
      
      * Format
      
      * Add mlir_compile
      
      * Format
      
      * Register dialect
      
      * Handle unsinged integers
      
      * Dont provide output for return instruction
      
      * Format
      
      * Add code to insert memrefs
      
      * Format
      
      * Add mlir verification
      
      * Formatting
      
      * Enable pointwise_fusion
      
      * Disable eliminate_data_type
      
      * Set kernal name
      
      * Format
      
      * Fix device name
      
      * Formatting
      
      * Fix output arg
      
      * Format
      
      * Updates
      
      * Upate hash
      
      * Add fuse_mlir pass
      
      * Format
      
      * Add fuse mlir
      
      * Format
      
      * Update mlir
      
      * Sort parameter names
      
      * Format
      
      * Reenable disabled passes
      
      * Remove old mlir conv
      
      * Remove asym default padding
      
      * Add more verbose tracing
      
      * Format
      
      * Fix compilation errors
      
      * Format
      
      * Whitelist operators
      
      * Format
      
      * Add namespace
      
      * Format
      
      * Update triple
      
      * Format
      
      * Use func dialect
      
      * Format
      
      * Use func.return
      
      * Format
      
      * Upgrade mlir version
      
      * Add comment
      
      * Handle symetrical padding
      
      * Format
      
      * Cleanup debug output
      
      * Format
      
      * List failed tests
      
      * Move mlir compile to jit pipeline
      
      * Format
      
      * Update version
      
      * Add source locations
      
      * Format
      
      * Correctly add module
      
      * Format
      
      * Update failed tests
      
      * Fix failures when mlir is disabled
      
      * Format
      
      * Update mlir version
      
      * Check type for fp32
      
      * Format
      
      * Remove failed test
      
      * Update mlir in driver
      
      * Tidy fixes
      
      * Foramt
      
      * Tidy fixes
      
      * Format
      
      * Fix const
      
      * Remove from requirements
      
      * Fix cmake version
      
      * Fix tidy warning
      
      * Use another ifdef
      
      * Fix tidy
      
      * Other tidy fix
      
      * Format
      
      * Update hash
      
      * Add missing license files
      
      * Format
      
      * Format
      
      * Fix fnction name
      ca8a54fe