1. 02 Jul, 2023 1 commit
    • Paul Fultz II's avatar
      Improvement to ck integration (#1859) · 3c9df3b4
      Paul Fultz II authored
      Add a CI job to test CK
      Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK
      Continue tuning even when there is invalid configs
      Fix a bug with parallel compilation not using all available threads
      Add additional test for gemms using half types
      Removed int32 as supported type since it doesnt pass our test suite
      3c9df3b4
  2. 08 Jun, 2023 1 commit
  3. 24 May, 2023 1 commit
  4. 02 Nov, 2022 1 commit
  5. 22 Jun, 2022 1 commit
  6. 29 Mar, 2022 1 commit
    • Paul Fultz II's avatar
      Refactor runtime compiled kernels to use the same compile_ops pipeline (#1125) · 661046c6
      Paul Fultz II authored
      This adds the infrastructure so we can compile everything in parallel, whereas before only pointwise kernels were compiled in parallel. This will also directly integrate with lowering and the gpu-driver. The kernels for pointwise and roialign are using this infrastructure. Scatternd is not since it does require standard shape.
      
      This also makes it easier to add new runtime compiled kernels in the future.
      661046c6
  7. 08 Feb, 2022 1 commit
  8. 18 Nov, 2021 1 commit
  9. 11 Nov, 2021 1 commit
    • Paul Fultz II's avatar
      Conditionally enable pointwise fusion (#992) · 157935ff
      Paul Fultz II authored
      This enables the pointwise fusions using the MIGRAPHX_ENABLE_POINTWISE_FUSION env variable. Its disabled by default since MIOpen fusions need to be refactored.
      
      This also adds a compile_ops pass to compile the pointwise modules. All tests except test_gpu_fast_math passes with MIGRAPHX_ENABLE_POINTWISE_FUSION=1 set.
      157935ff