1. 01 Aug, 2023 1 commit
  2. 25 Jul, 2023 3 commits
  3. 23 Jul, 2023 1 commit
  4. 22 Jul, 2023 3 commits
  5. 21 Jul, 2023 3 commits
    • Umang Yadav's avatar
      Add back clamping and add tests (#1969) · 6957243c
      Umang Yadav authored
      Fixes #1957
      
      Clamping was removed in #1853.
      
      Turns out clamping as necessary to handle overflow/underflow cases. during downcasting, if it overflowed then without clamping it returned infinity.
      6957243c
    • Umang Yadav's avatar
      Use `optimize_module` pass for the quantization to fp16 (#1974) · 6f1f4b59
      Umang Yadav authored
      Fixes #1746
      
      BatchNorm only has x as the runtime input parameter for the following equation. All the other parameters are compile-time constants and related operations can be const-folded before quantizing to fp16 to preserve precision.
      6f1f4b59
    • Umang Yadav's avatar
      Make global workitems multiple of local workitems (#1976) · 3216fe52
      Umang Yadav authored
      HIP requires global work items in multiple of local work items. If it is not it is not guaranteed to generate correct results all the time.
      Fixes #1977
      Fixes #1644
      MIGraphX CI has moved to rocm-5.6 which doesn't require hipRTC workarounds
      3216fe52
  6. 19 Jul, 2023 3 commits
  7. 18 Jul, 2023 2 commits
  8. 17 Jul, 2023 3 commits
  9. 16 Jul, 2023 1 commit
  10. 13 Jul, 2023 4 commits
  11. 11 Jul, 2023 4 commits
  12. 10 Jul, 2023 2 commits
  13. 09 Jul, 2023 1 commit
  14. 08 Jul, 2023 2 commits
  15. 06 Jul, 2023 3 commits
    • Artur Wojcik's avatar
    • Paul Fultz II's avatar
      Use MIGRAPHX_GLOBAL (#1918) · c45b34c3
      Paul Fultz II authored
      This will also annotate the function with the block size so the compiler can do a better job of optimizing.
      c45b34c3
    • Paul Fultz II's avatar
      Enable eval to handle multiple contexts (#1751) · 072fd5cc
      Paul Fultz II authored
      This is to help enable multi-target execution. We store a vector of targets and contexts. Currently this will only compile a single target, the PR #1672 is needed to enable multiple targets.
      
      This will also serialize the targets and contexts.
      
      When using the execution_environment or prog.get_context() it will always use the context from the first target assuming this is the "primary" target. Although, its unlikely a user would use execution_environment with a multi-target environment.
      072fd5cc
  16. 05 Jul, 2023 2 commits
  17. 02 Jul, 2023 2 commits
    • Charlie Lin's avatar
      Dynamic shape ref `clip` operator (#1862) · 3f566882
      Charlie Lin authored
      Updates ref version of clip to work with dynamic shapes
      Encountered in agentmodel
      3f566882
    • Paul Fultz II's avatar
      Improvement to ck integration (#1859) · 3c9df3b4
      Paul Fultz II authored
      Add a CI job to test CK
      Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK
      Continue tuning even when there is invalid configs
      Fix a bug with parallel compilation not using all available threads
      Add additional test for gemms using half types
      Removed int32 as supported type since it doesnt pass our test suite
      3c9df3b4