1. 11 Aug, 2023 1 commit
  2. 02 Aug, 2023 2 commits
  3. 01 Aug, 2023 2 commits
  4. 25 Jul, 2023 3 commits
  5. 23 Jul, 2023 1 commit
  6. 22 Jul, 2023 3 commits
  7. 21 Jul, 2023 3 commits
    • Umang Yadav's avatar
      Add back clamping and add tests (#1969) · 6957243c
      Umang Yadav authored
      Fixes #1957
      
      Clamping was removed in #1853.
      
      Turns out clamping as necessary to handle overflow/underflow cases. during downcasting, if it overflowed then without clamping it returned infinity.
      6957243c
    • Umang Yadav's avatar
      Use `optimize_module` pass for the quantization to fp16 (#1974) · 6f1f4b59
      Umang Yadav authored
      Fixes #1746
      
      BatchNorm only has x as the runtime input parameter for the following equation. All the other parameters are compile-time constants and related operations can be const-folded before quantizing to fp16 to preserve precision.
      6f1f4b59
    • Umang Yadav's avatar
      Make global workitems multiple of local workitems (#1976) · 3216fe52
      Umang Yadav authored
      HIP requires global work items in multiple of local work items. If it is not it is not guaranteed to generate correct results all the time.
      Fixes #1977
      Fixes #1644
      MIGraphX CI has moved to rocm-5.6 which doesn't require hipRTC workarounds
      3216fe52
  8. 19 Jul, 2023 3 commits
  9. 18 Jul, 2023 2 commits
  10. 17 Jul, 2023 3 commits
  11. 16 Jul, 2023 1 commit
  12. 13 Jul, 2023 4 commits
  13. 11 Jul, 2023 4 commits
  14. 10 Jul, 2023 2 commits
  15. 09 Jul, 2023 1 commit
  16. 08 Jul, 2023 2 commits
  17. 06 Jul, 2023 3 commits
    • Artur Wojcik's avatar
    • Paul Fultz II's avatar
      Use MIGRAPHX_GLOBAL (#1918) · c45b34c3
      Paul Fultz II authored
      This will also annotate the function with the block size so the compiler can do a better job of optimizing.
      c45b34c3
    • Paul Fultz II's avatar
      Enable eval to handle multiple contexts (#1751) · 072fd5cc
      Paul Fultz II authored
      This is to help enable multi-target execution. We store a vector of targets and contexts. Currently this will only compile a single target, the PR #1672 is needed to enable multiple targets.
      
      This will also serialize the targets and contexts.
      
      When using the execution_environment or prog.get_context() it will always use the context from the first target assuming this is the "primary" target. Although, its unlikely a user would use execution_environment with a multi-target environment.
      072fd5cc