1. 21 Aug, 2023 1 commit
  2. 09 Aug, 2023 1 commit
  3. 06 Aug, 2023 1 commit
  4. 03 Aug, 2023 1 commit
  5. 01 Aug, 2023 3 commits
  6. 31 Jul, 2023 2 commits
    • Lakhinder Walia's avatar
      Lw/fix half shape (#2000) · e4dc75ea
      Lakhinder Walia authored
      * Use shape of Instruction (instead of a default) in add_return()
      
      * Instruction validation fix: not to use a default shape value for comparison
      
      * Fix instruction::replace() to recompute shape for "@return"
      
      * handle the case of missing shape in an Instruction related Test
      
      * use compute_shape() to get op shapes + test case for tuple_type
      
      * add test case shape_test/return_shape_tuple
      
      * Add test for @return to check for half type
      
      * Move @return unit-tests around..; Address review comments
      
      * Broken comparison fix: comparison to a (default) shape of tuple_type
      
      * Test cases: (add) return_shape_empty & (modify) return_shape_tuple
      
      * modify the assert() statement
      e4dc75ea
    • Artur Wojcik's avatar
  7. 30 Jul, 2023 2 commits
  8. 29 Jul, 2023 1 commit
  9. 28 Jul, 2023 3 commits
  10. 27 Jul, 2023 2 commits
  11. 26 Jul, 2023 3 commits
  12. 25 Jul, 2023 4 commits
  13. 24 Jul, 2023 1 commit
  14. 23 Jul, 2023 2 commits
  15. 22 Jul, 2023 5 commits
  16. 21 Jul, 2023 3 commits
    • Umang Yadav's avatar
      Add back clamping and add tests (#1969) · 6957243c
      Umang Yadav authored
      Fixes #1957
      
      Clamping was removed in #1853.
      
      Turns out clamping as necessary to handle overflow/underflow cases. during downcasting, if it overflowed then without clamping it returned infinity.
      6957243c
    • Umang Yadav's avatar
      Use `optimize_module` pass for the quantization to fp16 (#1974) · 6f1f4b59
      Umang Yadav authored
      Fixes #1746
      
      BatchNorm only has x as the runtime input parameter for the following equation. All the other parameters are compile-time constants and related operations can be const-folded before quantizing to fp16 to preserve precision.
      6f1f4b59
    • Umang Yadav's avatar
      Make global workitems multiple of local workitems (#1976) · 3216fe52
      Umang Yadav authored
      HIP requires global work items in multiple of local work items. If it is not it is not guaranteed to generate correct results all the time.
      Fixes #1977
      Fixes #1644
      MIGraphX CI has moved to rocm-5.6 which doesn't require hipRTC workarounds
      3216fe52
  17. 19 Jul, 2023 4 commits
  18. 18 Jul, 2023 1 commit