1. 28 Jul, 2023 2 commits
  2. 27 Jul, 2023 1 commit
  3. 26 Jul, 2023 1 commit
  4. 25 Jul, 2023 1 commit
  5. 23 Jul, 2023 1 commit
  6. 22 Jul, 2023 2 commits
  7. 21 Jul, 2023 2 commits
    • Add back clamping and add tests (#1969) · 6957243c
      Umang Yadav authored
      Fixes #1957
      
      Clamping was removed in #1853.
      
      It turns out clamping is necessary to handle overflow/underflow cases. During downcasting, a value that overflowed was returned as infinity when it was not clamped.
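      The clamping described above can be sketched as follows. This is a minimal illustration in Python, not the code from the commit: the helper names `to_fp16` and `clamp_to_fp16` are hypothetical, and the sketch assumes the downcast target is IEEE 754 half precision, whose largest finite value is 65504.

```python
import struct

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def to_fp16(value: float) -> float:
    """Round a float to half precision and return it as a Python float."""
    return struct.unpack("<e", struct.pack("<e", value))[0]


def clamp_to_fp16(value: float) -> float:
    """Saturate to the finite fp16 range, then round to half precision.

    Hypothetical helper for illustration; NaN handling is out of scope here.
    """
    clamped = max(-FP16_MAX, min(FP16_MAX, value))
    return to_fp16(clamped)


if __name__ == "__main__":
    for sample in (1.5, 1.0e6, -1.0e6):
        print(sample, "->", clamp_to_fp16(sample))
```

      Note that Python's `struct` module raises `OverflowError` for an out-of-range value instead of producing infinity, so the unclamped path here differs from the behavior the commit describes; only the clamped path is meant to mirror it.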
    • Use `optimize_module` pass for the quantization to fp16 (#1974) · 6f1f4b59
      Umang Yadav authored
      Fixes #1746
      
      BatchNorm has only x as a runtime input parameter in its equation. All the other parameters are compile-time constants, so the related operations can be const-folded before quantizing to fp16 to preserve precision.
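      The idea can be sketched in a few lines. The equation is not reproduced in this log, so the sketch assumes the standard inference-time form y = scale * (x - mean) / sqrt(variance + epsilon) + bias. The function names are hypothetical and this is not the `optimize_module` pass itself; it only shows why folding the constants in full precision first leaves fewer values to round to fp16.

```python
import math
import struct


def to_fp16(value: float) -> float:
    """Round a float to IEEE 754 half precision."""
    return struct.unpack("<e", struct.pack("<e", value))[0]


def fold_batchnorm(scale, bias, mean, variance, epsilon=1e-5):
    """Fold the compile-time constants into one multiplier and one offset.

    Done in full precision, so epsilon and the square root are never
    evaluated in fp16. Assumes the standard BatchNorm inference formula.
    """
    multiplier = scale / math.sqrt(variance + epsilon)
    offset = bias - mean * multiplier
    return multiplier, offset


def batchnorm_fp16(x, multiplier, offset):
    """Apply the folded constants, rounding each intermediate to fp16."""
    m16, o16, x16 = to_fp16(multiplier), to_fp16(offset), to_fp16(x)
    return to_fp16(to_fp16(x16 * m16) + o16)


if __name__ == "__main__":
    m, o = fold_batchnorm(scale=2.0, bias=0.5, mean=1.0, variance=4.0)
    print(batchnorm_fp16(3.0, m, o))
```

      After folding, only the multiplier and the offset are quantized, instead of scale, bias, mean, variance, and epsilon individually.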
  8. 19 Jul, 2023 2 commits
  9. 16 Jul, 2023 1 commit
  10. 13 Jul, 2023 1 commit
    • Update deconvolution -> convolution_backwards and Dynamic Shape Support (#1801) · 4edf1195
      Charlie Lin authored
      * Renames deconvolution -> convolution_backwards to be more consistent with the literature.
        Note: this is not the cross-correlation operator (which is the adjoint of convolution). This is technically a standard convolution operator combined with an upsampling operator rather than a downsampling operator.
      * Adds unit tests for the padding, strides, dilations, and other op attributes.
      * Throws on the auto_pad attribute, since it has not been implemented. Previously the parser read the attribute and set it but then did nothing with it.
      * Extended for dynamic shapes. Does not support asymmetric padding (padding_L != padding_R) or output_shape with dynamic shapes.
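      The "convolution combined with upsampling" description can be illustrated with a one-dimensional sketch. This is not the MIGraphX operator; the function names are hypothetical, and the sketch assumes symmetric padding and the scatter-style definition commonly used for transposed convolution.

```python
def output_length(in_len, kernel_len, stride=1, dilation=1, padding=0):
    """Output size of a 1-D transposed convolution with symmetric padding."""
    return stride * (in_len - 1) + dilation * (kernel_len - 1) + 1 - 2 * padding


def conv_transpose_1d(x, w, stride=1, dilation=1, padding=0):
    """Scatter each input element, scaled by the kernel, into the output.

    The stride spreads inputs apart (the upsampling step); padding then
    crops the same number of elements from both ends.
    """
    full_len = stride * (len(x) - 1) + dilation * (len(w) - 1) + 1
    full = [0.0] * full_len
    for i, xi in enumerate(x):
        for k, wk in enumerate(w):
            full[i * stride + k * dilation] += xi * wk
    return full[padding:full_len - padding]


if __name__ == "__main__":
    print(conv_transpose_1d([1.0, 2.0], [1.0, 1.0, 1.0], stride=2))
    print(output_length(in_len=2, kernel_len=3, stride=2))
```

      With a stride of 2 the output is longer than the input, which is the opposite of what the same stride does in a forward convolution.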
  11. 10 Jul, 2023 3 commits
  12. 08 Jul, 2023 2 commits
  13. 06 Jul, 2023 2 commits
    • Enable eval to handle multiple contexts (#1751) · 072fd5cc
      Paul Fultz II authored
      This is to help enable multi-target execution. We store a vector of targets and contexts. Currently this will only compile a single target; PR #1672 is needed to enable multiple targets.
      
      This will also serialize the targets and contexts.
      
      When using the execution_environment or prog.get_context(), it will always use the context from the first target, assuming this is the "primary" target. It is unlikely, though, that a user would use execution_environment with a multi-target environment.
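      The arrangement described above might look roughly like the following. Every name in this sketch (`Context`, `Program`, `compile`, `to_dict`) is hypothetical and chosen for illustration; it is not the MIGraphX API, only a model of "store parallel lists of targets and contexts, and treat the first as primary".

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Context:
    """Hypothetical stand-in for a per-target execution context."""
    target_name: str


@dataclass
class Program:
    """Hypothetical program holding one context per compiled target."""
    targets: List[str] = field(default_factory=list)
    contexts: List[Context] = field(default_factory=list)

    def compile(self, targets: List[str]) -> None:
        # Keep targets and contexts as parallel lists, in the order given.
        self.targets = list(targets)
        self.contexts = [Context(name) for name in self.targets]

    def get_context(self) -> Context:
        # The first target is assumed to be the "primary" one.
        if not self.contexts:
            raise RuntimeError("program has not been compiled")
        return self.contexts[0]

    def to_dict(self) -> Dict[str, List[str]]:
        # Serialize both lists so they can be restored together.
        return {
            "targets": list(self.targets),
            "contexts": [c.target_name for c in self.contexts],
        }


if __name__ == "__main__":
    prog = Program()
    prog.compile(["gpu", "cpu"])
    print(prog.get_context())
```

      Returning the first context keeps single-target callers working unchanged, which matches the commit's remark that multi-target users are unlikely to rely on this accessor.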
  14. 05 Jul, 2023 1 commit
  15. 02 Jul, 2023 3 commits
  16. 29 Jun, 2023 1 commit
  17. 27 Jun, 2023 1 commit
  18. 23 Jun, 2023 1 commit
  19. 22 Jun, 2023 1 commit
  20. 20 Jun, 2023 1 commit
  21. 17 Jun, 2023 1 commit
  22. 16 Jun, 2023 1 commit
  23. 15 Jun, 2023 1 commit
    • fix parse_instancenorm to create broadcast and multibroadcast instruc… (#1715) · 41ba30d5
      Brian Pickrell authored
      * fix parse_instancenorm to create broadcast and multibroadcast instructions with two dynamic shape arguments instead of one. Their make_op() functions don't support dynamic shapes when called with one input, which caused an error when parsing an ONNX 3duunet model.
      
      * Use add_common_op() to create multibroadcast op.
      
      * add verification and parsing test for instance_norm with dynamic input.  Parse test doesn't pass.
      
      * fix for test; still doesn't pass
      
      * another fix for test; still doesn't pass
      
      * work in progress, instance_norm_dyn_batch_test works but instance_norm_test doesn't
      
      * fix onnx instancenorm tests to match parser changes.  Passes all check tests
      
      * Updated comments explaining usage of add_common_op()
      
      * hand-merged conflicts with develop
      
      * fix instance_norm_half_test after merge
      
      * add ONNX test instance_norm_dyn_batch_half_test
      
      * add shape test cases broadcast_1in_dyn_error and multibroadcast_1in_dyn_error_0
  24. 13 Jun, 2023 1 commit
  25. 12 Jun, 2023 1 commit
  26. 05 Jun, 2023 1 commit
  27. 02 Jun, 2023 1 commit
  28. 01 Jun, 2023 1 commit
  29. 31 May, 2023 1 commit
  30. 30 May, 2023 1 commit
    • Improvements to driver output (#1710) · d32ab85b
      Paul Fultz II authored
      Use generate_argument instead of generate_literal for python output, because generate_literal doesn't exist
      Shorten the names for variables from the main module
      Use prefix p_ for parameters
      Use shorter variable m for main module in python