- 10 Jul, 2023 3 commits
-
-
Paul Fultz II authored
-
github-actions[bot] authored
-
Brian Pickrell authored
Changes how the Pooling operation computes its result when there is padding. The old code clipped off any padding values before computing; for instance, if an average pooling window contained 0 1 2, where the 0 is padding, the result was 1.5 instead of 1.0. See Issue 1766.
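The arithmetic in that example can be reproduced with a small Python sketch. The function `avg_pool_1d` and its `count_padding` flag are hypothetical names made up for illustration; they are not part of the MIGraphX API, and the sketch assumes stride 1 and zero padding on both sides.

```python
def avg_pool_1d(values, window, pad, count_padding=True):
    """Average-pool a 1-D list with stride 1 and `pad` padded slots per side.

    Hypothetical helper for illustration only, not MIGraphX code.
    """
    padded = [None] * pad + list(values) + [None] * pad  # None marks padding
    out = []
    for start in range(len(padded) - window + 1):
        win = padded[start:start + window]
        real = [v for v in win if v is not None]
        # Padding adds 0 to the sum either way; the two behaviours differ
        # only in whether padded slots are counted in the divisor.
        divisor = window if count_padding else max(len(real), 1)
        out.append(sum(real) / divisor)
    return out
```

With `avg_pool_1d([1, 2], window=3, pad=1)` the first window covers one padded slot plus 1 and 2 and averages to 1.0. Passing `count_padding=False` drops the padded slot from the divisor and gives 1.5, which is the old behaviour described above.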
-
- 08 Jul, 2023 2 commits
-
-
Artur Wojcik authored
-
Artur Wojcik authored
Export API symbols for migraphx, migraphx_ref, migraphx_cpu, migraphx_gpu, migraphx_device, migraphx_tf, and migraphx_onnx. There is a separate PR for migraphx_c. API symbol exporting affects only Windows; it is transparent on Linux.
-
- 06 Jul, 2023 2 commits
-
-
Artur Wojcik authored
-
Paul Fultz II authored
This helps enable multi-target execution. We store a vector of targets and contexts. Currently this compiles only a single target; PR #1672 is needed to enable multiple targets. The targets and contexts are also serialized. Both execution_environment and prog.get_context() always use the context from the first target, on the assumption that it is the "primary" target, although it's unlikely a user would use execution_environment with a multi-target environment.
-
- 05 Jul, 2023 1 commit
-
-
kahmed10 authored
Fixes the failing test case in #1815. Added a test that would otherwise fail without the change.
-
- 02 Jul, 2023 3 commits
-
-
Charlie Lin authored
Updates the ref version of clip to work with dynamic shapes. Encountered in agentmodel.
-
Paul Fultz II authored
Add a CI job to test CK. Add the MIGRAPHX_TUNE_CK env variable to only do tuning for CK. Continue tuning even when there are invalid configs. Fix a bug with parallel compilation not using all available threads. Add an additional test for gemms using half types. Removed int32 as a supported type since it doesn't pass our test suite.
-
Umang Yadav authored
-
- 29 Jun, 2023 1 commit
-
-
Krzysztof Drewniak authored
Bump MLIR commit to include the latest supported pointwise ops. Expand the MLIR approve list. Ensure that operations such as tanh() that don't have integer implementations (at least in MLIR) aren't used within MLIR modules. Add additional tests.
-
- 27 Jun, 2023 1 commit
-
-
Artur Wojcik authored
-
- 23 Jun, 2023 1 commit
-
-
Umang Yadav authored
Fixes #1852 Fixes #1847
-
- 22 Jun, 2023 1 commit
-
-
Zhuoran Yin authored
Add mlir quant_dot operator support
-
- 20 Jun, 2023 1 commit
-
-
github-actions[bot] authored
Co-authored-by: causten <causten@users.noreply.github.com>
Co-authored-by: Ted Themistokleous <107195283+TedThemistokleous@users.noreply.github.com>
-
- 17 Jun, 2023 1 commit
-
-
Umang Yadav authored
* Fix convert for the NaNs * NaNs can't be compared, use std::isnan() * formatting * formatting * formatting * add extra tests
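As background for the second bullet: under IEEE 754 every ordered comparison involving NaN is false, so a range check written only with `<` and `>` cannot detect NaN. The sketch below shows the idea in Python rather than in the C++ of the actual change. `saturating_convert` and its bounds are hypothetical names, and the clamping behaviour is an assumption chosen for illustration, not a description of the MIGraphX convert operator.

```python
import math

def saturating_convert(x, lo, hi):
    """Clamp x into [lo, hi], passing NaN through unchanged.

    Hypothetical illustration, not the MIGraphX convert operator.
    """
    # NaN compares false against everything, so test for it explicitly
    # (the Python counterpart of std::isnan) before any range comparison.
    if math.isnan(x):
        return x
    if x < lo:
        return lo
    if x > hi:
        return hi
    return x
```

Without the explicit `math.isnan` check, a NaN input would fall through both comparisons by accident rather than by design, which is the kind of fragility the commit message points at.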
-
- 16 Jun, 2023 1 commit
-
-
Charlie Lin authored
* initial * Added tests and new functionality * Update optimals handling * Simplify conditionals * Ref test, update docs * Remove comment, suggestion unclear --------- Co-authored-by: Umang Yadav <29876643+umangyadav@users.noreply.github.com>
-
- 15 Jun, 2023 1 commit
-
-
Brian Pickrell authored
* fix parse_instancenorm to create broadcast and multibroadcast instructions with two dynamic shape arguments instead of 1. Their make_op() functions don't support dynamic shapes when called with one input. This caused an error when parsing an ONNX 3D-UNet model * Use add_common_op() to create multibroadcast op. * add verification and parsing test for instance_norm with dynamic input. Parse test doesn't pass. * fix for test; still doesn't pass * another fix for test; still doesn't pass * work in progress, instance_norm_dyn_batch_test works but instance_norm_test doesn't * fix onnx instancenorm tests to match parser changes. Passes all check tests * Updated comments explaining usage of add_common_op() * hand-merged conflicts with develop * fix instance_norm_half_test after merge * add ONNX test instance_norm_dyn_batch_half_test * add shape test cases broadcast_1in_dyn_error and multibroadcast_1in_dyn_error_0
-
- 13 Jun, 2023 1 commit
-
-
Charlie Lin authored
-
- 12 Jun, 2023 1 commit
-
-
Paul Fultz II authored
-
- 05 Jun, 2023 1 commit
-
-
Charlie Lin authored
Clarified the documentation for find_permutation(shape): it finds the permutation that would make the shape standard.
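One way to read "the permutation that would make the shape standard" is: the axis order that sorts the strides from largest to smallest, so that transposing by it gives a row-major layout. The Python sketch below illustrates that reading. It is an assumption about the intent, not the library's implementation, and `find_permutation_sketch` is a hypothetical name.

```python
def find_permutation_sketch(strides):
    """Return the axis order that sorts strides in descending order.

    Hypothetical illustration. Axes with equal strides keep their original
    relative order because Python's sort is stable, even with reverse=True.
    """
    axes = range(len(strides))
    return sorted(axes, key=lambda axis: strides[axis], reverse=True)
```

For a tensor whose strides are `(12, 1, 4)`, the sketch returns `[0, 2, 1]`: visiting the axes in that order walks the strides as 12, 4, 1. A shape that is already standard, such as strides `(12, 4, 1)`, yields the identity `[0, 1, 2]`.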
-
- 02 Jun, 2023 1 commit
-
-
Chris Austen authored
-
- 01 Jun, 2023 1 commit
-
-
Umang Yadav authored
Converting to fp32: the fp16 3d-unet model's accuracy comes out the same as the fp32 accuracy. Using the reduce_sum method in fp16: accuracy comes out ~0.9% lower than fp32 while keeping the entire model in fp16.
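The general effect behind this trade-off, a half-precision running sum losing small addends, can be seen in a toy Python sketch. It uses the standard `struct` module's `"e"` format to round to IEEE 754 half precision, and Python's 64-bit float stands in for the wider accumulator. The helper names are hypothetical, and the example does not reproduce the ~0.9% figure, which comes from the model evaluation.

```python
import struct

def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]

def sum_fp16(values):
    """Accumulate in half precision: the running total is rounded every step."""
    total = 0.0
    for v in values:
        total = to_fp16(total + to_fp16(v))
    return total

def sum_wide(values):
    """Keep fp16 inputs but accumulate in a wider type (a 64-bit float here)."""
    return sum(to_fp16(v) for v in values)

ones = [1.0] * 3000
fp16_total = sum_fp16(ones)  # stalls at 2048.0, where fp16 values are 2 apart
wide_total = sum_wide(ones)  # 3000.0
```

Once the half-precision total reaches 2048, adding 1.0 lands exactly between two representable values and rounds back down, so the sum stops growing. Accumulating in a wider type avoids this at the cost of a conversion.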
-
- 31 May, 2023 1 commit
-
-
Umang Yadav authored
Partially solves #1656. This PR only handles the compilation part of multitarget.
-
- 30 May, 2023 1 commit
-
-
Paul Fultz II authored
Use generate_argument instead of generate_literal for Python output, as generate_literal doesn't exist. Shorten the names of variables from the main module. Use the prefix p_ for parameters. Use the shorter variable m for the main module in Python.
-
- 25 May, 2023 1 commit
-
-
Ted Themistokleous authored
Use std::numeric_limits::min/max() functions plus the appropriate value to encode -inf/inf
-
- 20 May, 2023 1 commit
-
-
Umang Yadav authored
* use half hip functions to compute max and min * add verify test for min and max
-
- 19 May, 2023 1 commit
-
-
Zhuoran Yin authored
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
-
- 17 May, 2023 2 commits
-
-
Chris Austen authored
Move CI to support the rocm5.5 release
-
shivadbhavsar authored
Adding support for broadcasted scalars to the unsqueeze op. Specifying steps other than 1 is disallowed in this implementation since we want the output to always be a tensor. We can support varying step sizes if we allow a broadcasted scalar output from this op.
-
- 11 May, 2023 1 commit
-
-
github-actions[bot] authored
Co-authored-by: causten <causten@users.noreply.github.com>
-
- 05 May, 2023 1 commit
-
-
Charlie Lin authored
Python API with documentation updates
-
- 04 May, 2023 2 commits
-
-
Paul Fultz II authored
When either the input or the output is multiplied across the K dimension, the multiplier can be applied to the constant instead, which can then be folded with propagate_const.
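The algebra behind this can be checked on a small example, assuming K is the contracted dimension of a matrix product between an input x (M x K) and a constant w (K x N). Scaling column k of x by s[k] gives the same product as scaling row k of w by s[k]. The helper names below are hypothetical, and the sketch covers only the input-side case, using plain Python lists and integers.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def scale_input_along_k(x, s):
    """Multiply column k of x by s[k] (the multiply done at run time)."""
    return [[row[k] * s[k] for k in range(len(s))] for row in x]

def fold_scale_into_weights(w, s):
    """Multiply row k of the constant w by s[k] (can be done ahead of time)."""
    return [[w[k][j] * s[k] for j in range(len(w[0]))] for k in range(len(s))]

x = [[1, 2, 3], [4, 5, 6]]    # M x K input
w = [[1, 0], [2, 1], [0, 3]]  # K x N constant weights
s = [2, 3, 5]                 # one scale factor per K index

before = matmul(scale_input_along_k(x, s), w)
after = matmul(x, fold_scale_into_weights(w, s))
```

Both `before` and `after` evaluate to `[[14, 51], [38, 105]]`. Because `w` and `s` are both constants, the scaled weights can be computed once, which is what makes the rewritten form a candidate for constant folding.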
-
Zhuoran Yin authored
Exposed the mlir_enabled() call to decide whether the lowering pipeline is enabled. Disabled the rewrite quantization pipeline in MLIR compilation. Added quant convolution as an anchor op. Fixed the return type expectations. Added the fallback HIP implementation for quantizelinear and dequantizelinear. Will need advice on improving the implementation for quantizelinear.
-
- 03 May, 2023 1 commit
-
-
Charlie Lin authored
Relies on "Removed split_single_dyn_dim compile flag" (#1711). Exposes dynamic_dimension as an opaque object with dynamic_dimensions and optimals. Exposes ONNX dyn_input_dims and default_dyn_dim to run with dynamic batch. Updates api.py to be able to create objects from aggregate initialization (used for dynamic_dimension). Uses offload copy for now.
-
- 02 May, 2023 1 commit
-
-
Paul Fultz II authored
Improves constant propagation for BERT models. Larger batch sizes no longer use such large constants. Also improves the speed of model compilation.
-
- 28 Apr, 2023 1 commit
-
-
Charlie Lin authored
-
- 24 Apr, 2023 2 commits
-
-
Paul Fultz II authored
This fixes #1700
-
Paul Fultz II authored
-