- 15 Sep, 2022 1 commit
-
-
Lixun Zhang authored
* Replaced `find_library` with `find_package` to locate MLIR static library * Unified the include dir for headers and remove backward compatibility * Embedded the external/include dir into the exported library
-
- 14 Sep, 2022 4 commits
-
-
turneram authored
The verify tests from pr #1354 were still causing some codecov timeouts after merge. This PR further reduces the problem sizes to avoid these failures.
-
Umang Yadav authored
* fix slice_dim1 for case
-
Paul Fultz II authored
* Implement concat using jit compilation
-
shivadbhavsar authored
expose underlying p data inter for migraphx.argument Update python api documentation
-
- 13 Sep, 2022 1 commit
-
-
turneram authored
Improves performance for 4/6 GEMMs used by huggingface BERT models with batch_size>1 by using a non-batched rocBLAS call for GEMMs where the B input has a broadcasted batch dimension. The four verify tests added reflect the actual configurations used by bert-base-cased, with varied batch sizes. Also adds a matcher to simplify_reshapes to move multibroadcasts after concats.
-
- 09 Sep, 2022 1 commit
-
-
Chris Austen authored
migraphx version is now 2.4
-
- 08 Sep, 2022 2 commits
-
-
Paul Fultz II authored
* Remove unused headers
-
Charlie Lin authored
Fixes TF literal parsing for relu6. previously always made a float type literal, breaks for float16 as an example
-
- 07 Sep, 2022 1 commit
-
-
Paul Fultz II authored
* Fix accuracy bug when vectorizing slices
-
- 06 Sep, 2022 1 commit
-
-
Paul Fultz II authored
Using not and or improves readability. The cppcheck rule will help ensure we are doing it consistently.
-
- 31 Aug, 2022 1 commit
-
-
turneram authored
Rewrite_gelu pass replaces the gelu formula of x * (1/2) * (1 + erf(x/sqrt(2))) with the sigmoid approximation of x * Sigmoid(x * 1.702)
-
- 29 Aug, 2022 1 commit
-
-
Umang Yadav authored
reshape op requires standard shape. During simplify_algebra, it inserts reshapes without checking for this requirement.
-
- 27 Aug, 2022 2 commits
-
-
Paul Fultz II authored
* Track kernel time
-
Paul Fultz II authored
This will rewrite dot operators like X(Y + b) to XY + Xb when b is constant as we can fold the add away. This improves handling pointwise with broadcasted operators, this helps improves const propagation. Improve gemm fusion with a mul_add Improve support for broadcast shapes in gemm
-
- 26 Aug, 2022 1 commit
-
-
Charlie Lin authored
-
- 24 Aug, 2022 1 commit
-
-
Charlie Lin authored
declare a missing std::size
-
- 23 Aug, 2022 1 commit
-
-
Charlie Lin authored
Has NMS op output a dynamic shape (ONNX spec behavior) Allows for dynamic input shape to NMS op
-
- 21 Aug, 2022 1 commit
-
-
varunsh authored
* Update is_supported * Return object from is_supported * Return by reference in interator
-
- 19 Aug, 2022 3 commits
-
-
Paul Fultz II authored
-
Charlie Lin authored
remove print from source
-
kahmed10 authored
* fix json strings in driver models
-
- 18 Aug, 2022 1 commit
-
-
shivadbhavsar authored
Add function argument_from_pointer to allow directly passing a migraphx.shape object and a memory address. Expose the is_compiled() method from migraphx::program. Expose the enum types under migraphx::op.
-
- 17 Aug, 2022 3 commits
-
-
Chris Austen authored
-
Paul Fultz II authored
-
Paul Fultz II authored
* Horizontally fuse contiguous
-
- 16 Aug, 2022 2 commits
-
-
Paul Fultz II authored
-
Umang Yadav authored
Removes unnecessary semi-colon after call to MACRO
-
- 12 Aug, 2022 2 commits
-
-
Charlie Lin authored
-
Krzysztof Drewniak authored
Once https://github.com/ROCmSoftwarePlatform/llvm-project-mlir/pull/690 lands, the ABI for MLIR-generated kernels will change. This commit prepares MIGraphX for the change by conditionally selecting the new ABI if MLIR reports a sufficiently high API version in its headers.
-
- 11 Aug, 2022 1 commit
-
-
Chris Austen authored
Change Dockerfile to use main instead of master for ORT operations
-
- 09 Aug, 2022 2 commits
-
-
Paul Fultz II authored
* Allow license_stamper.py to be ran from any directory * Format Co-authored-by:kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
Umang Yadav authored
* fix rocblas pointer mode * fix formatting * formatting * revert header change Co-authored-by:umangyadav <umang.yadav@amd.com>
-
- 08 Aug, 2022 1 commit
-
-
Ted Themistokleous authored
* Imply type of literal returned based on input protobuff for zero element constant values. This saves us the default behavior as the onnx parsing assumes that every zero value is float. This way we're still grabbing relevant type information from the protobuff instead and wont fail our data type checks for if them/else blocks from onnx * Revert "Imply type of literal returned based on input protobuff for zero element constant values." This reverts commit 390bb853 . * Add test case to parse in empty constant int64 proto buffer I think the previous test case was aliasing an issue where we default to float but need to actually read in int64 instead of int32 * fixup! Add test case to parse in empty constant int64 proto buffer * Add test for non empty int64 scalar Add one item in the np array to use for the constant we're parsing in. * Draft partial fix * Fix test failures from previous change to read in protobuf data types correctly for empty constants. Instead of assuming things are empty and thus we default to float, reading in the correct types broke some assumptions code was using for an empty literal. * Fix formatting and naming * Fix naming with var in constant_one_val_int64_test Co-authored-by:
charlie <charlie.lin@amd.com> Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 04 Aug, 2022 2 commits
-
-
Charlie Lin authored
* Dynamic shape handling in shape object * rewrite empty lens multibroadcast test * Shape class changes to handle dynamic * More throw errors for functions that don't make sense for dynamic shape * Print output changes * Serialization changes * Fixing serialization errors * Remove const on dyn_dim copy getters * Dynamic shape tests * Fix serialize errors * Add dyn_data struct to avoid ambiguous constructor * Tidy fix: emplace_back() over for loop * Tidy fix: use move * Use std::initializer_list in constructor Reverts the dyn_data struct change Should get around the ambiguous braced initialization list error * avoid typedef * element_space, min,max,opt _lens change * formatting * Comments fix * dynamic bytes() test * Seralize and reflect changes * formatting * Test the dynamic lens functions * progress * Formatting * Dynamic conv draft progress * Add operator<< tests for coverage * Coverage update * Add to conv dynamic batch test * Dynamic image size test * Dynamic weight handling * Dyn image shape test change, fix dyn weight cond * Comment update * Dynamic weights shape test and fix * Use ternary operator * Tidy fixes * Handle dynamic graph input shapes in ONNX parser * Formatting * Handle dynamic shape for convolution * formatting * cppcheck fixes * Add onnx test files * Fix typo * Disable auto_pad for dynamic input shape * check_shapes object checks for allowing dynamic shapes * Fix any_of * Change to maintain const objectness * Formatting * Check shapes allow dynamic * Refactor compute_shape() call into op.compute() Allows for per operator differences with handling dynamic shape Fix operation.hpp change to use the generator * Comment fix * Refactor normalize_attributes() calls to use max_lens() * Comment addition * Update other normalize_attributes() calls * Change to using constructor and add tests * Use const member function * Add more dynamic shape support * Add tests for error code coverage * Fix opt shape bug and add shape tests * capture all by ref * Fix typo with img shape calculation * Add more tests * dynamic auto pad attempt Linker error with pad_calc.cpp * Fix parse dyn auto_pad Should only need to use dynamic auto pad when the image shape or kernel shape are dynamic. For a dynamic batch size, the auto pad calculation is the same. * Fix linking error * Fix auto_pad bug Fixed input tensor with auto_pad setting on * auto_pad onnx tests * Fix auto_pad calculation, evaluate in ref_conv add ref_ops tests * Add shape tests, fix bugs * Refactor first two output dynamic len calculation * Conv MLIR test update * i64 MLIR test fix * Fix MLIR test typo Co-authored-by:Chris Austen <causten@users.noreply.github.com>
-
Umang Yadav authored
-
- 02 Aug, 2022 2 commits
-
-
Paul Fultz II authored
* Improve type printing in driver * Improve error with incorrect order for command * Add spell checking of arguments * Add validations and required checking * Add required arguments and groups
-
jungpark-mlir authored
-
- 30 Jul, 2022 1 commit
-
-
kahmed10 authored
Added an Accuracy checker to the tools directory. Currently compares ONNX FP32 models against ORT CPUEP
-
- 29 Jul, 2022 1 commit
-
-
Umang Yadav authored
Currently, while copying a host buffer to the device, it first registers/maps the host buffer pointer to address space of the device. If the host buffer has been allocated by the hipHostMalloc then, it is implicitly registered to the device's address space, and no need to register again. This PR adds a check for the same.
-