- 28 Feb, 2022 6 commits
-
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
- 26 Feb, 2022 3 commits
-
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
- 25 Feb, 2022 5 commits
-
-
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
Paul Fultz II authored
Needed for custom_op so we can generically convert the C type back to the C++ type in the function pointer.
-
Paul Fultz II authored
wrapped in a any_ptr class so the type can be checked at runtime for a mismatch.
-
- 24 Feb, 2022 1 commit
-
-
Paul Fultz II authored
Make doc/CMakeLists.txt standalone Switch to use rocm-cmake modules for document generation Add CONFIGURE_DEPENDS to file(GLOB) so it will update without an explicit cmake run Add STRINGS property for build type to make it easier to switch build types with ccmake Various fixes and improvements
-
- 23 Feb, 2022 1 commit
-
-
Shucai Xiao authored
This PR is the resolve two problems in the issue#999, i.e., non_standard_shape input to reshape and reduce_mean. Three fixes: Any operator that has a standard shape requirement will add a contiguous input for its input. Eliminate_contiguous, when computing whether a contiguous can be removed, we should use all the updated args, not just the one that is being checked. In two optimization in the simplify_reshape, we remove the contiguous in the reshaper name list, since eliminate_contiguous will remove the contiguous if it can be removed. the solution is add an attribute to the operator that requires standard input shape, then in the auto_contiguous pass, add a contiguous to every input of such operators.
-
- 22 Feb, 2022 1 commit
-
-
Paul authored
-
- 17 Feb, 2022 1 commit
-
-
dependabot[bot] authored
Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.5.2 to 2.5.3. - [Release notes](https://github.com/tensorflow/tensorflow/releases) - [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md) - [Commits](https://github.com/tensorflow/tensorflow/compare/v2.5.2...v2.5.3 ) --- updated-dependencies: - dependency-name: tensorflow dependency-type: direct:production ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- 16 Feb, 2022 2 commits
-
-
Umang Yadav authored
Support nonstandard shapes like slice, broadcast and transpose for the unsqueeze op
-
kahmed10 authored
-
- 14 Feb, 2022 1 commit
-
-
- 11 Feb, 2022 2 commits
-
-
kahmed10 authored
* add submodule test * remove for loop * simplify reshape test
-
Chris Austen authored
Update copyright for a python file that was modified when added to our code
-
- 09 Feb, 2022 6 commits
-
-
Paul Fultz II authored
There is now a MIGRAPHX_DISABLE_POINTWISE_FUSION to disable it
-
Umang Yadav authored
Support slice, broadcast and transpose shapes for the squeeze op.
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
- 08 Feb, 2022 7 commits
-
-
Charlie Lin authored
Changed MessagePack file extensions to mxr.
-
Paul Fultz II authored
This causes incorrect memory coloring, which was causing the accuracy failures in the vision model when enabling the pointwise fusions. Resnet50, inceptionv3 and inceptionv4 do verify now in the driver.
-
Paul Fultz II authored
Enforce types to avoid compilation error in pointwise fusions This fixes compile failure: gpt-2, fp16 on Navi
-
Khalique Ahmed authored
-
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
- 04 Feb, 2022 2 commits
-
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
- 02 Feb, 2022 1 commit
-
-
Paul Fultz II authored
Currently, MIGRAPHX_TRACE_EVAL=2 prints out the entire output buffer, but this can produce a lot of output. To make it easier to inspect and debug, using MIGRAPHX_TRACE_EVAL=2 now only prints 10 elements from the buffer(the first 5 and last 5) and shows any fp classifications found in the buffer(ie nans, infinity, etc). The previous behavior can still be enabled with MIGRAPHX_TRACE_EVAL=3.
-
- 01 Feb, 2022 1 commit
-
-
Paul Fultz II authored
This will also check the types using mypy on the CI.
-