- 16 Aug, 2023 1 commit
-
-
Ted Themistokleous authored
Make reshape work like contiguous to perform the copy, then add proper aliasing in lowering if we're unable to perform a replace instruction.
-
- 15 Aug, 2023 4 commits
-
-
Ted Themistokleous authored
-
Ted Themistokleous authored
Not needed: we don't need the extra contiguous, since we're trying to eliminate the need for contiguous.
-
Ted Themistokleous authored
Don't need this anymore
-
Ted Themistokleous authored
-
- 11 Aug, 2023 1 commit
-
-
Ted Themistokleous authored
Unable to get dims() to populate correctly. Fails at replace_instruction in lowering.
-
- 09 Aug, 2023 2 commits
-
-
Ted Themistokleous authored
These will be cleaned up later, but for now they result in a contiguous being added after every reshape, before the find_reshape_alias matcher runs.
-
Ted Themistokleous authored
Use this to find a reshape->contiguous pair and determine whether aliasing can be done, then use the proper reshape or reshape_lazy operator accordingly.
-
- 04 Aug, 2023 2 commits
-
-
Ted Themistokleous authored
This will get filtered out once proper order and aliasing are determined.
-
Ted Themistokleous authored
-
- 28 Jul, 2023 3 commits
-
-
Ted Themistokleous authored
The case here is to let reshape fall into a contiguous, which does the copy required to convert a nonstandard shape to a standard shape.
-
Ted Themistokleous authored
-
Ted Themistokleous authored
Just make the old reshape into reshape_lazy since this is going to be performing aliasing after we perform a proper pass of reshapes.
-
- 05 Jul, 2023 3 commits
-
-
kahmed10 authored
Fixes the failing test case in #1815. Added a test that would otherwise fail without the change.
-
Umang Yadav authored
Needed to run multi-targeted program where "main" isn't the only root module. There could be many root modules other than main.
-
Artur Wojcik authored
-
- 02 Jul, 2023 3 commits
-
-
Charlie Lin authored
Updates ref version of clip to work with dynamic shapes. Encountered in agentmodel.
-
Paul Fultz II authored
* Add a CI job to test CK
* Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK
* Continue tuning even when there are invalid configs
* Fix a bug with parallel compilation not using all available threads
* Add additional test for gemms using half types
* Removed int32 as supported type since it doesn't pass our test suite
-
Umang Yadav authored
-
- 01 Jul, 2023 1 commit
-
-
Paul Fultz II authored
This will show the instructions that are inserted for each onnx operator. This should help debugging where certain instructions come from.
-
- 30 Jun, 2023 2 commits
-
-
Umang Yadav authored
-
Paul Fultz II authored
-
- 29 Jun, 2023 2 commits
-
-
Krzysztof Drewniak authored
Bump MLIR commit to include latest supported pointwise ops. Expand the MLIR approve list. Ensure that operations such as tanh() that don't have integer implementations (at least in MLIR) aren't used within MLIR modules. Add additional tests.
-
Artur Wojcik authored
Co-authored-by: Artur Wojcik <artur.wojcik@amd.com>
-
- 28 Jun, 2023 2 commits
-
-
Umang Yadav authored
-
Krzysztof Drewniak authored
Update `mlir_program` to only create one dialect registry, and to call registerRocMLIRPasses() (which is needed and may not be thread-safe) exactly once. In addition, use a single thread pool across all contexts. This is recommended practice upstream for libraries that perform a lot of compile jobs, and saves on the overhead of creating and destroying a lot of threads
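The "exactly once" requirement in the message above can be illustrated with a function-local static, whose initialization C++11 guarantees to run once even when several threads reach it at the same time. This is only a minimal sketch: `register_passes_stub`, `registration_count`, and `ensure_registered` are hypothetical names for illustration, and the stub stands in for a real registration call rather than reproducing the actual `mlir_program` change.

```cpp
#include <cassert>

// Hypothetical counter so the example can observe how often the
// registration stub actually ran.
inline int& registration_count()
{
    static int count = 0;
    return count;
}

// Hypothetical stand-in for an expensive, possibly non-thread-safe
// global registration step. It only counts its invocations.
inline void register_passes_stub() { ++registration_count(); }

// Callers invoke this freely. The initializer of the local static runs
// exactly once per process; C++11 and later make that initialization
// thread-safe, so concurrent callers wait instead of re-running it.
inline void ensure_registered()
{
    static const bool done = (register_passes_stub(), true);
    (void)done; // silence unused-variable warnings
}
```

Calling `ensure_registered()` any number of times leaves `registration_count()` at 1. `std::call_once` with a `std::once_flag` is an equivalent alternative when the one-time step needs arguments or must be retried after an exception.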
-
- 27 Jun, 2023 1 commit
-
-
Artur Wojcik authored
-
- 26 Jun, 2023 1 commit
-
-
Umang Yadav authored
-
- 23 Jun, 2023 1 commit
-
-
Umang Yadav authored
Fixes #1852 Fixes #1847
-
- 22 Jun, 2023 2 commits
-
-
Zhuoran Yin authored
Add mlir quant_dot operator support
-
Ted Themistokleous authored
* Update install_prereqs.sh to handle 22.04 defines
  Needed to run containers with 22.04
* Add Dockerfile for Ubuntu 22.04 and ROCm 5.5
  Updated Dockerfile to use ROCm 5.5 and Ubuntu 22.04 for use with building MIGraphX
  Able to run make -j$(nproc) check successfully with this
* Clean this up since it's breaking CI
* Clean up install prereqs some more
  - Use one protobuf version
  - Remove extra python3.8 installs from 3.10 case
* Move comment for protobuf comment
* Move Dockerfile for 22.04 to Dockerfiles/ folder
* Move and rename 2204 docker file
  Remove Docker_** from the name. Move these to tools/docker
* Add pip3 installs to be shared between python versions
* Add Package pin from repo.radeon.com
* Add CMAKE_ARG ONNX_USE_PROTOBUF_SHARED_LIBS for every default python dist
  Set this to be default as part of installing prereqs
---------
Co-authored-by: Charlie Lin <charlie.lin@amd.com>
Co-authored-by: Umang Yadav <29876643+umangyadav@users.noreply.github.com>
Co-authored-by: kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 21 Jun, 2023 2 commits
-
-
Paul Fultz II authored
Co-authored-by: kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
Umang Yadav authored
Co-authored-by: kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 20 Jun, 2023 1 commit
-
-
github-actions[bot] authored
Co-authored-by: causten <causten@users.noreply.github.com>
Co-authored-by: Ted Themistokleous <107195283+TedThemistokleous@users.noreply.github.com>
-
- 17 Jun, 2023 3 commits
-
-
Ted Themistokleous authored
* Add trace for SIMPLIFY_ALGEBRA matches
* Fix format
* Handle review comments from Umang
  - int to size_t for trace
  - Move env arg to top of simplify_algebra.cpp
  - Handle overload better for find_matches
* Rename trace_mod param to trace_pass
  More representative naming for what this trace flag does
-
turneram authored
* Add initial ck_gemm code * Format * Add additional src files * Format * Add include * Simplify fuse_ck * Format * Rename var * Enable pass * Update ck version * Fix include * Add group stride * Disable warnings for ck headers * Format * Add unpack array * Add interface to enable tuning * Format * Update compile_ops to handle tuning config * Format * Add some comments * Move time_op to migraphx_gpu * Add benchmarking * Refactor * Format * Add lift class macro * Use device name * Format * Generate configs * Format * Pass tuning parameter * Move data type to is_ck_gemm matcher * Format * Add problem_cache to avoid retuning same configs * Format * Format * Mark the problems * Format * Use is_null * Format * Resize vector * Only tune with exhaustive tuning * Format * Use assert * Format * Tidy fixes * More tidy fixes * Format * Add license to missing files * Format * Use transform * Format * Fix tidy * Format * Fix cppcheck issues * Format * Add static_assert * Add ops header * Add assertion in batcher * Format * Improve the batch fold check * Format * Add where op workaround for CK * Skip if any input is not a supported ck type * Format * Check batch is standard * Format * Remove redundant static keyword * Update commit hash * Fix error when running without --exhaustive-tune * Formatting * Formatting * Remove fuse_ck_gemm_softmax_gemm * Update ck hash * Correct spelling mistake * Remove commented out logic from fuse_ck * Remove unused include and add comment * Formatting * Remove redundant get_shape and remove ck_gemm from names * Formatting * Allow for mixed types with int8 gemms * Formatting * Add back find_package from merge * Update CK commit hash and add gfx940 to fuse_ops supported archs * Formatting * Update CK hash
-
Umang Yadav authored
* Fix convert for the NaNs
* NaNs can't be compared, use std::isnan()
* formatting
* formatting
* formatting
* add extra tests
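The point about NaN comparison can be shown in a few lines: under IEEE 754, `nan == nan` evaluates to false, so an equality check never detects a NaN and `std::isnan()` is needed instead. The helper below, `equal_or_both_nan`, is a hypothetical name for illustration only and is not taken from the commit.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical helper (illustration only): compares two floats and
// treats a pair of NaNs as equal. A plain a == b is false whenever
// either operand is NaN, so the NaN case is handled explicitly first.
inline bool equal_or_both_nan(float a, float b)
{
    if(std::isnan(a) || std::isnan(b))
        return std::isnan(a) && std::isnan(b);
    return a == b;
}
```

With `float nan = std::numeric_limits<float>::quiet_NaN();`, the expression `nan == nan` is false while `equal_or_both_nan(nan, nan)` is true. This assumes the code is not built with flags such as `-ffast-math`, which allow the compiler to assume NaNs never occur.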
-
- 16 Jun, 2023 2 commits
-
-
Charlie Lin authored
* initial
* Added tests and new functionality
* Update optimals handling
* Simplify conditionals
* Ref test, update docs
* Remove comment, suggestion unclear
---------
Co-authored-by: Umang Yadav <29876643+umangyadav@users.noreply.github.com>
-
Paul Fultz II authored
-
- 15 Jun, 2023 1 commit
-
-
Umang Yadav authored
-