- 27 Jun, 2021 1 commit
-
-
Shucai Xiao authored
* Add definitions for all pointwise operators * Formatting * Add cpp generator class * Formatting * Move compilation to core * Formatting * Add clock to tmp name * Add dynamic loader * Formatting * Add tests for code gen * Formatting * Add test for literals * Formatting * Use with_char * Add missing header * Fix mismerge * Ignore tidy warning * Fxx gcc 5 errors * Apply fixits * Skip signed bitwise of status * Remove unused parameters * Explicitly add c++14 flag * Fix tidy warning * unify the compute function signature * clang format * make another change * unify the compute function * clang format * remove unnecessary code * more refinement about the operator compute funciton * clang format * add an overload function * clang format * add support for axes inputs for sequeeze/unsqueeze/reduce_sum * clang format * fix build problems * backup code changes * clang format * Add tuple type to shape class * Formatting * fix a bug in parsing quantizelinear operator * clang format * fix a cppcheck error * disable different versions of unit tests for different onnx version * clang format * upgrade onnx to 1.8 * update onnx to 1.8.1 * disable two more real models * clang format * Make data member private * Formatting * Add sub arguments * Formatting * Trun clang format off * Disable clang-format * fix review comments * fix the function of assign axes in parsing the squeeze operator * add unit tests and fix a bug * clang format * fix review comments * clang format * fix a build error * backup code changes * clang format * add more unit tests and add parsing opset version * clang format * Improve visiting tuples * Formatting * fix cppcheck error * adding installing the onnx package * resolve no protobuf compiler * add an inline subgraph pass * clang format * Add more argument tests * Formatting * Handle tuple in load * Formatting * code backup * clang format * Remove .o files * Add tuple type to api * Formatting * fix build errors * clang format * code backup * code backup * add unit tests for the inline subgraph * clang format * refine the inline subgraph and parse if operator * clang format * fix cppcheck issue * clang format * add unit test for inline subgraph pass * clang format * fix format issue * remove the context from the if operator * clang format * simplify the compute functions * Fix tidy warnings * fix cppcheck error * clang format * fix cppcheck error * Fix tidy warnings * fix a cppcheck error * clang format * Add a test for share method * Formatting * Add a test cpp_type * add unit tests for more code coverage * clang format * add unit tests to have more code coverage * clang format * try a comment in jenkins build * include the install onnnx line * code backup * reorder the dependenciesd installed * refine dockerfile * fix review comments * clang format * remove unnecessary overload function * fix cppcheck error * change back the argument test * Suppress tidy warning * add the operator get_tuple_elem * clang format * add get_tuple_elem to operator include file * chang if to support multiple operation outputs * clang format * optimize inline subgraph * clang format * code backup * clang format * fix bug * refine unit tests for tuple output of the if operator * clang format * refine a instruction replacement code * add a unit test and sort all the unit tests alphabetically * fix cppcheck error * add more unit tests for multiple op outputs * clang format * fix cppcheck error * Update pass manager to get modules after every pass * more unit test to cover more scenarios * clang format * fixed a bug in a unit test * add more tests * clang format * add more unit tests to have more code coverage * fix a bug in a unit test * Add program overload for module * Formatting * Hash modules for quicker lookup of modules * Bump file version * Add methods to remove modules * Formatting * add the tuple type to the support list * Eliminate unused modules * Formatting * Fix test errors * Foramtting * Fix tidy issues * fix problem related to inline subgraph * clang format * fix review comments * fix review comments * fix review comments * fix review comments * clang format * fix a unit test * one more code change * remove an optimization related to the if operator * clang format * fix review comments Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 16 Jun, 2021 1 commit
-
-
Shucai Xiao authored
* backup implementation of resize enhancement * clang format * code backup for the resize * clang format * fix build error for resize operator * clang format * tmp code backup * clang format * code backup * clang format * remove changes in parse_resize * remove unnecessary changes * clang format * add unit test for the bug * clang format * remove print code * remove a semi-colon * clang format * fix a tidy error * clang format * add contiguous for nonstd input for the resize operator * clang format * code backup * clang format * fix build error * code backup * clang format * code backup * code backup * clang format * add unit tests for resize_linear * clang format * refine a function name * clang format * fix cppcheck error * clang format * fix cppcheck error * fix review comments * clang format * backup code changes * clang format * add unit tests for resize operator * clang format * remove an unused header file * remove an unused header file * remove unrelated unit tests * refine parsing resize inputs * clang format * fix cppcheck error * fix cppcheck error * remove unnecessary code * clang format * fix cppcheck error * clang format * fixed a bug * clang format * fix review comments * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 15 Jun, 2021 1 commit
-
-
Shucai Xiao authored
* add a flag to indicate int8x4 input format * clang format * code backup * clang format * code backup * clang format * code backup * clang format * code backup * clang format * code backup * clang format * remove log info * remove unnecessary changes * fix cppcheck error * add unit tests to have more code coverage * clang format * add debug info * remove log info * fix cppcheck error * clang format * clang format * add one more unit tests for more scenarios * fix cppcheck error * clang format * fix review comments * clang format * rename p to m * fix review comments * refine unit tests * clang format * refine unit tests and fixed a bug * clang format * fix build error related to rocm4.2 * fix a bug related to alpha and beta * refine two unit tests related to int8_gemm * fix cppcheck error * refine unit test to pass on mi100 * add unit test for packing int8 args * clang format * change unit tests back * disable some unit tests for gpu * clang format * refine unit tests to run on mi100 * clang format * refine unit tests * refine unit tests * clang format * change back a unit test Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 11 Jun, 2021 1 commit
-
-
Paul Fultz II authored
Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 10 Jun, 2021 2 commits
-
-
Cagri Eryilmaz authored
* init reverseOp branch: ref op + ref test. WIP * first passing basic test * cleanup * additional axis implementation * additional test * ref op implementation vec to int for axis * ref op test change for axis * initial gpu files and test * updates to implementation and test * fixed some issues * clang format * cleanup * formatting * removing comments * changes to parse_slice.cpp debug copy * cleanup + additional axis for reverse instruction * formatting * remove local size, back to default * update tests: replace with std functions * multiple axis for reverse op * fix a build error * clang format * changes to parse_slice.cpp debug copy * cleanup + additional axis for reverse instruction * formatting * axes update to parse slice * typo * more tests * fix a bug for the reverse device function * clang format * fix a bug * clang format * ref test updates, multiaxis * formatting * formatting, cleanup bool op * casting for tidy warning * tidy fix * remove bool, add steps, check only negative axis * clang-format * step op for parse slice * cleanup & format * missing axis for logsoftmax_nonstd_input_test * updated onnx file for logsoftmax_nonstd_input_test * updates to parse slice. tests for slice+reverse, slice+step+reverse * removing tests for slice+step+reverse as step requires normalization, will move it to other branch. removed related lines and tests * duplicate test removal * some refinement of the code * clang format * undefined behavior fix * undef behavior v2 * formatting * formatting & updates * change to parse slice * update to parse_slice for undef/asan + test update * formatting * remove header, no if * assertions + change the loop from axis to steps for logsoftmax test segfault Co-authored-by:
Shucai Xiao <Shucai.Xiao@amd.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
* Add optional header * Formatting * Use optional in the matcher * Foramtting * Remove program from tests * Formatting * Dont bind or match non-local variables * Formatting * Fix gcc 5 error * Format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 09 Jun, 2021 2 commits
-
-
kahmed10 authored
* alternative impl * formatting * add gpu pass to insert pad * formatting * update onnx test, still need cleanup * formatting * update tf_test * modify existing tests * formatting * remove print * code cleanup * formatting * code cleanup * formatting * fix tidy and cppcheck * remove variable * add test * formatting * add test and address comments * formatting Co-authored-by:
Shucai Xiao <shucai@gmail.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
* Enable libstdc++ debug mode * Add is_end function * Compare addresses in a map or set * Formatting * Check end * Fix comparision of instruction_ref * Formatting * Some more iterator fixes * Formatting * Fix assert * Fix invalid iterators * Fix debug print in program * Remove debug flag for now * Set correct bool type Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 08 Jun, 2021 1 commit
-
-
Cagri Eryilmaz authored
* init reverseOp branch: ref op + ref test. WIP * first passing basic test * cleanup * additional axis implementation * additional test * ref op implementation vec to int for axis * ref op test change for axis * initial gpu files and test * updates to implementation and test * fixed some issues * clang format * cleanup * formatting * removing comments * remove local size, back to default * update tests: replace with std functions * multiple axis for reverse op * fix a build error * clang format * more tests * fix a bug for the reverse device function * clang format * fix a bug * clang format * ref test updates, multiaxis * formatting Co-authored-by:
Shucai Xiao <Shucai.Xiao@amd.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 02 Jun, 2021 1 commit
-
-
Paul Fultz II authored
Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 26 May, 2021 1 commit
-
-
Shucai Xiao authored
* add the operator step * clang formatJ * add unit tests * clang format * add more unit test for step op * clang format * add more unit tests * clang format * fix review comments * clang format * rename two unit tests Co-authored-by:Paul Fultz II <pfultz2@yahoo.com>
-
- 25 May, 2021 1 commit
-
-
Paul Fultz II authored
* Add timing to trace eval * Formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 24 May, 2021 2 commits
-
-
Shucai Xiao authored
* backup implementation of resize enhancement * clang format * code backup for the resize * clang format * fix build error for resize operator * clang format * tmp code backup * clang format * remove changes in parse_resize * remove unnecessary changes * clang format * add unit test for the bug * clang format * remove print code * remove a semi-colon * clang format * fix a tidy error * fix review comments * clang format Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
* rename merge_from to merge_to * refine comments * code backup * clang format * The first version that can reduce scratch memory usage * code backup * clang format * code backup * clang format * fixed a bug related to removing gemm copy * clang format * code backup * clang format * fix review comments * clang format * fix unit test failure * code backup * clang format * code base for further investigation * code with both the forward and backward approach to compute the conflict table * clang format * clang format * backup changes * remove unnecessary file * remove unnecessary code * code backup * clang format * code backup * clang format' * fix a bug in the code * clang format * code backup * clang format * remove unused code * remove unused code * rename some functions * remove print code * code backup * add dominator to scheduling * add dominator algorithm to remove unnecessary conflicts * Remove comment * Use erase_if instead * Formatting * Code clean up: * Formatting * Add dominator info class * Formatting * Add dom_info * Formatting * Add test case and fix some bugs * Formatting * Add unit test for scheduler * Formatting * Use index map instead of distance * Formatting * Add memory coloring test * Check for conflict in memory coloring * Formatting * Use 1 stream by default * Update to use modules * Formatting * Skip live on entry check * Formatting * Formatting * Fix tidy warning * Fix tidy warning * Formatting * Add nolint * Use C++17 to build everything when using clang * Remove input names * Formatting * Remove input names * Keep order of params * Formatting Co-authored-by:
Shucai Xiao <Shucai.Xiao@amd.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 23 May, 2021 1 commit
-
-
Paul Fultz II authored
* Create lazy range * Formatting * Use lazy iota
-
- 11 May, 2021 1 commit
-
-
Shucai Xiao authored
* upgrade protobuf to 3.16.0 to be compatible with onnxruntime * change the link flag to fix a protobuf version conflict with ort
-
- 07 May, 2021 1 commit
-
-
Paul Fultz II authored
* Update pass manager to get modules after every pass * Add program overload for module * Formatting * Hash modules for quicker lookup of modules * Bump file version * Add methods to remove modules * Formatting * Eliminate unused modules * Formatting * Fix test errors * Foramtting * Fix tidy issues
-
- 06 May, 2021 1 commit
-
-
Paul Fultz II authored
* Use hipStreamSynchronize instead of device sync * Formatting * Suppress FPs * Use sync_stream instead of device * Formatting * Fix python bindings * Formatting
-
- 03 May, 2021 2 commits
-
-
Paul Fultz II authored
* Hash instruction in module for faster checking if instruction is in module * Formatting * Fix cppcheck error Co-authored-by:
Shucai Xiao <shucai@gmail.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
* Remove unused data types * Formatting * Reduce types generated for hip kernels * Formatting * Fix onnx tests * Formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 01 May, 2021 2 commits
- 29 Apr, 2021 1 commit
-
-
SJW authored
* MLIR MIOpen Dialect integration (phase 1) (#768) * Added Findmlir.cmake (using environment variables to import) * Added mlir_conv pass to GPU target * Apply to any gpu::convolution if supported by MLIR * Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution * Capture binary in dictionary for matching convolutions * Build a code_object_op with the binary and execution dimensions * Substitute for the gpu::convolution * Changed the parameters for the code_object to reflect the generated MLIR kernel * Expanded out MemRefDescriptor fields in param list * Also updated for MLIR C-API changes * * fixed global_size calculation * MLIR MIOpen Dialect integration (phase 1) (#768) * Added Findmlir.cmake (using environment variables to import) * Added mlir_conv pass to GPU target * Apply to any gpu::convolution if supported by MLIR * Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution * Capture binary in dictionary for matching convolutions * Build a code_object_op with the binary and execution dimensions * Substitute for the gpu::convolution * Changed the parameters for the code_object to reflect the generated MLIR kernel * Expanded out MemRefDescriptor fields in param list * Also updated for MLIR C-API changes * * Added command line option: --enable_mlir * * fixed command line switch * updated for new MLIR API changes * * Added cget llvm-project-mlir to import MIIR API libraries into Dockerfile * removed cmake Findmlir * updated for changes in MIIR C-API * * updated CMakeLists.txt to allow disable of MLIR import * fixed memory leaks and removed copies * updated for 5D memrefs * * formatting * * fixed review comments * * fixed merge issues * hip gcnDeviceName now includes specifiers at the end * use major/minor values instead * * disable MLIR by default * * removed command-line switch --enable-mlir * * fix unused when MLIR disabled * * enable jenkins enable/test MLIR * * format * * fixed clang-tidy * * added new type Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 27 Apr, 2021 2 commits
-
-
Paul Fultz II authored
* Add definitions for all pointwise operators * Formatting * Add cpp generator class * Formatting * Move compilation to core * Formatting * Add clock to tmp name * Add dynamic loader * Formatting * Add tests for code gen * Formatting * Add test for literals * Formatting * Use with_char * Add missing header * Fix mismerge * Ignore tidy warning * Fxx gcc 5 errors * Apply fixits * Skip signed bitwise of status * Remove unused parameters * Explicitly add c++14 flag * Fix tidy warning * Add tuple type to shape class * Formatting * Make data member private * Formatting * Add sub arguments * Formatting * Trun clang format off * Disable clang-format * Improve visiting tuples * Formatting * Add more argument tests * Formatting * Handle tuple in load * Formatting * Remove .o files * Add tuple type to api * Formatting * Fix tidy warnings * Fix tidy warnings * Add a test for share method * Formatting * Add a test cpp_type * Suppress tidy warning Co-authored-by:Shucai Xiao <Shucai.Xiao@amd.com>
-
Paul Fultz II authored
-
- 26 Apr, 2021 1 commit
-
-
turneram authored
* Add scan struct; add initial tests; initial algorithm by cases; refactor into one algorithm; clean up code * Rename; restructure; begin adding additional attributes * refactor to use shape_for_each; temporarily drop reverse mode * Add back reverse mode with shape_for_each_reverse; update tests; add axis bounds check * Begin adding to onnx parser * Add to onnx parser * Fix onnx test * Fix CI warnings * Update algorithm to use slice+par_for; update gen_onnx; remove .o files; remove redundant axis normalizing * Add exclusive mode * Add reverse mode * Remove .pyc file * Fix warning * Remove shape_for_each_reverse; clean up pointer usage for exclusive cases * Remove unused variable * Fix onnx test * Add test case to op_shape_test * Formatting * Formatting * Fix tidy warning * Formatting * Formatting * Formatting * Increase code coverage * Formatting * refine the script for creating the cumsum onnx file * Alphabetize includes for operators.hpp * Revise onnx test * Remove redundant bounds check * Formatting and style * Alphabetize tests * Remove duplicate tests from merge * Fix tidy warning for sub_test Co-authored-by:
Shucai Xiao <Shucai.Xiao@amd.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 23 Apr, 2021 2 commits
-
-
Shucai Xiao authored
* add support for axes inputs for sequeeze/unsqueeze/reduce_sum * clang format * fix build problems * backup code changes * clang format * fix a bug in parsing quantizelinear operator * clang format * fix a cppcheck error * disable different versions of unit tests for different onnx version * clang format * upgrade onnx to 1.8 * update onnx to 1.8.1 * disable two more real models * clang format * fix review comments * fix the function of assign axes in parsing the squeeze operator * add unit tests and fix a bug * clang format * fix review comments * clang format * fix a build error * backup code changes * clang format * add more unit tests and add parsing opset version * clang format * fix cppcheck error * adding installing the onnx package * resolve no protobuf compiler * fix cppcheck error * add unit tests for more code coverage * clang format * try a comment in jenkins build * include the install onnnx line * code backup * reorder the dependenciesd installed * refine dockerfile * fix review comments * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Shucai Xiao authored
* code backup * clang format * add a matcher related to the special resize case for optimization * clang format * code backup * clang format * code backup * remove unnecessary code * add optimization for the where op * clang format * fix cppcheck error * add a unit test for optimize resize * clang format * remove unnecessary header include * code backup * clang format * add unit tests for optimizing resize * clang format * add more unit test for optimizing where op * clang format * remove unnecessary code * add one more optimzation to remove contiguous * clang format * add a pointwise requirement * clang format * fix cppcheck error * add one more unit test * fixed a bug * clang format * remove unnecessary code * clang format * fix a build error * fix review comments * clang format * fix a review comments * clang format * code refinement * clang format * refine more code * refine more code * fix a bug related to reshape_cont optimization * clang format * fix a review comment * removed an unnecessary comment * refine code according to comments * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 22 Apr, 2021 1 commit
-
-
Paul Fultz II authored
* Add eliminate_data_type pass * Formatting * Auto convert quant ops * Formatting * Flip the order of decompose * Compute max size differently * Formatting * Clamp values in convert * Formatting * Fix loss of precision in reduce * Formatting * Fix bugs in reduction * Fix accumulator type in reference softmax implementation * Formatting * Update convert test * Remove unused variables * Remove unnecessary quant_dot check * Formatting * Add tests * Formatting * Remove unused code * Remove duplicate ops * Remove blaze dependency * Use set since shape::type_t is no hashable on gcc 5 * Formatting * Add dnnl binary op * Formatting * Add binary and eltwise * Formatting * Add softmax * Formatting * Remove unused operators * Add missing files * Formatting * Add lrn * Formatting * Add deconvolution * Formatting * Change allocate default * Add reorder * Formatting * Add reductions * Formatting * Sort lines * Change literals in another loop * Add pow operator * Formatting * Add pow operator * Formatting * Make sure shapes are packed * Allow broadcasted inputs * Remove unused operators * Simplify functions * Remove softmax * Add sub and erf functions * Formatting * Fix bug * Formatting * Improve parallism * Formatting * Allow multiple batch dimensions * Formatting * Move literal transforms out of lowering * Formatting * Add gather operator * Sort lines * Add early exit for carry * Formatting * Add missing concat * Rename macro * Fix deep nesting * Formatting * Fix cppcheck issues * Remov else * Move attribute to typedef * Formatting * Disable maybe-uninitialized warning since its broken on gcc * Add constexpr default constructor * Formatting * Fix compiler warnings * Fix adjust_allocation test * Add layernorm matcher * Add gelu_erf matcher * Formatting * Add gelu_tanh matcher * Formatting * Remove match namespace * Formatting * Use matcher instead of string * Formatting * Add fusions * Formatting * Add post op field * Formatting * Make post_ops serializable * Formatting * Add eltwise fusions * Formatting * Fix null conversions * Formatting * Add fuse_ops source files * Formatting * Set binary post op index correctly * Formatting * Fix serialization bugs * Check if used once * Formatting * Fix error in get_primitive_attr * Formatting * Add compile function * Formatting * Limit fusions * Formatting * Disable with env variable instead of using compile arg * Formatting * Fix implicit conversion to bool * Declar on seperate lines * Formatting * Fix cppcheck issues * Fix ICE in pack_join * Formatting * Use const ref * Make enum hashable * Formatting * Add explicit this * Fix merge issues * Fix dangling ref * Formatting * Add test for compile * Formatting * Add more value tests * Formatting Co-authored-by:
Shucai Xiao <shucai@gmail.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 21 Apr, 2021 1 commit
-
-
Paul authored
-
- 19 Apr, 2021 1 commit
-
-
Paul Fultz II authored
* Add definitions for all pointwise operators * Formatting * Add cpp generator class * Formatting * Move compilation to core * Formatting * Add clock to tmp name * Add dynamic loader * Formatting * Add tests for code gen * Formatting * Add test for literals * Formatting * Use with_char * Add missing header * Fix mismerge * Ignore tidy warning * Fxx gcc 5 errors * Apply fixits * Skip signed bitwise of status * Remove unused parameters * Explicitly add c++14 flag * Fix tidy warning * Remove .o files
-
- 13 Apr, 2021 1 commit
-
-
turneram authored
* Add version command and option to driver to print migraphx version being used * Address review comments * Address review comments * Fix leftover mistake * Add newline to version.h.in * Address review comments * Fix tidy warning * Remove unneccesary line from cmake file * Formatting; remove .o files Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 09 Apr, 2021 1 commit
-
-
Paul Fultz II authored
* Fix tidy warnings for 4.1 * Formatting * Upgrade to 4.1 in docker * Remove hcc build and enable ubsan on clang debug * Add missing openmp package * Construct directly * Construct directly * Upgrade rocm-cmake version
-
- 08 Apr, 2021 1 commit
-
-
Paul authored
-
- 07 Apr, 2021 1 commit
-
-
Shucai Xiao authored
* add a contiguous check before the reshape in parsing resize * clang format * add a unit test to the fix in parse_resize * clang format
-
- 05 Apr, 2021 2 commits
-
-
Paul Fultz II authored
* Add method to compute shape with same layout * Formatting * Fix permutation with ambiguous layouts * Formatting * Propagate layout for pointwise operators * Formatting * Propagate layout for more operators * Formatting * Sort with lens * Formatting * Simplify permutation sorting * Formatting * Propagate layout for concat operator * Formatting * Use copy * Formatting * Remove header Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Shucai Xiao authored
* code cleanup * clang format * backup code * clang format * remove unnecessary code * clang format * add module print function * code backup * refine the module::print function * refine the module:to_value() function * code backup * backup code changes * code backup * remove to_value and from_value function from the module class * rename a function * rename the if operator * refine the if operator * refine the print function of module and program * code backup * code backup * fix a build warning * fix overload of compute_shape function * code backup * fix unit test error * fix cppcheck error * fix the issue related to the overload of compute_shape * fix review comments * fix cppcheck error * change the return name of if_op to be if * clang format * fix two unit tests * clang format * rename variables * clang format * remove the unused compute_op function * clang format * add lowering of if operator and compute_op function * clang format * add parsing if operator in onnx file * clang format * fix clang tidy format * clang format * add the gpu implementation of the if operator * enhance the validate function and uncomment a unit test * clang format * remove unnecessary code * add sub_module processing in ref passes * clang format * clang format * fix a hang issue related to the valid function * fix an issue in replace_refs * clang format * fix review comments * clang format * fix cppcheck error * clang format * add a unit test for more code coverage * clang format * fix review comments and add test for more code coverage * clang format * fix cppcheck error * clang format * fix cppcheck error * fix a cppcheck error * clang format * backup code * clang format * fix cppcheck error * clang format * some code refinement * clang format * code backup to handle submodules in module compilation * clang format * code backup * clang format * code backup * clang format * fix a bug related to literal id * fix a bug in gpu execution * change the way of compiling a graph * clang format * backup more changes * clang format * refine pass log information * remove unnecessary code * clang format * temp changes backup * clang format * add module name prefix to scratch memory id in hip_memory_allocation * clang format * change to copy the cond input by inserting a copy instruction * clang format * change to use the if output argument as the submodule output so can remove a gpu_copy * clang format * consider submodule in some compile passes * clang format * fix review comments * clang format * fix issues related to scratch memory * clang format * remove unnecessary code * fix cppcheck error * clang format * reslove the implicit dependencies issue related to submodule * clang format * fix cppcheck error * clang format * backup temp changes * clang format * fixed an bug in the has_instruction function * clang format * fix the return value of the gpu implementation of the if operator * fix a bug in the compute_shape function in the gpu implementation * add an if onnx unit test * clang format * add more unit tests * clang format * tmp code backup * clang format * fix a sync problem related to copy cond argument from gpu to cpu * clang format * change the compile offload copy flag setting * clang format * enable copy from cpu to be able to do synchronous copy * clang format * add more unit tests * add more unit tests * add more ref unit tests * clang format * fixed a bug error * tmp code backup * clang format * fixed an onnx verify unit test * add more unit tests * clang format * reverse a change * fix cppcheck error * fix cppcheck error * fix to print all instructions in program execution * clang format * fix bugs related to memory coloring and offload copy to be true * clang format * remove unnecessary include header file * sort test cases in ref_cpu_ops alphabetically * clang format * add a flag to disable cpu target in verification test * change the way to disable some tests * clang format * disable verify unit test of the if operators * add a function call to have more code coverage * fix a build error * fix review comments * fix review comments * clang format * add a api gpu unit test for more code coverage * clang format * change to use instruction.size() as node index * move the calc_implicit_deps function to module class as a member function * clang format * move the offload_copy flag setting to lowering * clang format * assign the module_eval lambda function to a variable to simplify code * clang format * move the compute function from ref/gpu implementation to the main if operator * clang format * fix cpp check error * add a unit test for more code coverage * clang format * add unit test to calculate implicit deps * add a python unit test * clang format * refine a unit test to have more code coverage * clang format * chang the way of wrap up arguments for sub modules * clang format * fix some build errors * code cleanup * refine unit tests to have more code coverage * clang format * refine unit test to have more code coverage * code backup * clang format * add memory coloring test * refine memory coloring unit test * clang format * remove an unnecessary line * remove an unused line * remove an unnecessary parameter in the lambda function * clang format * refine a unit test * remove an unnecessary line * refine unit tests to have more code coverage * clang format * combine two lines * add one more unit test for more code coverage * clang format * add one more unit test * clang format * fix review comments * refine a print out information * fix review comments * clang format * change the sync copy to using a gpu device sync * clang format * remove unnecessary code Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 26 Mar, 2021 1 commit
-
-
Paul Fultz II authored
* Add code object op * Formattting * Add more value tests * Formatting * Fix from_value conversion from binary * Formatting * Dont use offload copy * Remove iostream header * Fix compilation errors * Formatting * Rename var * Add missing files * Formatting * Remove duplicate variable * Remove comment * Template the function so sfinae will work * Formatting * Use template specialization since ADL is broken on hcc * Formatting * Annotate the constructor with HD for hcc * Make variable const Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 25 Mar, 2021 1 commit
-
-
Paul Fultz II authored
* Add eliminate_data_type pass * Formatting * Auto convert quant ops * Formatting * Flip the order of decompose * Compute max size differently * Formatting * Clamp values in convert * Formatting * Fix loss of precision in reduce * Formatting * Fix bugs in reduction * Fix accumulator type in reference softmax implementation * Formatting * Update convert test * Remove unused variables * Remove unnecessary quant_dot check * Formatting * Add tests * Formatting * Remove unused code * Remove duplicate ops * Remove blaze dependency * Use set since shape::type_t is no hashable on gcc 5 * Formatting * Add dnnl binary op * Formatting * Add binary and eltwise * Formatting * Add softmax * Formatting * Remove unused operators * Add missing files * Formatting * Add lrn * Formatting * Add deconvolution * Formatting * Change allocate default * Add reorder * Formatting * Add reductions * Formatting * Sort lines * Change literals in another loop * Add pow operator * Formatting * Add pow operator * Formatting * Make sure shapes are packed * Allow broadcasted inputs * Remove unused operators * Simplify functions * Remove softmax * Add sub and erf functions * Formatting * Fix bug * Formatting * Improve parallism * Formatting * Allow multiple batch dimensions * Formatting * Move literal transforms out of lowering * Formatting * Add gather operator * Sort lines * Add early exit for carry * Formatting * Add missing concat * Rename macro * Fix deep nesting * Formatting * Fix cppcheck issues * Remov else * Move attribute to typedef * Formatting * Disable maybe-uninitialized warning since its broken on gcc * Add constexpr default constructor * Formatting * Fix compiler warnings * Fix adjust_allocation test * Add layernorm matcher * Add gelu_erf matcher * Formatting * Add gelu_tanh matcher * Formatting * Remove match namespace * Formatting * Use matcher instead of string * Formatting * Add fusions * Formatting * Make input a const ref * Make this explicit for gcc 5 Co-authored-by:
Shucai Xiao <shucai@gmail.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 18 Mar, 2021 1 commit
-
-
kahmed10 authored
* fix relu6 * add more transposes * add parse_tf calls * progress on multi_outputs * formatting * add multi output test * add comment and update migraphx.py * fix compile * formatting * update tools/api * formatting * fix function call * fix generate * simplify tests * formatting * rename tests * enclose braces * add more tests * update comments * rename file and add default param * formatting * fix tidy and change type * formatting older files Co-authored-by:Paul Fultz II <pfultz2@yahoo.com>
-