- 20 Oct, 2023 2 commits
-
-
Umang Yadav authored
-
Umang Yadav authored
-
- 19 Oct, 2023 1 commit
-
-
Umang Yadav authored
-
- 18 Oct, 2023 1 commit
-
-
Umang Yadav authored
-
- 08 Aug, 2023 1 commit
-
-
Paul Fultz II authored
-
- 03 Aug, 2023 1 commit
-
-
Umang Yadav authored
* use implicit deps for the sorting * use BFS for sorting program
-
- 31 Jul, 2023 1 commit
-
-
Lakhinder Walia authored
* Use shape of Instruction (instead of a default) in add_return() * Instruction validation fix: not to use a default shape value for comparison * Fix instruction::replace() to recompute shape for "@return" * handle the case of missing shape in an Instruction related Test * use compute_shape() to get op shapes + test case for tuple_type * add test case shape_test/return_shape_tuple * Add test for @return to check for half type * Move @return unit-tests around..; Address review comments * Broken comparison fix: comparison to a (default) shape of tuple_type * Test cases: (add) return_shape_empty & (modify) return_shape_tuple * modify the assert() statement
-
- 24 Jul, 2023 1 commit
-
-
umangyadav authored
-
- 06 Jul, 2023 1 commit
-
-
Paul Fultz II authored
This is to help enable multi-target execution. We store a vector of targets and contexts. Currently this will only compile a single target, the PR #1672 is needed to enable multiple targets. This will also serialize the targets and contexts. When using the execution_environment or prog.get_context() it will always use the context from the first target assuming this is the "primary" target. Although, its unlikely a user would use execution_environment with a multi-target environment.
-
- 02 Jul, 2023 1 commit
-
-
Umang Yadav authored
-
- 30 May, 2023 1 commit
-
-
Paul Fultz II authored
Use generate_argument instead of generate_literal for python output as generate_literal doesnt exists Shorten the names for variables from the main module Use prefix p_ for parameters Use shorter variable m for main module in python
-
- 06 Apr, 2023 1 commit
-
-
Paul Fultz II authored
Automatically fuse multiple reductions and pointwise operations.
-
- 21 Mar, 2023 1 commit
-
-
Charlie Lin authored
Refactor to have select_module use output parameters Disable select_module verify tests on cpu
-
- 26 Jan, 2023 1 commit
-
-
Paul Fultz II authored
* Create the shape correctly in python printout
-
- 14 Dec, 2022 1 commit
-
-
Paul Fultz II authored
* Print python code
-
- 27 Oct, 2022 1 commit
-
-
Chris Austen authored
Upgraded Dockerfiles and fixed tidy issues to make Ubuntu 20.04 and ROCm 5.3.0 the default
-
- 29 Sep, 2022 1 commit
-
-
Paul Fultz II authored
* Fix invalid program from find_splits
-
- 06 Sep, 2022 1 commit
-
-
Paul Fultz II authored
Using not and or improves readability. The cppcheck rule will help ensure we are doing it consistently.
-
- 19 Aug, 2022 1 commit
-
-
kahmed10 authored
* fix json strings in driver models
-
- 06 Jul, 2022 1 commit
-
-
Paul Fultz II authored
*In the verification tests, check that saving and reloading the program is the same program. This also fixes serialization to always load instructions in the same order. There is also fixes for deconv and quant_conv which didn't save the solution id, and was broken for serialization.
-
- 03 Jul, 2022 1 commit
-
-
Paul Fultz II authored
* Add mlir c api * Formatting * Create a type attribute * Formatting * Parse module * Formatting * Add mlir dump function * Add test case * Formatting * Fix tidy issues * Update mlit version * Update to newer mlir * Format * Move mlir to the gpu and update the test * Formatting * Fix bug when appending module * Format * Remove old cmake flag * Update message * Add return * Format * Add mlir_compile * Format * Register dialect * Handle unsinged integers * Dont provide output for return instruction * Format * Add code to insert memrefs * Format * Add mlir verification * Formatting * Enable pointwise_fusion * Disable eliminate_data_type * Set kernal name * Format * Fix device name * Formatting * Fix output arg * Format * Updates * Upate hash * Add fuse_mlir pass * Format * Add fuse mlir * Format * Update mlir * Sort parameter names * Format * Reenable disabled passes * Remove old mlir conv * Remove asym default padding * Add more verbose tracing * Format * Fix compilation errors * Format * Whitelist operators * Format * Add namespace * Format * Update triple * Format * Use func dialect * Format * Use func.return * Format * Upgrade mlir version * Add comment * Handle symetrical padding * Format * Cleanup debug output * Format * List failed tests * Move mlir compile to jit pipeline * Format * Update version * Add source locations * Format * Correctly add module * Format * Update failed tests * Fix failures when mlir is disabled * Format * Update mlir version * Check type for fp32 * Format * Remove failed test * Update mlir in driver * Tidy fixes * Foramt * Tidy fixes * Format * Fix const * Remove from requirements * Fix cmake version * Fix tidy warning * Use another ifdef * Fix tidy * Other tidy fix * Format * Update hash * Add missing license files * Format * Format * Fix fnction name
-
- 30 Jun, 2022 1 commit
-
-
Paul Fultz II authored
This is an extension to insert_module_instructions, but instead of just inserting from a module, it can insert a range or a vector of instructions.
-
- 29 Jun, 2022 1 commit
-
-
Paul Fultz II authored
Compiles significantly faster than constructing all the objects. It also reduces recompiles as well.
-
- 26 Jun, 2022 1 commit
-
-
Paul Fultz II authored
* Add function to get a module tree * Get parent module in the pass manager
-
- 22 Jun, 2022 1 commit
-
-
Ted Themistokleous authored
Updated each source file in the repo with the existing license.
-
- 16 Jun, 2022 1 commit
-
-
Charlie Lin authored
* Use custom distance function * Pass module, skip order check if other module * Change other valid() * Remove unnecessary declaration * test multiple module dependency * Refactor to make more clear * Code cleanup * Simplify fix * Test EXPECT Co-authored-by:Paul Fultz II <pfultz2@yahoo.com>
-
- 10 Jun, 2022 1 commit
-
-
Paul Fultz II authored
Consolidate the vectorize and preload Add vectorization to reduction Co-authored-by:kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 26 May, 2022 1 commit
-
-
Paul Fultz II authored
* Upgrade to cppcheck 2.8
-
- 11 Mar, 2022 1 commit
-
-
Shucai Xiao authored
The module::debug_print(ins) is very slow, which makes the trave_eval==1/2 very slow. The reason is printing an ins involves search the whole module to get the instruction, the print it. This change is to fix that by calling module::print() to get names of all instructions of a program, then print the instruction by getting its name from a hash map.
-
- 09 Dec, 2021 1 commit
-
-
Paul Fultz II authored
Fuse last instruction in fuse_pointwise This is also fixes a bug with using an invalid iterator.
-
- 19 Oct, 2021 1 commit
-
-
Paul Fultz II authored
Adds a pass to fuse pointwise operators into one "pointwsie" op that has a submodule which does the calculation.
-
- 21 Sep, 2021 1 commit
-
-
Paul Fultz II authored
Needed to bypass passes when fusing pointwise operators into a module.
-
- 16 Sep, 2021 1 commit
-
-
Shucai Xiao authored
Add Loop operator for opset version 13. Notes: 1) Default max iteration number is 10 if no max iteration number is provided 2) To change the max iter number, a user can set the max_loop_iterations in the onnx_option struct when parsing a model. 3) The returned shape of the scan output is from the max_loop_iterations even the actual loop num is less than that. This issue also applies to other operators like NonZero and NonMaxSuppression. A issue #948 is created to track this and to be resolved later. Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 25 Aug, 2021 1 commit
-
-
Shucai Xiao authored
* always keep parameters * clang format * fix tidy error * clang format * add more unit tests to have more code coverage * fixed a bug to ensure get_parameter_names to return ordered parameter names * clang format * remove unnecessary print out * refine a code change * clang format * add a unit test to check parameter is not removed by dead code elimination * clang format * rename a function name Co-authored-by:Chris Austen <causten@users.noreply.github.com>
-
- 08 Jul, 2021 1 commit
-
-
Paul Fultz II authored
* Add preallocate method * Add preallocate_param pass * Preallocate buffers on the cpu * Formatting * Preallocate on the gpu * Add missing cpp file * Formatting * Add lifetime function * Formatting * Always allocate * Fix tidy warning * Add const * Add missing lifetime annotations Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 27 Jun, 2021 1 commit
-
-
Shucai Xiao authored
* Add definitions for all pointwise operators * Formatting * Add cpp generator class * Formatting * Move compilation to core * Formatting * Add clock to tmp name * Add dynamic loader * Formatting * Add tests for code gen * Formatting * Add test for literals * Formatting * Use with_char * Add missing header * Fix mismerge * Ignore tidy warning * Fxx gcc 5 errors * Apply fixits * Skip signed bitwise of status * Remove unused parameters * Explicitly add c++14 flag * Fix tidy warning * unify the compute function signature * clang format * make another change * unify the compute function * clang format * remove unnecessary code * more refinement about the operator compute funciton * clang format * add an overload function * clang format * add support for axes inputs for sequeeze/unsqueeze/reduce_sum * clang format * fix build problems * backup code changes * clang format * Add tuple type to shape class * Formatting * fix a bug in parsing quantizelinear operator * clang format * fix a cppcheck error * disable different versions of unit tests for different onnx version * clang format * upgrade onnx to 1.8 * update onnx to 1.8.1 * disable two more real models * clang format * Make data member private * Formatting * Add sub arguments * Formatting * Trun clang format off * Disable clang-format * fix review comments * fix the function of assign axes in parsing the squeeze operator * add unit tests and fix a bug * clang format * fix review comments * clang format * fix a build error * backup code changes * clang format * add more unit tests and add parsing opset version * clang format * Improve visiting tuples * Formatting * fix cppcheck error * adding installing the onnx package * resolve no protobuf compiler * add an inline subgraph pass * clang format * Add more argument tests * Formatting * Handle tuple in load * Formatting * code backup * clang format * Remove .o files * Add tuple type to api * Formatting * fix build errors * clang format * code backup * code backup * add unit tests for the inline subgraph * clang format * refine the inline subgraph and parse if operator * clang format * fix cppcheck issue * clang format * add unit test for inline subgraph pass * clang format * fix format issue * remove the context from the if operator * clang format * simplify the compute functions * Fix tidy warnings * fix cppcheck error * clang format * fix cppcheck error * Fix tidy warnings * fix a cppcheck error * clang format * Add a test for share method * Formatting * Add a test cpp_type * add unit tests for more code coverage * clang format * add unit tests to have more code coverage * clang format * try a comment in jenkins build * include the install onnnx line * code backup * reorder the dependenciesd installed * refine dockerfile * fix review comments * clang format * remove unnecessary overload function * fix cppcheck error * change back the argument test * Suppress tidy warning * add the operator get_tuple_elem * clang format * add get_tuple_elem to operator include file * chang if to support multiple operation outputs * clang format * optimize inline subgraph * clang format * code backup * clang format * fix bug * refine unit tests for tuple output of the if operator * clang format * refine a instruction replacement code * add a unit test and sort all the unit tests alphabetically * fix cppcheck error * add more unit tests for multiple op outputs * clang format * fix cppcheck error * Update pass manager to get modules after every pass * more unit test to cover more scenarios * clang format * fixed a bug in a unit test * add more tests * clang format * add more unit tests to have more code coverage * fix a bug in a unit test * Add program overload for module * Formatting * Hash modules for quicker lookup of modules * Bump file version * Add methods to remove modules * Formatting * add the tuple type to the support list * Eliminate unused modules * Formatting * Fix test errors * Foramtting * Fix tidy issues * fix problem related to inline subgraph * clang format * fix review comments * fix review comments * fix review comments * fix review comments * clang format * fix a unit test * one more code change * remove an optimization related to the if operator * clang format * fix review comments Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 09 Jun, 2021 1 commit
-
-
Paul Fultz II authored
* Enable libstdc++ debug mode * Add is_end function * Compare addresses in a map or set * Formatting * Check end * Fix comparision of instruction_ref * Formatting * Some more iterator fixes * Formatting * Fix assert * Fix invalid iterators * Fix debug print in program * Remove debug flag for now * Set correct bool type Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 24 May, 2021 1 commit
-
-
Paul Fultz II authored
* rename merge_from to merge_to * refine comments * code backup * clang format * The first version that can reduce scratch memory usage * code backup * clang format * code backup * clang format * fixed a bug related to removing gemm copy * clang format * code backup * clang format * fix review comments * clang format * fix unit test failure * code backup * clang format * code base for further investigation * code with both the forward and backward approach to compute the conflict table * clang format * clang format * backup changes * remove unnecessary file * remove unnecessary code * code backup * clang format * code backup * clang format' * fix a bug in the code * clang format * code backup * clang format * remove unused code * remove unused code * rename some functions * remove print code * code backup * add dominator to scheduling * add dominator algorithm to remove unnecessary conflicts * Remove comment * Use erase_if instead * Formatting * Code clean up: * Formatting * Add dominator info class * Formatting * Add dom_info * Formatting * Add test case and fix some bugs * Formatting * Add unit test for scheduler * Formatting * Use index map instead of distance * Formatting * Add memory coloring test * Check for conflict in memory coloring * Formatting * Use 1 stream by default * Update to use modules * Formatting * Skip live on entry check * Formatting * Formatting * Fix tidy warning * Fix tidy warning * Formatting * Add nolint * Use C++17 to build everything when using clang * Remove input names * Formatting * Remove input names * Keep order of params * Formatting Co-authored-by:
Shucai Xiao <Shucai.Xiao@amd.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 07 May, 2021 1 commit
-
-
Paul Fultz II authored
* Update pass manager to get modules after every pass * Add program overload for module * Formatting * Hash modules for quicker lookup of modules * Bump file version * Add methods to remove modules * Formatting * Eliminate unused modules * Formatting * Fix test errors * Foramtting * Fix tidy issues
-
- 03 May, 2021 1 commit
-
-
Paul Fultz II authored
* Hash instruction in module for faster checking if instruction is in module * Formatting * Fix cppcheck error Co-authored-by:
Shucai Xiao <shucai@gmail.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-