- 11 Jul, 2022 4 commits
-
-
Ted Themistokleous authored
I may need additional checks for this, or to somehow find the matching division by zero and cause a dangling reference so that it gets flagged correctly at compile time. The current attempt inserts a divzero instruction that would later be picked up at the verify stage during compile. Not sure this is correct in case we run into operator collisions down the road.
-
Ted Themistokleous authored
This reverts commit fcc84214.
-
Ted Themistokleous authored
-
Ted Themistokleous authored
This reverts commit 4aeacc17.
-
- 08 Jul, 2022 2 commits
-
-
Ted Themistokleous authored
-
Ted Themistokleous authored
-
- 05 Jul, 2022 7 commits
-
-
Ted Themistokleous authored
Use this call to also skip converts when running a simplify_algebra pass over a program.
-
Ted Themistokleous authored
-
Ted Themistokleous authored
-
Ted Themistokleous authored
Makes things more maintainable by splitting the unit tests that shared the baseline program for validation, which resulted in an & case that was a bit more cumbersome to debug.
-
Ted Themistokleous authored
Lets us avoid compiler warnings without using the [[maybe_unused]] attribute.
-
Ted Themistokleous authored
Adds this to handle broadcasted values instead of just scalars
-
Ted Themistokleous authored
Used to avoid the case where 1e-12 is erroneously matched as zero, resulting in the call being removed with an incorrect value.
-
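The failure mode described in the last entry above (1e-12 being matched as zero) can be illustrated with a small sketch. This is plain Python, not MIGraphX code; the helper names `is_zero_exact` and `is_zero_approx` and the tolerance value are assumptions made for illustration, and the commit's actual mechanism is not shown in this log.

```python
import math


def is_zero_approx(value: float, tol: float = 1e-9) -> bool:
    # Tolerance-based check (hypothetical): treats any tiny value as zero.
    return math.isclose(value, 0.0, abs_tol=tol)


def is_zero_exact(value: float) -> bool:
    # Exact check (hypothetical): only a true zero matches.
    return value == 0.0


eps = 1e-12  # a typical epsilon added to a denominator for stability

# With a tolerance, eps looks like zero, so "x + eps" could be rewritten
# to "x" and a later division could then hit an exact zero.
approx_match = is_zero_approx(eps)

# With an exact comparison, eps is left alone and the add is kept.
exact_match = is_zero_exact(eps)
```

Under these assumptions, only the exact comparison leaves the 1e-12 term in place, which is the behavior the entry describes as correct.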
- 01 Jul, 2022 1 commit
-
-
Ted Themistokleous authored
-
- 30 Jun, 2022 17 commits
-
-
Ted Themistokleous authored
-
Ted Themistokleous authored
-
Ted Themistokleous authored
-
Ted Themistokleous authored
Throw an exception when this occurs to indicate our simplification passes resulted in a singularity somewhere. Related to #1236
-
Ted Themistokleous authored
Simplify addition zero multiplication and divide operations. Added appropriate test cases with returns, replacing the instruction and operand to just return zero.
-
Ted Themistokleous authored
Using the unit/neg unit matchers to handle subtraction operations in the same steps. Added unit tests for both cases.
-
Ted Themistokleous authored
-
Ted Themistokleous authored
Part of the changes that go with #1236. Reverts -1 divide operations to a simple negation of the parameter.
-
Ted Themistokleous authored
Add handling for zero addition operations into the find_unit_ops() matcher functor.
-
Ted Themistokleous authored
Added test case and code to simplify zero additions between parameters and literals during simplifications. In reference to issue #1236
-
Ted Themistokleous authored
Simplifies our code for all operations and reuses the original unit tests for the overlapping matcher.
-
Ted Themistokleous authored
Done to satisfy simplifications specified by #1236. Just replace every parameter divided by 1 with itself. It's assumed that the eliminate_identity() pass will handle generated identity operators in our run_pass().
-
Ted Themistokleous authored
Save a multiply operation by replacing it with a negation of input parameter x. Suggested improvement via #1236
-
Ted Themistokleous authored
-
Ted Themistokleous authored
The original use of a literal 1, instead of any other number, in simplify_mul_add caused the find_unit_mult_const function to optimize away the literal 1, causing this test to fail on the final check. Switched the constant to a value other than zero or one, and the test now passes.
-
Ted Themistokleous authored
Commit for the day, work in progress as I'm failing one of our unit tests outside of the change
-
Paul Fultz II authored
This is an extension to insert_module_instructions, but instead of just inserting from a module, it can insert a range or a vector of instructions.
-
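The algebraic rewrites described in the 30 Jun entries above (x / 1, x / -1, x * -1, x + 0, x - 0, multiplication by zero, and raising on a division by zero) can be sketched on a toy expression tree. This is plain Python for illustration only: the types `Param`, `Lit`, `Neg`, `BinOp` and the function `simplify` are hypothetical and are not the MIGraphX matcher API.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Param:
    name: str


@dataclass(frozen=True)
class Lit:
    value: float


@dataclass(frozen=True)
class Neg:
    arg: "Expr"


@dataclass(frozen=True)
class BinOp:
    op: str  # one of "add", "sub", "mul", "div"
    lhs: "Expr"
    rhs: "Expr"


Expr = Union[Param, Lit, Neg, BinOp]


def is_lit(e: Expr, v: float) -> bool:
    # Exact comparison, so a small value such as 1e-12 is not treated as zero.
    return isinstance(e, Lit) and e.value == v


def simplify(e: Expr) -> Expr:
    """Apply the unit/zero identities bottom-up (illustrative only)."""
    if isinstance(e, Neg):
        return Neg(simplify(e.arg))
    if not isinstance(e, BinOp):
        return e
    lhs, rhs = simplify(e.lhs), simplify(e.rhs)
    if e.op == "div":
        if is_lit(rhs, 0):
            # Surface the singularity instead of silently rewriting it.
            raise ZeroDivisionError("simplification produced a division by zero")
        if is_lit(rhs, 1):
            return lhs            # x / 1  -> x
        if is_lit(rhs, -1):
            return Neg(lhs)       # x / -1 -> -x
        if is_lit(lhs, 0):
            return Lit(0.0)       # 0 / x  -> 0
    elif e.op == "mul":
        if is_lit(lhs, 0) or is_lit(rhs, 0):
            return Lit(0.0)       # x * 0  -> 0
        if is_lit(rhs, 1):
            return lhs            # x * 1  -> x
        if is_lit(lhs, 1):
            return rhs            # 1 * x  -> x
        if is_lit(rhs, -1):
            return Neg(lhs)       # x * -1 -> -x
        if is_lit(lhs, -1):
            return Neg(rhs)       # -1 * x -> -x
    elif e.op == "add":
        if is_lit(rhs, 0):
            return lhs            # x + 0  -> x
        if is_lit(lhs, 0):
            return rhs            # 0 + x  -> x
    elif e.op == "sub":
        if is_lit(rhs, 0):
            return lhs            # x - 0  -> x
    return BinOp(e.op, lhs, rhs)


x = Param("x")
negated = simplify(BinOp("mul", BinOp("add", x, Lit(0.0)), Lit(-1.0)))
```

In this sketch `negated` ends up as `Neg(Param("x"))`: the inner `x + 0` collapses first, then the multiply by -1 becomes a negation. The division-by-zero check runs before the other divide rules so that `0 / 0` raises rather than folding to zero.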
- 29 Jun, 2022 4 commits
-
-
Chris Austen authored
Should be --fp16, not --fp16ref
-
Charlie Lin authored
Allows PyTorch converted version of SSD-resnet34 to work
-
Paul Fultz II authored
Compiles significantly faster than constructing all the objects. It also reduces recompiles.
-
Umang Yadav authored
This PR only adds an example using MIOpen calls.
-
- 28 Jun, 2022 2 commits
-
-
Umang Yadav authored
Add an example using rocBLAS calls.
-
Umang Yadav authored
This PR only adds an example using HIP kernel.
-
- 26 Jun, 2022 1 commit
-
-
Paul Fultz II authored
* Add function to get a module tree
* Get parent module in the pass manager
-
- 25 Jun, 2022 2 commits
-
-
Brian Pickrell authored
One-line fix to register the op miopen_fusion. This error was causing loading of compiled model files (*.mxr) to fail.
-
Paul Fultz II authored
* Jit contiguous
-