- 25 Jul, 2022 1 commit
-
-
varunsh authored
* Add is_supported to the target * Add get_target_assignments * Rename assignment to target_assignments * Add ref target header to test * Add fpga target * Make context const in compute
-
- 22 Jul, 2022 1 commit
-
-
Umang Yadav authored
C++ API is not printing thrown exception string. this improves on it.
-
- 21 Jul, 2022 1 commit
-
-
Charlie Lin authored
Dynamic shape handling in shape object
-
- 19 Jul, 2022 3 commits
-
-
Umang Yadav authored
Bug 1: create_literal was using back_inserter to copy vector with already allocated size, causing double the size of literal. Fix 1 : not use back_inserter Bug 2: Input param to model can be from operation that has multiple output, in that case name of the input param would contain : e.g. input_1:0 Fix 2: Look for : and take substring
-
Charlie Lin authored
Depends on #1199 Adds ONNX parser functionality for dynamic input shapes. Uses options parameter in parse_onnx()
-
Charlie Lin authored
Changes to operator includes: removed some includes that were not used included argument.hpp where clang-tidy wanted it
-
- 12 Jul, 2022 3 commits
-
-
Paul Fultz II authored
Reduce header inclusion in op headers
-
Paul Fultz II authored
This will ensure that migraphx.h can be included from a C compiler, and check that the C API can be called. This includes stdbool.h which is needed when using bool from C.
-
Paul Fultz II authored
-
- 11 Jul, 2022 4 commits
-
-
turneram authored
-
Ted Themistokleous authored
-
Ted Themistokleous authored
This reverts commit 4aeacc17.
-
Paul Fultz II authored
* Only run __syncthreads when there is data to preload * Improve loops * Add const attribute to improve optimizations
-
- 08 Jul, 2022 5 commits
-
-
Paul Fultz II authored
Show the number of operators and per operator avg time in summary... Summary: gpu::gemm: 8.738ms / 73 = 0.119699ms, 64% gpu::triadd_layernorm: 0.831381ms / 24 = 0.0346409ms, 7%
-
Ted Themistokleous authored
-
Paul Fultz II authored
Improve the assembly dump to track where certain instruction come from.
-
varunsh authored
Added is_supported and get_target_assignments methods to the target and program, respectively, to eventually support multi-target compilation and execution.
-
Charlie Lin authored
Initial sketch for changes to shape to handle dynamic dimensions
-
- 07 Jul, 2022 1 commit
-
-
Paul Fultz II authored
Instead of just unsqueezing to an axis of 1 a step can be set to use instead. So instead of unsqueezing {3, 12} to {3, 1, 12} a step of 2 will unsqeeze to {3, 2, 6} instead
-
- 06 Jul, 2022 1 commit
-
-
Paul Fultz II authored
*In the verification tests, check that saving and reloading the program is the same program. This also fixes serialization to always load instructions in the same order. There is also fixes for deconv and quant_conv which didn't save the solution id, and was broken for serialization.
-
- 05 Jul, 2022 8 commits
-
-
Paul Fultz II authored
* Add softmax kernel
-
Ted Themistokleous authored
Use this call to also skip converts when running a simplify_algebra pass over a program.
-
Ted Themistokleous authored
-
Ted Themistokleous authored
-
Paul Fultz II authored
This reorders the transposes across slice to improve horizontal fusion for contiguous. This also improves eliminate_contiguous to remove contiguous better across splits.
-
Ted Themistokleous authored
Allows us to not throw warnings instead of using [[maybe_unused]] flag instead.
-
Ted Themistokleous authored
Adds this to handle broadcasted values instead of just scalars
-
Ted Themistokleous authored
Used to avoid the case where 1e-12 is used and is matched as zero errornously resulting in removing the call with the incorrect value.
-
- 03 Jul, 2022 1 commit
-
-
Paul Fultz II authored
* Add mlir c api * Formatting * Create a type attribute * Formatting * Parse module * Formatting * Add mlir dump function * Add test case * Formatting * Fix tidy issues * Update mlit version * Update to newer mlir * Format * Move mlir to the gpu and update the test * Formatting * Fix bug when appending module * Format * Remove old cmake flag * Update message * Add return * Format * Add mlir_compile * Format * Register dialect * Handle unsinged integers * Dont provide output for return instruction * Format * Add code to insert memrefs * Format * Add mlir verification * Formatting * Enable pointwise_fusion * Disable eliminate_data_type * Set kernal name * Format * Fix device name * Formatting * Fix output arg * Format * Updates * Upate hash * Add fuse_mlir pass * Format * Add fuse mlir * Format * Update mlir * Sort parameter names * Format * Reenable disabled passes * Remove old mlir conv * Remove asym default padding * Add more verbose tracing * Format * Fix compilation errors * Format * Whitelist operators * Format * Add namespace * Format * Update triple * Format * Use func dialect * Format * Use func.return * Format * Upgrade mlir version * Add comment * Handle symetrical padding * Format * Cleanup debug output * Format * List failed tests * Move mlir compile to jit pipeline * Format * Update version * Add source locations * Format * Correctly add module * Format * Update failed tests * Fix failures when mlir is disabled * Format * Update mlir version * Check type for fp32 * Format * Remove failed test * Update mlir in driver * Tidy fixes * Foramt * Tidy fixes * Format * Fix const * Remove from requirements * Fix cmake version * Fix tidy warning * Use another ifdef * Fix tidy * Other tidy fix * Format * Update hash * Add missing license files * Format * Format * Fix fnction name
-
- 01 Jul, 2022 1 commit
-
-
Ted Themistokleous authored
-
- 30 Jun, 2022 10 commits
-
-
Ted Themistokleous authored
-
Ted Themistokleous authored
-
Ted Themistokleous authored
-
Ted Themistokleous authored
Throw an exception when this occurs to indicate our simpliciation passes resulted in a singularity somewhere. Related to #1236
-
Ted Themistokleous authored
Simplify addition zero multiplication and divide operations. Added approrpiate test cases with returns and replacing the instruction and operand to just return zero.
-
Ted Themistokleous authored
Using the unit/neg unit matchers to handle subtraction operations in the same steps. Added unit tests for both cases.
-
Ted Themistokleous authored
-
Ted Themistokleous authored
Part of changes that go wtih #1236. Reverts -1 divide operations to a simple negation of the parameter
-
Ted Themistokleous authored
Add handling for zero addition operations into the find_unit_ops() matcher functor.
-
Ted Themistokleous authored
Added test case and code to simplify zero additions between paremeters and literals during simplifications. In reference to issue #1236
-