Commits · f7d987baa34e30423cafb6edd5ca32eee5f97841 · gaoqiong / MIGraphX

04 Oct, 2022 1 commit
- Stream sync Changset (#1358) · f7d987ba
  Ted Themistokleous authored Oct 04, 2022
```
Stream sync changes and associated API level changes
```
  f7d987ba
03 Oct, 2022 1 commit

Add output_alias and runs_on_offload_target flags for the custom ops (#1309) · c9ffb38d

Umang Yadav authored Oct 03, 2022

Adds two methods for the custom_ops virtual class.

bool runs_on_offload_target(), if the custom op runs directly on the gpu then it should be set to true. in this case, custom op expects its parameters to reside in GPU memory and writes output to the GPU memory. If it is set to false then, custom op expects it's parameter to reside on the host and puts back the result into the host memory.

output_alias, if output of the custom op is aliasing the input buffer. i.e. interpreting the same input buffer with differnet shape and strides.

Update as_vector() in C++ API to handle non-standard shapes. It required exposing element_index to space_index conversion method for the shape class.

c9ffb38d

06 Sep, 2022 1 commit
- Enable cppcheck rule for 'not', 'or' keywords (#1361) · d37a4df9
  Paul Fultz II authored Sep 06, 2022
```
Using not and or improves readability. The cppcheck rule will help ensure we are doing it consistently.
```
  d37a4df9
21 Aug, 2022 1 commit

Update is_supported (#1334) · 79e15ca9

varunsh authored Aug 21, 2022

* Update is_supported
* Return object from is_supported
* Return by reference in interator

79e15ca9

09 Aug, 2022 1 commit

Allow license_stamper.py to be ran from any directory (#1332) · 5bf4dee6

Paul Fultz II authored Aug 09, 2022



* Allow license_stamper.py to be ran from any directory

* Format
Co-authored-by: kahmed10 <15948690+kahmed10@users.noreply.github.com>

5bf4dee6

04 Aug, 2022 1 commit

Dynamic ref convolution op (#1224) · 67f77ac1

Charlie Lin authored Aug 04, 2022



* Dynamic shape handling in shape object

* rewrite empty lens multibroadcast test

* Shape class changes to handle dynamic
* More throw errors for functions that don't make sense for dynamic shape
* Print output changes
* Serialization changes

* Fixing serialization errors

* Remove const on dyn_dim copy getters

* Dynamic shape tests

* Fix serialize errors

* Add dyn_data struct to avoid ambiguous constructor

* Tidy fix: emplace_back() over for loop

* Tidy fix: use move

* Use std::initializer_list in constructor
Reverts the dyn_data struct change
Should get around the ambiguous braced initialization list error

* avoid typedef

* element_space, min,max,opt _lens change

* formatting

* Comments fix

* dynamic bytes() test

* Seralize and reflect changes

* formatting

* Test the dynamic lens functions

* progress

* Formatting

* Dynamic conv draft progress

* Add operator<< tests for coverage

* Coverage update

* Add to conv dynamic batch test

* Dynamic image size test

* Dynamic weight handling

* Dyn image shape test change, fix dyn weight cond

* Comment update

* Dynamic weights shape test and fix

* Use ternary operator

* Tidy fixes

* Handle dynamic graph input shapes in ONNX parser

* Formatting

* Handle dynamic shape for convolution

* formatting

* cppcheck fixes

* Add onnx test files

* Fix typo

* Disable auto_pad for dynamic input shape

* check_shapes object checks for allowing dynamic shapes

* Fix any_of

* Change to maintain const objectness

* Formatting

* Check shapes allow dynamic

* Refactor compute_shape() call into op.compute()
Allows for per operator differences with handling dynamic shape
Fix operation.hpp change to use the generator

* Comment fix

* Refactor normalize_attributes() calls to use max_lens()

* Comment addition

* Update other normalize_attributes() calls

* Change to using constructor and add tests

* Use const member function

* Add more dynamic shape support

* Add tests for error code coverage

* Fix opt shape bug and add shape tests

* capture all by ref

* Fix typo with img shape calculation

* Add more tests

* dynamic auto pad attempt
Linker error with pad_calc.cpp

* Fix parse dyn auto_pad
Should only need to use dynamic auto pad when the image shape or kernel
shape are dynamic. For a dynamic batch size, the auto pad calculation is
the same.

* Fix linking error

* Fix auto_pad bug
Fixed input tensor with auto_pad setting on

* auto_pad onnx tests

* Fix auto_pad calculation, evaluate in ref_conv
add ref_ops tests

* Add shape tests, fix bugs

* Refactor first two output dynamic len calculation

* Conv MLIR test update

* i64 MLIR test fix

* Fix MLIR test typo
Co-authored-by: Chris Austen <causten@users.noreply.github.com>

67f77ac1

30 Jul, 2022 1 commit

Add accuracy checker tool (#1315) · 16cb8377

kahmed10 authored Jul 30, 2022

Added an Accuracy checker to the tools directory.  Currently compares ONNX FP32 models against ORT CPUEP

16cb8377

22 Jul, 2022 1 commit
- Improve error reporting in the API (#1274) · c722117d
  Umang Yadav authored Jul 22, 2022
```
C++ API is not printing thrown exception string. this improves on it.
```
  c722117d
12 Jul, 2022 1 commit

Add tests for C API (#1266) · a7a32a9e

Paul Fultz II authored Jul 12, 2022

This will ensure that migraphx.h can be included from a C compiler, and check that the C API can be called. This includes stdbool.h which is needed when using bool from C.

a7a32a9e

08 Jul, 2022 1 commit

Add is_supported and get_target_assignments (#1269) · 8192f37f

varunsh authored Jul 07, 2022

Added is_supported and get_target_assignments methods to the target and program, respectively, to eventually support multi-target compilation and execution.

8192f37f

24 Jun, 2022 2 commits

Adding in check_stamped.py to tools/ (#1255) · 8c35fa94

Ted Themistokleous authored Jun 24, 2022

Used to determine what files contain a license and are stamped. If not we exit and return an error code that can be later ingested by another script, as well as a list of the outstanding files in questions.

Currently baked in the list of files we should support or not support with licenses in them a well as some stuff to quickly ignore

8c35fa94

Add compute_method for the experimental custom op (#1194) · edc7be5c

Umang Yadav authored Jun 24, 2022

Adds compute_method for the experimental custom ops.
Adds a test for the same using HIP APIs.
Depends on #1183
Solves #1101

edc7be5c

23 Jun, 2022 1 commit
- Fix code block issue with .ipynb files. (#1263) · e95b875f
  Ted Themistokleous authored Jun 22, 2022
```
Regenerate notebook header for licensing
```
  e95b875f
22 Jun, 2022 1 commit
- Update license files (#1248) · e44cecbc
  Ted Themistokleous authored Jun 22, 2022
```
Updated each source file in the repo with the existing license.
```
  e44cecbc
17 Jun, 2022 1 commit

Create allocate op and replace_allocate pass (#1183) · add6fb3b

kahmed10 authored Jun 17, 2022



* add allocate op header

* formatting

* add replace_allocate pass

* formatting

* move output param to remove_allocate pass

* formatting

* fix bugs in replace_allocate pass

* formatting

* fix verify if tests

* formatting

* move if op logic

* formatting

* cleanup lowering

* cleanup lowering

* formatting

* fix tidy

* formatting

* fix tidy

* add cpu allocate check

* formatting

* change cpu allocate in pass

* formatting

* add some tests for replace_allocate pass

* formatting

* pass by ref

* fix run_pass

* formatting

* update variable name for module

* update dce to use contains() and fix tidy

* formatting

* update cppcheck

* add if test

* formatting

* add if test

* rename var to mod_output_names

* formatting

* remove conditional

* update allocate op and tests

* formatting

* update replace_allocate tests

* update create_output_names() and conditional in replace_allocate

* formatting

* remove extra variable in replace_allocate

* update tools script for allocation_model
Co-authored-by: Umang Yadav <29876643+umangyadav@users.noreply.github.com>
Co-authored-by: Chris Austen <causten@users.noreply.github.com>
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>

add6fb3b

01 Jun, 2022 1 commit
- Update protobuf version (#1228) · 0cc6304d
  kahmed10 authored Jun 01, 2022
```
update protobuf version
```
  0cc6304d
13 May, 2022 1 commit

Update install_prereqs.sh for individual use (#1197) · 8c94ad07

Chris Austen authored May 13, 2022

Our documentation indicates a user with sudo can run the install_prereqs.sh file. Turns out that the file is not complete enough to run on Ubuntu 18.04/20.04 independently. I updated the file to resolve the failures.

resolves #1191

8c94ad07

28 Mar, 2022 1 commit

Use ifdef instead of comment for the auto-generated method declarations for... · 8e4d622f

Paul Fultz II authored Mar 28, 2022

Use ifdef instead of comment for the auto-generated method declarations for type erased classes (#1138)

It seems the formatting of comments are unreadable for larger methods, so instead just generate a struct with the methods in the interface and add a comment if its optional. It wraps this in #ifdef TYPE_ERASED_DECLARATION(assuming this would never be defined) instead of #if 0, so most editors can still provide syntax highlighting(although I think vscode with clangd will still gray it out unfortunately).

8e4d622f

24 Mar, 2022 1 commit
- Add initial experimental custom op (#1109) · 251cdd74
  Paul Fultz II authored Mar 24, 2022
```
This creates a custom op which has name() and compute_shape() methods. 
```
  251cdd74
15 Mar, 2022 1 commit

Expose APIs for the MIGraphX program (#1093) · 64e79a94

Umang Yadav authored Mar 15, 2022

API includes following
create_module,
get_main_module
add_instruction without module args
add_instruction with module args
add_parameter
add_return

64e79a94

09 Mar, 2022 1 commit
- Expose context in C++ API (#1118) · 0e6bd17c
  kahmed10 authored Mar 09, 2022
```
Add a callable C++ API to migraphx
```
  0e6bd17c
02 Mar, 2022 1 commit
- Clang format ver10 (#1106) · 9852aaef
  bpickrel authored Mar 02, 2022
```
Update the base version of clang-format from 5.0 to 10.0
```
  9852aaef
25 Feb, 2022 1 commit
- Add get_queue to context to get the current stream (#1097) · e5242676
  Paul Fultz II authored Feb 24, 2022
```
wrapped in a any_ptr class so the type can be checked at runtime for a mismatch.
```
  e5242676
16 Feb, 2022 1 commit
- Add assign_to method for C++ API (#1075) · ecb1545c
  kahmed10 authored Feb 16, 2022
  
  ecb1545c
01 Feb, 2022 1 commit
- Add python type annotations to api.py (#1061) · 2a79a9ff
  Paul Fultz II authored Feb 01, 2022
```
This will also check the types using mypy on the CI.
```
  2a79a9ff
07 Dec, 2021 1 commit

Test runner match input output using tensor names (#996) · 0f9b4072

Shucai Xiao authored Dec 07, 2021

1. Previous implementation assumes inputs and outputs .pb files are ordered, but it is not the case. So, we should use the name of the tensors in the input/output .pb files to match the input and output in the onnx model. (This change applies to the BERT_Squad model)
2. When parsing a model with dynamic input shape, current implementation uses the default batch_size for the unknown dims, which can cause parsing error for some cases (e.g. mask_rcnn model). The solution is we first read an input to get the shape, then use these shapes to parse the onnx model.

0f9b4072

05 Dec, 2021 1 commit
- Change in documentation for roctx knob (#984) · 26d90328
  Cagri authored Dec 05, 2021
```
Adds description for roctx knob of migraphx-driver in documentation.
```
  26d90328
22 Nov, 2021 1 commit

Helper script for rocTX run and parse (#985) · 4f9a0ce7

Cagri authored Nov 22, 2021

This provides a helper script to run rocTX markers with migraphx-driver and reduces the number of steps a user would go through running rocTX knob.
Run:
python roctx.py --run --onnx_file <ONNX_FILE> --migraphx_args "--onnx --gpu --fp16 --batch 16" --out outputfolder
Runs and parses the run output (JSON file). An example output is given below:

SUM MIN MAX
Marker start: gpu::convolution 5272 10 563
Marker start: gpu::add_relu 605 12 18
Marker start: gpu::gather 299 145 154
Marker start: gpu::mul_add 227 14 57
Marker start: gpu::sub 177 13 42
Marker start: gpu::concat 169 22 31
Marker start: gpu::triadd_relu 163 15 18
Marker start: load 141 0 3
Marker start: hip::hip_copy_literal 111 0 3
Marker start: gpu::add 58 13 17
Marker start: broadcast 52 0 3
Marker start: gpu::convert 31 15 16
Marker start: slice 11 0 1
Marker start: gpu::pooling 9 9 9
Marker start: step 2 2 2
Marker start: @param 2 0 1
Marker start: reshape 1 0 1
Marker start: hip::hip_allocate_memory 1 1 1
Marker start: check_context::migraphx::version_... 0 ERR ERR

TOTAL TIME: 7331 us

JSON FILE PATH: [...]/rpl_data_211019_195229_9369/input_results_211019_195229/trace.json
Parse:
python roctx.py --parse --json_path <JSON PATH FROM RUN>
Note: The parse knob is made available if the user wants to parse an already existing JSON output.

4f9a0ce7

17 Nov, 2021 1 commit

Handle removing contiguous on operators that use modules (#1005) · 785307c3

Paul Fultz II authored Nov 17, 2021

Currently, eliminate_contiguous will never remove contiguous for operators that use module inputs due to the fact that it doesn't pass the module inputs to compute_shape.

- Update to pass the module inputs correctly to compute_shape
- Fix the overloads of compute_shape so that when passed an empty vector of module inputs it will call the overload without module inputs
- Add tests with contiguous and pointwise module function.
- Move add_pointwise function to a seperate header to reuse across different tests

785307c3

11 Nov, 2021 1 commit

Conditionally enable pointwise fusion (#992) · 157935ff

Paul Fultz II authored Nov 10, 2021

This enables the pointwise fusions using the MIGRAPHX_ENABLE_POINTWISE_FUSION env variable. Its disabled by default since MIOpen fusions need to be refactored.

This also adds a compile_ops pass to compile the pointwise modules. All tests except test_gpu_fast_math passes with MIGRAPHX_ENABLE_POINTWISE_FUSION=1 set.

157935ff

18 Oct, 2021 1 commit

Allow constructing an operation with a format string (#976) · 77164f3c

Paul Fultz II authored Oct 18, 2021

Designed to allow a user to format the values needed for the json_string: migraphx::operation("reduce_mean", "{axes : [%i, %i, %i, %i]}", axes[0], axes[1], axes[2], axes[3]) instead of needing to use string concat or stringstream

77164f3c

15 Oct, 2021 1 commit

Enabling rocTX markers for migraphx-driver via roctx knob (#946) · 4a71ec8c

Cagri authored Oct 14, 2021



Added features:
This enables wrapping each migraphx operator with rocTX markers.
It adds new knob trace to migraphx-driver binary.

Limitation:

rocTX standalone does not output a file, it needs to be used with rocprof. Example command line:

/opt/rocm/bin/rocprof -i ./in.txt --hip-trace --roctx-trace --flush-rate 10ms --timestamp on -d cagri_out --obj-tracking on /opt/rocm/bin/migraphx-driver trace ./resnet50-v2-7.onnx --onnx --gpu
Co-authored-by: Shucai Xiao <shucai@gmail.com>

4a71ec8c

23 Sep, 2021 1 commit
- Make `compile_options` an opaque object for ABI compatibility (#955) · 95431eb7
  Umang Yadav authored Sep 23, 2021
```
Add forward compatibility support for compile options 
```
  95431eb7
17 Sep, 2021 1 commit

Make `file_options` an opaque object for ABI compatibility (#953) · 31dc067e

Umang Yadav authored Sep 17, 2021



make file_options struct an opaque object for ABI compatibility, uses make generate to auto-generate and includes  modified tests.
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>

31dc067e

16 Sep, 2021 1 commit

Loop operator (#853) · a275f590

Shucai Xiao authored Sep 16, 2021

Add Loop operator for opset version 13.
Notes: 1) Default max iteration number is 10 if no max iteration number is provided
2) To change the max iter number, a user can set the max_loop_iterations in the onnx_option struct when parsing a model.
3) The returned shape of the scan output is from the max_loop_iterations even the actual loop num is less than that. This issue also applies to other operators like NonZero and NonMaxSuppression. A issue #948 is created to track this and to be resolved later.
Co-authored-by: Paul <pfultz2@yahoo.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

a275f590

07 Sep, 2021 1 commit
- Allow creating modules in a module pass (#931) · ac0f79aa
  Paul Fultz II authored Sep 07, 2021
```
* Add module pass manage
```
  ac0f79aa
24 Aug, 2021 1 commit
- Check the dockerfile installed the dependencies correctly (#920) · 6be85674
  Paul Fultz II authored Aug 24, 2021
  
  6be85674
05 Aug, 2021 1 commit

Test runner (#854) · 30966f6b

Shucai Xiao authored Aug 05, 2021



* add python test runner

* fix review comments

* move test runner to the tools folder

* raise an error if some cases failed

* clang format

* fix review comments
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
Co-authored-by: Chris Austen <causten@users.noreply.github.com>

30966f6b

30 Jul, 2021 1 commit
- Fix pyflakes issue in tools (#899) · a110207a
  Paul Fultz II authored Jul 30, 2021
  
  a110207a
08 Jul, 2021 1 commit

Preallocate parameters on the CPU and unify preallocations (#840) · 427fc25c

Paul Fultz II authored Jul 08, 2021



* Add preallocate method

* Add preallocate_param pass

* Preallocate buffers on the cpu

* Formatting

* Preallocate on the gpu

* Add missing cpp file

* Formatting

* Add lifetime function

* Formatting

* Always allocate

* Fix tidy warning

* Add const

* Add missing lifetime annotations
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>

427fc25c