Commits · c2bafa5dba861ea170926b775969e79c632ef81b · gaoqiong / MIGraphX

28 Sep, 2023 1 commit

Add options to set tolerances inside MIGraphX driver (#2213) · 69d8d789

Umang Yadav authored Sep 28, 2023

MIGraphX verification by default uses normalized RMS error as the basis for the verification.  This change adds some logic to allow migraphx to do "np.allclose" type of elementwise verification using atol and rtol.

Commit also includes changes to consistently pass "gold" or "expected" results as the second argument for "verify_range()" calls.  Default RMS tolerance inside driver is set to 0.001 which IMO is high for FP32 compared to what we had earlier. Need better defaults

69d8d789

14 Sep, 2023 1 commit
- Print warning about miopen_fusion while generating mxr (#2082) · 752f13cf
  Umang Yadav authored Sep 14, 2023
```
MIOpen fusions are not serialized with tuned solutions. Print warnings for such cases.
```
  752f13cf
16 Jul, 2023 1 commit
- add verify namespace (#1952) · 68a9a23f
  Umang Yadav authored Jul 16, 2023
  
  68a9a23f
22 Jun, 2022 1 commit
- Update license files (#1248) · e44cecbc
  Ted Themistokleous authored Jun 22, 2022
```
Updated each source file in the repo with the existing license.
```
  e44cecbc
04 Nov, 2020 1 commit

Split cpu and reference implementation (#671) · 500d9441

Paul Fultz II authored Nov 04, 2020



* Add all_targets cmake target

* Rename target

* Add ref target

* Rename tests

* Refactor compiler target

* Formatting

* Verify for every target

* Formatting

* Add verify test suite

* Formatting

* Add initial test programs

* Formatting

* Add rnn tests

* Formatting

* Validate gpu

* Formatting

* Remove old gpu tests

* Fix gpu tests

* Fix ref error

* Fix tidy issues

* Formatting

* Tidy fixes

* Fix header in python api

* Rename to ref

* Use ref in verify_onnx

* Fix tidy issue

* Build with verbose on

* Fix typo

* Remove verbose

* rename some cpu prefix to ref
Co-authored-by: Shucai Xiao <Shucai.Xiao@amd.com>

500d9441

25 Aug, 2020 1 commit

Improve layernorm performance (#613) · 56b3bf58

Paul Fultz II authored Aug 25, 2020

* Use increment instead of division to compute register offset

* Formatting

* Limit layernorm to 1024 elements

* Formatting

* Add verification to driver

* Formatting

* Remove early return

* Use block_size 256

* Vectorize the kernel

* Formatting

* Convert to vector type

* Add layernorm tests

* Formatting

* Formatting

* Refactor layernorm to run both algos

* Formatting

* Fix compile error

* Fix tidy warnings

* Formatting

* Add layernorm function

* Formatting

56b3bf58

27 Nov, 2018 1 commit
- Rename more things to migraphx · 0b217041
  Paul authored Nov 27, 2018
  
  0b217041
14 Nov, 2018 1 commit
- Rename to migraphx · 96358e41
  Paul authored Nov 14, 2018
  
  96358e41
06 Nov, 2018 4 commits
- clang format for all changed files. · ca69e522
  Shucai Xiao authored Nov 05, 2018
  
  ca69e522
- Added inline namespace for all .hpp and .cpp file. · d79346d1
  Shucai Xiao authored Nov 05, 2018
  
  d79346d1
- clang format for all changed files. · a4026def
  Shucai Xiao authored Nov 05, 2018
  
  a4026def
- Added inline namespace for all .hpp and .cpp file. · e1ef1e17
  Shucai Xiao authored Nov 05, 2018
  
  e1ef1e17
18 Sep, 2018 1 commit
- Print program on failure · 385411e6
  Paul authored Sep 18, 2018
  
  385411e6
17 Sep, 2018 3 commits
- Formatting · 93562700
  Paul authored Sep 17, 2018
  
  93562700
- Print out info if data is zeros or nans · f0fd2995
  Paul authored Sep 17, 2018
  
  f0fd2995
- Add maxdiff · ee346df0
  Paul authored Sep 16, 2018
  
  ee346df0
30 Aug, 2018 1 commit
- Print out error amount · 8dbabda5
  Paul authored Aug 29, 2018
  
  8dbabda5
29 Aug, 2018 2 commits
- Formatting · d2778c9e
  Paul authored Aug 29, 2018
  
  d2778c9e
- Add missing file · 87ea2934
  Paul authored Aug 29, 2018
  
  87ea2934