- 17 May, 2022 13 commits
- 13 May, 2022 1 commit
-
-
Chris Austen authored
Our documentation indicates a user with sudo can run the install_prereqs.sh file. Turns out that the file is not complete enough to run on Ubuntu 18.04/20.04 independently. I updated the file to resolve the failures. resolves #1191
-
- 12 May, 2022 2 commits
- 11 May, 2022 14 commits
-
-
turneram authored
-
Paul Fultz II authored
Fuse layernorm and added triadd_layernorm fusion. This is a prep performance booster
-
turneram authored
-
turneram authored
-
turneram authored
-
turneram authored
-
turneram authored
-
turneram authored
-
turneram authored
-
turneram authored
-
turneram authored
-
turneram authored
-
turneram authored
-
Chris Austen authored
ONNX Models changed from master to main. Changing path reflect the proper location
-
- 10 May, 2022 1 commit
-
-
Umang Yadav authored
Expose add_literal method in C/C++ api
-
- 09 May, 2022 1 commit
-
-
Paul Fultz II authored
Improves performance for add_gelu. In bert it is 4x faster and for mul_add it is 50% faster than what we current have.
-
- 06 May, 2022 2 commits
-
-
Chris Austen authored
Move to CI containers to rocm 5.0.2 upgrade to 20.04 free up some more file space in github action environments
-
Paul Fultz II authored
Add compile tests for gpu math functions
-
- 05 May, 2022 1 commit
-
-
Paul Fultz II authored
Fixes the #error when using cppcheck. This no longer suppresses cppcheck errors when including those errors. This fixes the cppcheck errors that was there already.
-
- 03 May, 2022 5 commits