- 03 Oct, 2022 1 commit
-
-
Jeff Daily authored
* first step, everything compiles * fix rebuilds; skip cuda version check for rocm * use macro for __shfl_up_sync __shfl_down_sync * add BFloat16 support for ROCm and CUDA * add USE_ROCM definition to setup.py * flake8 fixes
-
- 29 Jun, 2019 1 commit
-
-
rusty1s authored
-
- 04 Aug, 2018 1 commit
-
-
rusty1s authored
-
- 19 Apr, 2018 1 commit
-
-
rusty1s authored
-
- 07 Apr, 2018 1 commit
-
-
rusty1s authored
-
- 29 Jan, 2018 1 commit
-
-
rusty1s authored
-
- 27 Dec, 2017 1 commit
-
-
rusty1s authored
-
- 22 Dec, 2017 3 commits
- 18 Dec, 2017 2 commits
- 16 Dec, 2017 3 commits