- 03 Oct, 2022 1 commit
Jeff Daily authored
* first step, everything compiles
* fix rebuilds; skip cuda version check for rocm
* use macro for __shfl_up_sync __shfl_down_sync
* add BFloat16 support for ROCm and CUDA
* add USE_ROCM definition to setup.py
* flake8 fixes
- 23 Jul, 2022 1 commit
Gerico Vidanes authored
- 24 Mar, 2022 1 commit
Matthias Fey authored
- 22 Oct, 2021 1 commit
rusty1s authored
- 28 Jul, 2021 1 commit
rusty1s authored
- 19 Jul, 2021 2 commits
rusty1s authored
Jacob Zhong authored
* compile with half
* Fix
* fix
* rename
* update
* update
* update
* update
* typo

Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>
- 30 May, 2021 1 commit
rusty1s authored
- 23 Nov, 2020 2 commits
- 24 Aug, 2020 1 commit
rusty1s authored
- 16 Jul, 2020 1 commit
rusty1s authored
- 16 Apr, 2020 1 commit
rusty1s authored
- 04 Feb, 2020 1 commit
rusty1s authored
- 03 Feb, 2020 1 commit
rusty1s authored
- 30 Jan, 2020 3 commits
- 29 Jan, 2020 2 commits