- 03 Oct, 2022 1 commit
Jeff Daily authored
* first step, everything compiles
* fix rebuilds; skip cuda version check for rocm
* use macro for __shfl_up_sync __shfl_down_sync
* add BFloat16 support for ROCm and CUDA
* add USE_ROCM definition to setup.py
* flake8 fixes
- 23 Jul, 2022 1 commit
Gerico Vidanes authored
- 24 Mar, 2022 1 commit
Matthias Fey authored
- 28 Jul, 2021 1 commit
rusty1s authored
- 29 Jan, 2020 1 commit
rusty1s authored