- 21 Jul, 2025 4 commits
-
-
Andrea Paris authored
-
Andrea Paris authored
-
Andrea Paris authored
-
apaaris authored
-
- 24 May, 2025 1 commit
-
-
Boris Bonev authored
-
- 09 Sep, 2024 1 commit
-
-
Boris Bonev authored
* changing distributed SHT to use dim=-3 as the channel dimension for distributed transpose
* Fixed formatting in tests
* Adding bash script for running test suite
* Replaced asserts regarding number of dimensions in tensor with checks
-
- 29 Aug, 2024 1 commit
-
-
Thorsten Kurth authored
-
- 05 Feb, 2024 1 commit
-
-
Thorsten Kurth authored
* initial implementation of distributed DISCO layer
* working distributed convolution
* working refactored serial conv transpose with torch kernel
* working distributed conv and transposed conv when using the python kernel
* working distributed convolution with torch kernel
* fixed triton kernel tests
* adding print statement to debug CI
* adjusting tolerances in local convolution unittest
---------
Co-authored-by: Boris Bonev <bbonev@nvidia.com>
-
- 14 Dec, 2023 1 commit
-
-
Boris Bonev authored
* Fixing the precomputation of the Psi matrix
* moving tests to a parent folder
* removing tqdm
* deleting deprecated distributed tests
* Moving distributed test
* adapting workflow
* Added some more comments to make the code more understandable
* more detailed explanation for the derivation of the rotation angles
* added another comment
---------
Co-authored-by: Thorsten Kurth <tkurth@nvidia.com>
-
- 07 Dec, 2023 1 commit
-
-
Thorsten Kurth authored
* working distributed fwd SHT with flexible sharding and without padding
* fixing distributed sht with new logic
* fixed distributed SHT and ISHT with flexible padding
* working unittest for distributed fwd SHT
* distributed tests converted to unit tests
* bumping version number up to 0.6.4
* fixing small splitting bug in th distributed
* updated changelog, removed tests folder
-