- 27 Jun, 2025 1 commit
Chenggang Zhao authored
- 11 Jun, 2025 1 commit
Chenggang Zhao authored
* Update README
* Update `setup.py`
* Fix headers
* Add `DISABLE_NVSHMEM` for APIs
* Fix launch
* Fix TMA settings
* Fix TMA usages
* Fix dlink
* Separate layout kernels
* Update version
* Add `is_sm90_compiled`
* Fix tests
* Add NVLink connection checks
* Update README
* Fix tests
* Add some comments
* Minor fix
* Minor fix
* Fix bugs
- 19 May, 2025 1 commit
guyueh1 authored
* Add 10.0 to TORCH_CUDA_ARCH_LIST
* Revert csrc/CMakeLists change; in setup.py make TORCH_CUDA_ARCH_LIST configurable

Signed-off-by: Guyue Huang <guyueh@nvidia.com>
- 25 Feb, 2025 1 commit
Chenggang Zhao authored