- 09 Oct, 2024 1 commit
-
-
gilbertlee-amd authored
* Adding USE_HSA_DMA to switch to using hsa_amd_memory_async_copy in lieu of hipMemcpyAsync * Adding USE_GPU_DMA for A2A benchmark * Adding largeBAR check and fix for 0-hop GPU-CPU links
-
- 02 Feb, 2024 1 commit
-
-
gilbertlee-amd authored
* Adding targeted DMA engine support * Fixing CUDA compilation for H100
-
- 29 Mar, 2023 1 commit
-
-
gilbertlee-amd authored
* Adding source prep kernel (USE_PREP_KERNEL) * Adding nvcc-only compilation path * Fix for NVIDIA - set shared mem usage to 0 by default * Updating default fill pattern for source data * Restoring missing example.cfg file
-
- 17 Feb, 2023 1 commit
-
-
PedramAlizadeh authored
-
- 08 Apr, 2022 1 commit
-
-
Gilbert Lee authored
-