- 08 Jan, 2026 1 commit
-
-
Gahan Saraiya authored
Signed-off-by:Gahan Saraiya <Gahan.Saraiya@amd.com>
-
- 05 Jan, 2026 1 commit
-
-
gilbertlee-amd authored
* Adding System singleton to support multi-node (communication and topology) * Adding multi-node parsing, rank and device wildcard expansion * Adding multi-node topology, and various support functions * Adding multi-node consistency validation of Config and Transfers * Introducing SINGLE_KERNEL=1 to Makefile to speed up compilation during development * Updating CHANGELOG. Overhauling wildcard parsing. Adding dryrun * Client refactoring. Introduction of tabular formatted results and a2a multi-rank preset * Adding MPI support into CMakeFiles * Cleaning up multi-node topology using TableHelper * Reducing compile time by removing some kernel variants * Updating documentation. Adding nicrings preset * Adding NIC_FILTER to allow NIC device filtering via regex * Updating supported memory types * Fixing P2P preset, and adding some extra memIndex utility functions
-
- 06 Oct, 2025 1 commit
-
-
Mustafa Abduljabbar authored
-
- 04 Sep, 2025 1 commit
-
-
gilbertlee-amd authored
* Added BLOCKSIZES to a2asweep preset to allow sweeping over threadblock sizes * Fixing src initialization when using BYTE_OFFSET * Adding FILL_COMPRESS functionality to allow for different input data patterns * Updating CHANGELOG regarding GFX_BLOCKSIZE limit increase to 1024
-
- 08 Aug, 2025 1 commit
-
-
gilbertlee-amd authored
* Fixing issue with P memory type and use of DMA subexecutor * CMake builds require explicit opt-in by setting NIC_EXEC_ENABLE=1 * Removing self-GPU check for DMA engine copies * [BUILD] Add new GPU targets and switch to amdclang++ (#187) * [BUILD] Add gfx950, gfx1150, and gfx1151 targets * [BUILD] Modify CMake to use amdclang++ * [BUILD] Modify Makefile to use amdclang++ * [GIT] Updated CHANGELOG and .gitignore * Adding HBM testing to healthcheck preset * Tweaking HBM tests to occur first, and provide more info during VERBOSE=1 * Fixing timing reporting issues with NUM_SUBITERATIONS * [BUILD] Simplify Makefile (#190) * Combines steps for compilation and linking * Does not rebuild if no change to source code * Updating CHANGELOG --------- Co-authored-by:Nilesh M Negi <Nilesh.Negi@amd.com>
-
- 28 Feb, 2025 1 commit
-
-
gilbertlee-amd authored
Co-authored-by:Mustafa Abduljabbar <mustafa.abduljabbar@amd.com>
-
- 21 Jan, 2025 1 commit
-
-
gilbertlee-amd authored
Adding NIC execution capabilities, various bug fixes introduced by header-only-library refactor --------- Co-authored-by:Mustafa Abduljabbar <mustafa.abduljabbar@amd.com>
-
- 28 Nov, 2024 1 commit
-
-
gilbertlee-amd authored
* Removing C++20 dependencies, modified how version is reported * Changing GFX_SINGLE_TEAM=0 by default
-
- 26 Nov, 2024 1 commit
-
-
gilbertlee-amd authored
-
- 22 Nov, 2024 1 commit
-
-
gilbertlee-amd authored
-
- 21 Nov, 2024 1 commit
-
-
gilbertlee-amd authored
-
- 17 Feb, 2023 1 commit
-
-
PedramAlizadeh authored
-
- 31 Jan, 2023 1 commit
-
-
gilbertlee-amd authored
-
- 20 Jan, 2023 1 commit
-
-
gilbertlee-amd authored
* Adding MIMO support, DMA executor, Null memory type
-
- 08 Apr, 2022 1 commit
-
-
Gilbert Lee authored
-