- 09 Jun, 2025 1 commit
-
-
gilbertlee-amd authored
* Adding non-temporal loads and stores via GFX_TEMPORAL * Adding additional summary details to a2a preset * Add SHOW_MIN_ONLY for a2asweep preset * Adding new P CPU memory type which is indexed by closest GPU
-
- 21 Jan, 2025 1 commit
-
-
gilbertlee-amd authored
Adding NIC execution capabilities, various bug fixes introduced by header-only-library refactor --------- Co-authored-by:Mustafa Abduljabbar <mustafa.abduljabbar@amd.com>
-
- 29 Mar, 2023 1 commit
-
-
gilbertlee-amd authored
* Adding source prep kernel (USE_PREP_KERNEL) * Adding nvcc-only compilation path * Fix for NVIDIA - set shared mem usage to 0 by default * Updating default fill pattern for source data * Restoring missing example.cfg file
-
- 24 Mar, 2023 1 commit
-
-
Sam Wu authored
* add read the docs configs * add examples to docs format example docs * formatting for configfile format doc page * generate doxygen docs
-
- 17 Feb, 2023 1 commit
-
-
PedramAlizadeh authored
-
- 20 Jan, 2023 1 commit
-
-
gilbertlee-amd authored
* Adding MIMO support, DMA executor, Null memory type
-
- 15 Sep, 2022 1 commit
-
-
gilbertlee-amd authored
* Updating version to v1.06 * Fixing CPU NUMA allocation * Fix random sweep repeatability * Adding unpinned CPU memory as possible memory type * Adding ability to customize per-transfer byte sizes * Updating advanced configuration file mode to take in numBytes per Transfer * Adding logging of sweep tests configuration to lastSweep.cfg * Add ability to specify #CUs for sweep benchmark
-
- 27 Apr, 2022 1 commit
-
-
Gilbert Lee authored
-
- 08 Apr, 2022 1 commit
-
-
Gilbert Lee authored
-