- 15 Aug, 2024 1 commit
-
-
gilbertlee-amd authored
* Fixing potential out-of-bounds write during topology detection
* Fixing CU_MASK for multi-XCD GPUs
* Adding sub-iterations via NUM_SUBITERATIONS
* Adding support for variable subexecutor Transfers
* Adding healthcheck preset
-
- 03 Apr, 2024 1 commit
-
-
gilbertlee-amd authored
* Adding pcopy benchmark, fixing CPU kernel on null destination
-
- 02 Feb, 2024 1 commit
-
-
gilbertlee-amd authored
* Adding targeted DMA engine support
* Fixing CUDA compilation for H100
-
- 09 Jan, 2024 2 commits
-
-
gilbertlee-amd authored
-
gilbertlee-amd authored
-
- 05 Dec, 2023 1 commit
-
-
gilbertlee-amd authored
* v1.45 New GFX kernel
-
- 28 Nov, 2023 1 commit
-
-
gilbertlee-amd authored
-
- 24 Nov, 2023 1 commit
-
-
gilbertlee-amd authored
-
- 17 Oct, 2023 1 commit
-
-
gilbertlee-amd authored
* Adding xccID output to SHOW_ITERATIONS
-
- 13 Oct, 2023 1 commit
-
-
gilbertlee-amd authored
-
- 28 Sep, 2023 1 commit
-
-
gilbertlee-amd authored
-
- 19 Sep, 2023 1 commit
-
-
gilbertlee-amd authored
-
- 05 Jun, 2023 1 commit
-
-
gilbertlee-amd authored
-
- 29 Mar, 2023 1 commit
-
-
gilbertlee-amd authored
* Adding source prep kernel (USE_PREP_KERNEL)
* Adding nvcc-only compilation path
* Fix for NVIDIA - set shared mem usage to 0 by default
* Updating default fill pattern for source data
* Restoring missing example.cfg file
-
- 17 Feb, 2023 1 commit
-
-
PedramAlizadeh authored
-
- 31 Jan, 2023 1 commit
-
-
gilbertlee-amd authored
-
- 20 Jan, 2023 1 commit
-
-
gilbertlee-amd authored
* Adding MIMO support, DMA executor, Null memory type
-
- 08 Apr, 2022 1 commit
-
-
Gilbert Lee authored
-