1. 05 Jan, 2026 1 commit
    • gilbertlee-amd's avatar
      TransferBench v1.66 - Multi-Rank support (#224) · bbd72a6c
      gilbertlee-amd authored
      * Adding System singleton to support multi-node (communication and topology)
      * Adding multi-node parsing, rank and device wildcard expansion
      * Adding multi-node topology, and various support functions
      * Adding multi-node consistency validation of Config and Transfers
      * Introducing SINGLE_KERNEL=1 to Makefile to speed up compilation during development
      * Updating CHANGELOG.  Overhauling wildcard parsing.  Adding dryrun
      * Client refactoring.  Introduction of tabular formatted results and a2a multi-rank preset
      * Adding MPI support into CMakeFiles
      * Cleaning up multi-node topology using TableHelper
      * Reducing compile time by removing some kernel variants
      * Updating documentation.  Adding nicrings preset
      * Adding NIC_FILTER to allow NIC device filtering via regex
      * Updating supported memory types
      * Fixing P2P preset, and adding some extra memIndex utility functions
      bbd72a6c
  2. 13 Nov, 2025 1 commit
  3. 04 Sep, 2025 1 commit
    • gilbertlee-amd's avatar
      v1.64.00 (#198) · a824bc1b
      gilbertlee-amd authored
      * Added BLOCKSIZES to a2asweep preset to allow sweeping over threadblock sizes
      * Fixing src initialization when using BYTE_OFFSET
      * Adding FILL_COMPRESS functionality to allow for different input data patterns
      * Updating CHANGELOG regarding GFX_BLOCKSIZE limit increase to 1024
      a824bc1b
  4. 08 Aug, 2025 1 commit
    • gilbertlee-amd's avatar
      TransferBench v1.63 (#193) · 023ce416
      gilbertlee-amd authored
      
      
      * Fixing issue with P memory type and use of DMA subexecutor
      * CMake builds require explicit opt-in by setting NIC_EXEC_ENABLE=1
      * Removing self-GPU check for DMA engine copies
      * [BUILD] Add new GPU targets and switch to amdclang++ (#187)
      * [BUILD] Add gfx950, gfx1150, and gfx1151 targets
      * [BUILD] Modify CMake to use amdclang++
      * [BUILD] Modify Makefile to use amdclang++
      * [GIT] Updated CHANGELOG and .gitignore
      * Adding HBM testing to healthcheck preset
      * Tweaking HBM tests to occur first, and provide more info during VERBOSE=1
      * Fixing timing reporting issues with NUM_SUBITERATIONS
      * [BUILD] Simplify Makefile (#190)
      * Combines steps for compilation and linking
      * Does not rebuild if no change to source code
      
      * Updating CHANGELOG
      
      ---------
      Co-authored-by: default avatarNilesh M Negi <Nilesh.Negi@amd.com>
      023ce416
  5. 09 Jun, 2025 1 commit
    • gilbertlee-amd's avatar
      TransferBench v1.62.00 (#181) · 569287a7
      gilbertlee-amd authored
      * Adding non-temporal loads and stores via GFX_TEMPORAL
      * Adding additional summary details to a2a preset
      * Add SHOW_MIN_ONLY for a2asweep preset
      * Adding new P CPU memory type which is indexed by closest GPU
      569287a7
  6. 28 Feb, 2025 1 commit
  7. 30 Jan, 2025 1 commit
  8. 24 Jan, 2025 1 commit
  9. 21 Jan, 2025 1 commit
  10. 05 Dec, 2024 1 commit
  11. 02 Dec, 2024 1 commit
  12. 28 Nov, 2024 1 commit
  13. 26 Nov, 2024 1 commit
  14. 22 Nov, 2024 1 commit
  15. 21 Nov, 2024 1 commit