1. 05 Jan, 2026 1 commit
    • gilbertlee-amd's avatar
      TransferBench v1.66 - Multi-Rank support (#224) · bbd72a6c
      gilbertlee-amd authored
      * Adding System singleton to support multi-node (communication and topology)
      * Adding multi-node parsing, rank and device wildcard expansion
      * Adding multi-node topology, and various support functions
      * Adding multi-node consistency validation of Config and Transfers
      * Introducing SINGLE_KERNEL=1 to Makefile to speed up compilation during development
      * Updating CHANGELOG.  Overhauling wildcard parsing.  Adding dryrun
      * Client refactoring.  Introduction of tabular formatted results and a2a multi-rank preset
      * Adding MPI support into CMakeFiles
      * Cleaning up multi-node topology using TableHelper
      * Reducing compile time by removing some kernel variants
      * Updating documentation.  Adding nicrings preset
      * Adding NIC_FILTER to allow NIC device filtering via regex
      * Updating supported memory types
      * Fixing P2P preset, and adding some extra memIndex utility functions
      bbd72a6c
  2. 09 Jun, 2025 1 commit
    • gilbertlee-amd's avatar
      TransferBench v1.62.00 (#181) · 569287a7
      gilbertlee-amd authored
      * Adding non-temporal loads and stores via GFX_TEMPORAL
      * Adding additional summary details to a2a preset
      * Add SHOW_MIN_ONLY for a2asweep preset
      * Adding new P CPU memory type which is indexed by closest GPU
      569287a7
  3. 21 Jan, 2025 1 commit
  4. 29 Mar, 2023 1 commit
    • gilbertlee-amd's avatar
      Source Prep Kernel / Better NVCC support (#18) · e6f64e97
      gilbertlee-amd authored
      * Adding source prep kernel (USE_PREP_KERNEL)
      * Adding nvcc-only compilation path
      * Fix for NVIDIA - set shared mem usage to 0 by default
      * Updating default fill pattern for source data
      * Restoring missing example.cfg file
      e6f64e97
  5. 24 Mar, 2023 1 commit
  6. 17 Feb, 2023 1 commit
  7. 20 Jan, 2023 1 commit
  8. 15 Sep, 2022 1 commit
    • gilbertlee-amd's avatar
      V1.06 (#4) · be0db711
      gilbertlee-amd authored
      * Updating version to v1.06
      * Fixing CPU NUMA allocation
      * Fix random sweep repeatability
      * Adding unpinned CPU memory as possible memory type
      * Adding ability to customize per-transfer byte sizes
      * Updating advanced configuration file mode to take in numBytes per Transfer
      * Adding logging of sweep tests configuration to lastSweep.cfg
      * Add ability to specify #CUs for sweep benchmark
      be0db711
  9. 27 Apr, 2022 1 commit
  10. 08 Apr, 2022 1 commit