1. 24 Jun, 2022 2 commits
    • Yifan Xiong's avatar
      Fix incorrect ulimit config in Dockerfile (#364) · 325a7338
      Yifan Xiong authored
      Fix incorrect ulimit nofile config in Dockerfile.
      
      Instead of bash, sh is used by default where `echo` does not accept any parameters and `-e` is written into /etc/security/limits.conf.
      325a7338
    • Yifan Xiong's avatar
      Support multiple IB/GPU in ib validation (#363) · bfaa1c83
      Yifan Xiong authored
      **Description**
      
      Support multiple IB/GPU devices run simultaneously in ib validation benchmark.
      
      **Major Revisions**
      - Revise ib_validation_performance.cc so that multiple processes per node could be used to launch multiple perftest commands simultaneously. For each node pair in the config, number of processes per node will run in parallel.
      - Revise ib_validation_performance.py to correct file paths and adjust parameters to specify different NICs/GPUs/NUMA nodes.
      - Fix env issues in Dockerfile for end-to-end test.
      - Update ib-traffic configuration examples in config files.
      - Update unit tests and docs accordingly.
      
      Closes #326.
      bfaa1c83
  2. 15 Jun, 2022 1 commit
    • Yifan Xiong's avatar
      Fix cmake and build issues (#360) · 60a3c743
      Yifan Xiong authored
      **Description**
      
      Fix cmake and build issues.
      
      **Major Revision**
      
      * Remove unnecessary boost build
      * Remove user-agent for mlc
      * Remove -j for third party to build each project in sequence
      * Fix ansible collections installation path
      60a3c743
  3. 31 May, 2022 1 commit
  4. 08 Feb, 2022 1 commit
  5. 13 Dec, 2021 1 commit
  6. 10 Dec, 2021 1 commit
  7. 30 Oct, 2021 1 commit
  8. 02 Sep, 2021 1 commit
  9. 01 Sep, 2021 2 commits
  10. 29 Jul, 2021 1 commit
  11. 16 Jul, 2021 2 commits
  12. 16 Jun, 2021 1 commit
    • Yifan Xiong's avatar
      Dockerfile - Update CUDA 11.1.1 Dockerfile (#96) · 25ec3a7c
      Yifan Xiong authored
      Update packages and add build cache for CUDA 11.1.1 Dockerfile:
      
      * Remove duplicate cmake and ompi, which are already in base image
      * Add hpcx and sharp lib
      * Add cache for gitmodules build
      * Sort apt-get packages
      25ec3a7c
  13. 01 Jun, 2021 2 commits
  14. 18 May, 2021 1 commit
  15. 17 May, 2021 1 commit
  16. 14 Apr, 2021 1 commit
  17. 13 Apr, 2021 1 commit
  18. 12 Apr, 2021 1 commit