- 22 Nov, 2024 1 commit
-
-
Hongtao Zhang authored
**Description** Add nvbandwidth benchmark. --------- Co-authored-by:hongtaozhang <hongtaozhang@microsoft.com>
-
- 21 Nov, 2024 2 commits
-
-
Hongtao Zhang authored
**Description** Add nvbandwidth build to repo --------- Co-authored-by:hongtaozhang <hongtaozhang@microsoft.com>
-
Yifan Xiong authored
Update CODEOWNERS for docs.
-
- 20 Nov, 2024 1 commit
-
-
Hongtao Zhang authored
**Description** Add micro benchmark to measure general CPU bandwidth and latency without 'mlc'. Test output: ``` { "cpu-memory-bw-latency/return_code": 0, "cpu-memory-bw-latency/mem_bandwidth_matrix_numa_0_1_bw": 5388.75021, "cpu-memory-bw-latency/mem_bandwidth_matrix_numa_0_1_lat": 0.185571786, "cpu-memory-bw-latency/mem_bandwidth_matrix_numa_1_0_bw": 4634.82028, "cpu-memory-bw-latency/mem_bandwidth_matrix_numa_1_0_lat": 0.215758096, } ``` --------- Co-authored-by:hongtaozhang <hongtaozhang@microsoft.com>
-
- 15 Nov, 2024 1 commit
-
-
Hongtao Zhang authored
**Description** Bump onnxruntime-gpu from 1.10.0 to 1.12.0. --------- Co-authored-by:hongtaozhang <hongtaozhang@microsoft.com>
-
- 07 Nov, 2024 2 commits
-
-
dependabot[bot] authored
Bumps [webpack](https://github.com/webpack/webpack) from 5.76.1 to 5.96.1. - [Release notes](https://github.com/webpack/webpack/releases ) - [Commits](webpack/webpack@v5.76.1...v5.96.1) --- updated-dependencies: - dependency-name: webpack dependency-type: indirect ... Signed-off-by:
dependabot[bot] <support@github.com>
-
dependabot[bot] authored
Bumps [cookie](https://github.com/jshttp/cookie) and [express](https://github.com/expressjs/express). These dependencies needed to be updated together. Updates `cookie` from 0.6.0 to 0.7.1 - [Release notes](https://github.com/jshttp/cookie/releases) - [Commits](jshttp/cookie@v0.6.0...v0.7.1) Updates `express` from 4.21.0 to 4.21.1 - [Release notes](https://github.com/expressjs/express/releases) - [Changelog](https://github.com/expressjs/express/blob/4.21.1/History.md ) - [Commits](expressjs/express@4.21.0...4.21.1) --- updated-dependencies: - dependency-name: cookie dependency-type: indirect - dependency-name: express dependency-type: indirect ... Signed-off-by:
dependabot[bot] <support@github.com>
-
- 06 Nov, 2024 1 commit
-
-
pdr authored
Add support for arm64 build: - Updated dockerfile for arm64 build - extend cpu stream compilation for neoverse - handle onnxruntime-gpu installation - third party builds filtering based on arch - disable cuda decode perf build for non x86
-
- 05 Nov, 2024 1 commit
-
-
pdr authored
The current GPU Copy BW Performance fails on Nvidia Grace systems. This is due to the memory only numa node and thus the numa_run_on_node fails for such nodes and halts completely. This fix checks for the presence of assigned CPU cores for the numa node, on checking if it has no cpu cores assigned, it skips that specific node during the args creation and continues.
-
- 02 Nov, 2024 1 commit
-
-
Yifan Xiong authored
**Description** Update image build. **Major Revision** * Remove ROCm 6.0 image due to outdated packages * Remove build tag for ROCm * Preserve build cache for 30 days
-
- 10 Oct, 2024 1 commit
-
-
Yuting Jiang authored
**Description** Cherry pick bug fixes from v0.11.0 to main **Major Revision** * #645 * #648 * #646 * #647 * #651 * #652 * #650 --------- Co-authored-by:
hongtaozhang <hongtaozhang@microsoft.com> Co-authored-by:
Yifan Xiong <yifan.xiong@microsoft.com>
-
- 19 Sep, 2024 1 commit
-
-
dependabot[bot] authored
Bumps [serve-static](https://github.com/expressjs/serve-static) and [express](https://github.com/expressjs/express). These dependencies needed to be updated together. Updates `serve-static` from 1.15.0 to 1.16.2 - [Release notes](https://github.com/expressjs/serve-static/releases) - [Changelog](https://github.com/expressjs/serve-static/blob/v1.16.2/HISTORY.md) - [Commits](expressjs/serve-static@v1.15.0...v1.16.2) Updates `express` from 4.19.2 to 4.21.0 - [Release notes](https://github.com/expressjs/express/releases) - [Changelog](https://github.com/expressjs/express/blob/4.21.0/History.md ) - [Commits](expressjs/express@4.19.2...4.21.0) --- updated-dependencies: - dependency-name: serve-static dependency-type: indirect - dependency-name: express dependency-type: indirect ... Signed-off-by:
dependabot[bot] <support@github.com>
-
- 20 Aug, 2024 1 commit
-
-
Yang Wang authored
**Description** Fix executor for Benchmark Execution Without Explicit Framework Field
-
- 16 Aug, 2024 1 commit
-
-
Yuting Jiang authored
**Description** Fix bug of failure test and warning of pandas in data diagnosis. **Major Revision** - fix warning of pandas in replace and fillna due to type downcast - fix bug of failure check function only check one matched metric rather than all matched metrics - fix bug when converting regex into str of metrics when there're more than one match group
-
- 13 Aug, 2024 1 commit
-
-
Yang Wang authored
Add 10-hpcx.sh to /etc/profile.d Update the Docker exec command to ensure a persistent HPCX environment.
-
- 08 Aug, 2024 1 commit
-
-
Yang Wang authored
* https://pypi.org/project/types-pkg-resources/ * Use types-setuptools instead
-
- 28 Jul, 2024 1 commit
-
-
Yuting Jiang authored
**Description** Fix MSCCL build error in CUDA12.4 docker build pipeline due to OOM issue.
-
- 26 Jul, 2024 2 commits
-
-
Yuting Jiang authored
**Description** Add support GPU ARCH 8.9 for NVIDIA L4/L40/L40s GPUs in gemm-flops.
-
dependabot[bot] authored
Bumps [express](https://github.com/expressjs/express) from 4.18.2 to 4.19.2. - [Release notes](https://github.com/expressjs/express/releases) - [Changelog](https://github.com/expressjs/express/blob/master/History.md ) - [Commits](expressjs/express@4.18.2...4.19.2) --- updated-dependencies: - dependency-name: express dependency-type: indirect ... Signed-off-by:
dependabot[bot] <support@github.com>
-
- 25 Jul, 2024 3 commits
-
-
dependabot[bot] authored
Bumps [ws](https://github.com/websockets/ws) from 6.2.2 to 6.2.3. - [Release notes](https://github.com/websockets/ws/releases ) - [Commits](websockets/ws@6.2.2...6.2.3) --- updated-dependencies: - dependency-name: ws dependency-type: indirect ... Signed-off-by:
dependabot[bot] <support@github.com>
-
omahs authored
Docs - fix typos
-
dependabot[bot] authored
Bumps [ip](https://github.com/indutny/node-ip ) from 1.1.5 to 1.1.9. - [Commits](indutny/node-ip@v1.1.5...v1.1.9) --- updated-dependencies: - dependency-name: ip dependency-type: indirect ... Signed-off-by:
dependabot[bot] <support@github.com>
-
- 24 Jul, 2024 2 commits
-
-
dependabot[bot] authored
Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects) from 1.14.8 to 1.15.6. - [Release notes](https://github.com/follow-redirects/follow-redirects/releases ) - [Commits](follow-redirects/follow-redirects@v1.14.8...v1.15.6) --- updated-dependencies: - dependency-name: follow-redirects dependency-type: indirect ... Signed-off-by:
dependabot[bot] <support@github.com>
-
Yifan Xiong authored
Add BibTeX for citation in README and repo.
-
- 23 Jul, 2024 1 commit
-
-
Yang Wang authored
Update `omegaconf` version to [2.3.0](https://pypi.org/project/omegaconf/2.3.0/) as omegaconf 2.0.6 has a non-standard dependency specifier PyYAML>=5.1.*. pip 24.1 will enforce this behaviour change. Discussion can be found at https://github.com/pypa/pip/issues/12063.
-
- 22 Apr, 2024 1 commit
-
-
Yuting Jiang authored
**Description** Add CUDA 12.4 dockerfile. **Major Revision** - upgrade nvidia docker into 23.04 **Minor Revision** - upgrade hpcx into 2.18
-
- 18 Apr, 2024 1 commit
-
-
Yuting Jiang authored
**Description** Upgrade mlc to v3.11.
-
- 02 Apr, 2024 1 commit
-
-
Ziyue Yang authored
**Description** Adds hipblasLt tuning to dist-inference cpp implementation.
-
- 21 Mar, 2024 1 commit
-
-
Yang Wang authored
**Description** Cuda 12.2 image will report undfined symbol error due to incomplete LD_LIBRARY_PATH:  ### How to reproduce: 1. Deploy sb with cuda12.2 image ``` sb deploy -f local.ini -i superbench/superbench:v0.10.0-cuda12.2 ``` 2. Enter to the container ``` sudo docker exec -it sb-workspace bash ``` 3. Execute `mpirun`: ``` root@sb-container:~# mpirun mpirun: symbol lookup error: mpirun: undefined symbol: opal_libevent2022_event_base_loop ``` ### Fix to fix * Append hpcx_load into /etc/bash.bashrc for updaing env LD_LIBRARY_PATH in each time ---------
-
- 08 Jan, 2024 1 commit
-
-
Yifan Xiong authored
**Description** Cherry-pick bug fixes from v0.10.0 to main. **Major Revisions** * Benchmarks: Microbenchmark - Support different hipblasLt data types in dist_inference #590 * Benchmarks: Microbenchmark - Support in-place for NCCL/RCCL benchmark #591 * Bug Fix - Fix NUMA Domains Swap Issue in NDv4 Topology File #592 * Benchmarks: Microbenchmark - Add data type option for NCCL and RCCL tests #595 * Benchmarks: Bug Fix - Make metrics of dist-inference-cpp aligned with PyTorch version #596 * CI/CD - Add ndv5 topo file #597 * Benchmarks: Microbenchmark - Improve AMD GPU P2P performance with fine-grained GPU memory #593 * Benchmarks: Build Pipeline - fix nccl and nccl test version to 2.18.3 to resolve hang issue in cuda12.2 docker #599 * Dockerfile - Bug fix for rocm docker build and deploy #598 * Benchmarks: Microbenchmark - Adapt to hipblasLt data type changes #603 * Benchmarks: Micro benchmarks - Update hipblaslt metric unit to tflops #604 * Monitor - U...
-
- 11 Dec, 2023 1 commit
-
-
Ziyue Yang authored
**Description** `add_compile_options` will not work for ROCm build, change it to setting `CMAKE_CXX_FLAGS`.
-
- 10 Dec, 2023 1 commit
-
-
Ziyue Yang authored
**Description** Add distributed inference benchmark cpp implementation.
-
- 09 Dec, 2023 1 commit
-
-
Yuting Jiang authored
**Description** upgrade to rocm5.7 dockerfile. --------- Co-authored-by:yukirora <yuting.jiang@microsoft.com>
-
- 08 Dec, 2023 1 commit
-
-
Ziyue Yang authored
Benchmarks: Micro benchmark - Add one-to-all, all-to-one, all-to-all support to gpu_copy_bw_performance (#588) **Description** Add one-to-all, all-to-one, all-to-all support to gpu_copy_bw_performance, and fix performance bug in gpu_copy
-
- 07 Dec, 2023 2 commits
-
-
Ziyue Yang authored
**Description** Add MSCCL support for Nvidia GPU
-
Yuting Jiang authored
**Description** Megatron-LM/Megatron-Deepspeed GPT pretrain benchmark
-
- 05 Dec, 2023 1 commit
-
-
Ziyue Yang authored
**Description** Revise NCCL/RCCL benchmarks to graph mode add latency metrics.
-
- 04 Dec, 2023 1 commit
-
-
Yuting Jiang authored
**Description** Benchmarks: micro benchmark - Support cpu-gpu and gpu-cpu in ib-validation **Major Revision** - Support cpu-gpu and gpu-cpu in ib-validation **Minor Revision** - support multi msg size, multi direction, multi ib commands in ib-validation
-
- 27 Nov, 2023 1 commit
-
-
guoshzhao authored
**Description** Add AMD support in monitor. **Major Revision** - Add library pyrsmi to collect metrics. - Currently can get device_utilization, device_power, device_used_memory and device_total_memory.
-
- 22 Nov, 2023 1 commit
-
-
Yifan Xiong authored
Upgrade Docker image to CUDA 12.2 for H100: * upgrade base image to 23.10 * fix onnxruntime version in python3.10 * fix compilation errors
-