- 21 Dec, 2025 1 commit
-
-
Hongtao Zhang authored
**Description** Azure pipeline cpu-unit-test failed for "2025-12-10T03:47:59.0628597Z ERROR: Could not install packages due to an OSError: [Errno 28] No space left on device" **Root Cause** This happens because the matrix jobs (Python 3.7, 3.10, 3.12) run in parallel and share the same VM's disk. Python 3.12 downloads newer/larger packages (especially PyTorch and NVIDIA CUDA libraries which are ~3GB+), and when multiple jobs run simultaneously, they exhaust the disk space. **Fix** Disable the cache usage when installing SB Co-authored-by:Hongtao Zhang <hongtaozhang@microsoft.com>
-
- 05 Nov, 2025 1 commit
-
-
Hongtao Zhang authored
Python3.10 verification pipeline failed for conflict 'setuptools' version as below. <img width="1157" height="622" alt="image" src="https://github.com/user-attachments/assets/ba0f6045-4b92-4fd8-b92f-1c474725534c " /> Root Cause: The problem is that modern pip (25.3) uses an isolated build environment with the latest setuptools by default. The pipeline installs setuptools 65.7 in the user environment, but pip builds the package in an isolated environment with newer setuptools, which conflicts with the version check in [setup.py]. Solution: Remove pip upgrade. --------- Co-authored-by:
Hongtao Zhang <hongtaozhang@microsoft.com>
-
- 22 Oct, 2025 1 commit
-
-
Hongtao Zhang authored
**Description** Python3.10 pipeline failed. **Solution** From log, 'bc' cmd is missing. Since our image tags are simple, the solution is to remove 'bc' cmd directly. --------- Co-authored-by:Hongtao Zhang <hongtaozhang@microsoft.com>
-
- 25 Feb, 2025 1 commit
-
-
Hongtao Zhang authored
Added support for Python 3.11, 3.12 and 3.13. yapf is not compatiable with python3.12+, so we disable yapf in py3.12 for now. https://github.com/google/yapf/issues/1258 https://github.com/google/yapf/issues/1266 --------- Co-authored-by:
hongtaozhang <hongtaozhang@microsoft.com>
-
- 27 Nov, 2024 1 commit
-
-
Yifan Xiong authored
Upgrade dependency versions in Azure pipeline: * Remove Python 3.6 and add Python 3.10 for cpu-unit-test * Upgrade CUDA from 11.1 to 12.4 for cuda-unit-test * Update labels accordingly --------- Co-authored-by:Dilip Patlolla <dilipreddi@gmail.com>
-
- 22 Nov, 2023 1 commit
-
-
Yuting Jiang authored
**Description** hipblaslt function benchmark and rebase cublaslt function benchmark.
-
- 22 Aug, 2023 1 commit
-
-
Yuting Jiang authored
**Description** source code for evaluating NVDEC decoding performance. --------- Co-authored-by:yukirora <yuting.jiang@microsoft.com>
-
- 06 Mar, 2023 1 commit
-
-
Yifan Xiong authored
Pin setuptools version to [v65.7.0](https://setuptools.pypa.io/en/latest/history.html#v65-7-0) to avoid breaking changes since v66.0.0.
-
- 17 Feb, 2023 1 commit
-
-
Yuting Jiang authored
**Description** Upgrade networkx version to fix installation compatibility issue.
-
- 17 Aug, 2022 1 commit
-
-
Yifan Xiong authored
__Description__ Update Python setup for require packages. __Major Revisions__ * downgrade requests version to be compatible with python 3.6, add corresponding pipeline for 3.6 * add extra entry in extras_require for nested packages * update `pip install` contents accordingly
-
- 13 Aug, 2022 1 commit
-
-
Yang Wang authored
An enhancement for topo-aware IB performance validation #373. This PR will auto-generate a required ibstate file `ib_traffic_topo_aware_ibstat.txt` which is used as input to build a graph.
-
- 10 Dec, 2021 2 commits
-
-
guoshzhao authored
**Description** Add ONNXRuntime inference benchmark based on ORT python API. **Major Revision** - Add `ORTInferenceBenchmark` class to export pytorch model to onnx model and do inference - Add tests and example for `ort-inference` benchmark - Update the introduction docs.
-
Yuting Jiang authored
**Description** Add basic analysis features. **Major Revision** - Add statistics, correlations of the raw data - Add numeric outlier detection(inter_quartile_range) - Add boxplot for selected metric
-
- 22 Oct, 2021 1 commit
-
-
guoshzhao authored
Description Add CudaDockerBenchmark and RocmDockerBenchmark to support amd and cuda platform for DockerBenchmark.
-
- 21 Oct, 2021 1 commit
-
-
Yifan Xiong authored
Upgrade to latest agent image in pipeline, fix "Ubuntu16 image does not exist" issue.
-
- 09 Jul, 2021 1 commit
-
-
guoshzhao authored
* Bug Fix - Fix race condition issue for multi ranks (#117) Fix race condition issue when multi ranks rotating the same directory. * Update pipeline for release branch (#122) * Bug Fix - Fix bug when convert bool config to store_true argument. (#120) Co-authored-by:Yifan Xiong <yifan.xiong@microsoft.com>
-
- 01 Jun, 2021 1 commit
-
-
guoshzhao authored
-
- 26 May, 2021 3 commits
-
-
Yuting Jiang authored
-
Yifan Xiong authored
* Add integration tests for Ansible playbooks * Add `gpu_vendor` var to bypass gpu mount
-
Yuting Jiang authored
* Unify SB_MICRO_PATH and SB_MICRO_LIB * fix bug of lib path
-
- 23 May, 2021 1 commit
-
-
Yifan Xiong authored
Implement ansible client and runner: * add ansible client * add deploy and check_env playbooks
-
- 18 May, 2021 2 commits
- 21 Apr, 2021 1 commit
-
-
Yifan Xiong authored
Speedup Azure pipeline for CPU unit test.
-
- 14 Apr, 2021 1 commit
-
-
Yifan Xiong authored
* Rename dev branch to main and set it as default.
-
- 13 Apr, 2021 1 commit
-
-
Yifan Xiong authored
* add flags for different pipelines to merge reports * add token for cuda pipeline * update codecov/patch target
-
- 12 Apr, 2021 1 commit
-
-
Yifan Xiong authored
* add azure pipelines for CUDA tests
-