• Yifan Xiong's avatar
    Release - SuperBench v0.10.0 (#607) · 2c88db90
    Yifan Xiong authored
    **Description**
    
    Cherry-pick bug fixes from v0.10.0 to main.
    
    **Major Revisions**
    
    * Benchmarks: Microbenchmark - Support different hipblasLt data types in dist_inference #590
    * Benchmarks: Microbenchmark - Support in-place for NCCL/RCCL benchmark #591
    * Bug Fix - Fix NUMA Domains Swap Issue in NDv4 Topology File #592
    * Benchmarks: Microbenchmark - Add data type option for NCCL and RCCL tests #595
    * Benchmarks: Bug Fix - Make metrics of dist-inference-cpp aligned with PyTorch version #596
    * CI/CD - Add ndv5 topo file #597
    * Benchmarks: Microbenchmark - Improve AMD GPU P2P performance with fine-grained GPU memory #593
    * Benchmarks: Build Pipeline - fix nccl and nccl test version to 2.18.3 to resolve hang issue in cuda12.2 docker #599
    * Dockerfile - Bug fix for rocm docker build and deploy #598
    * Benchmarks: Microbenchmark - Adapt to hipblasLt data type changes #603
    * Benchmarks: Micro benchmarks - Update hipblaslt metric unit to tflops #604
    * Monitor - U...
    2c88db90
README.md 2.79 KB