    Fix for warpSize deprecation in ROCm 7.0 (#1762) · b72b766e
    pnunna93 authored
    
    
    * Port ROCm changes from multi-backend-refactor branch
    
    * Update ops.py
    
    * Update functional.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update functional.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update test_ops.py
    
    * Update test_functional.py
    
    * Update test_ops.py
    
    * Update test_functional.py
    
    * Update test_functional.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update test_functional.py
    
    * Update test_functional.py
    
    * Update cextension.py
    
    * Update cuda_specs.py
    
    * Update cuda_specs.py
    
    * Update test_functional.py
    
    * Update test_linear4bit.py
    
    * Update test_cuda_setup_evaluator.py
    
    * Update test_functional.py
    
    * Update modules.py
    
    * Update modules.py
    
    * Update ops.py
    
    * Update test_linear4bit.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update test_linear4bit.py
    
    * Update test_linear4bit.py
    
    * Update python-package.yml
    
    * Update python-package.yml
    
    * Update python-package.yml
    
    * Update python-package.yml
    
    * Create build-rocm.sh
    
    * Update cuda_specs.py
    
    * Fix trailing whitespace
    
    * Remove conflicts.diff
    
    * update for hipblasVersionMajor >=3
    
    * Update test_functional.py
    
    * Update test_linear4bit.py
    
    * Update test_ops.py
    
    * Update main.py
    
    * Update test_functional.py
    
    * Update test_linear4bit.py
    
    * Update test_ops.py
    
    * Update test_linear4bit.py
    
    * Lint
    
    * Lint
    
    * Update helpers.py
    
    * Update test_functional.py
    
    * Update test_linear4bit.py
    
    * Update test_ops.py
    
    * Lint
    
    * Update pythonInterface.cpp
    
    * lint fix
    
    * lint
    
    * Update pythonInterface.cpp
    
    * revert permissions change
    
    * Fix indentation
    
    * Update kernels_hip.cuh
    
    * Update kernels.hip
    
    * Update ops.hip
    
    * Update ops_hip.cuh
    
    * Update kernels_hip.cuh
    
    * Update kernels.hip
    
    * Update kernels.hip
    
    * Update ops.hip
    
    * Update ops_hip.cuh
    
    * Update ops.hip
    
    * Update CMakeLists.txt
    
    * Update functional.py
    
    * Update cextension.py
    
    * Update cextension.py
    
    * warpSize is being made non-constexpr in ROCm 7.0
    
    * Merge pull request #90 from ROCm/IFU-rocm_enabled-09-23-2025
    
    Ifu rocm enabled 09 23 2025
    
    * Fix typo
    
    * unskip test_4bit_quant
    
    ---------
    Co-authored-by: MISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com>
    Co-authored-by: MISHANMAUYRA <mishanmaurya31081@gmail.com>
    Co-authored-by: amcamd <andrew.chapman@amd.com>
    Co-authored-by: Prasanth Nunna <root@banff-cyxtera-s78-1.amd.com>
    Co-authored-by: sstamenk <strahinja.stamenkovic@amd.com>
ops.hip 36.9 KB