    Enable ROCm backend with custom ops integration (#1683) · 888788d7
    pnunna93 authored
    
    
    * Port ROCm changes from multi-backend-refactor branch
    
    * Update ops.py
    
    * Update functional.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update functional.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update test_ops.py
    
    * Update test_functional.py
    
    * Update test_ops.py
    
    * Update test_functional.py
    
    * Update test_functional.py
    
    * Update functional.py
    
    * Update functional.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update test_functional.py
    
    * Update test_functional.py
    
    * Update cextension.py
    
    * Update cuda_specs.py
    
    * Update cuda_specs.py
    
    * Update test_functional.py
    
    * Update test_linear4bit.py
    
    * Update test_cuda_setup_evaluator.py
    
    * Update test_functional.py
    
    * Update modules.py
    
    * Update modules.py
    
    * Update ops.py
    
    * Update test_linear4bit.py
    
    * Update ops.py
    
    * Update ops.py
    
    * Update test_linear4bit.py
    
    * Update test_linear4bit.py
    
    * Update python-package.yml
    
    * Update python-package.yml
    
    * Update python-package.yml
    
    * Update python-package.yml
    
    * Create build-rocm.sh
    
    * Update cuda_specs.py
    
    * Fix trailing whitespace
    
    * Remove conflicts.diff
    
    * update for hipblasVersionMajor >=3
    
    * Update test_functional.py
    
    * Update test_linear4bit.py
    
    * Update test_ops.py
    
    * Update main.py
    
    * Update test_functional.py
    
    * Update test_linear4bit.py
    
    * Update test_ops.py
    
    * Update test_linear4bit.py
    
    * Lint
    
    * Lint
    
    * Update helpers.py
    
    * Update test_functional.py
    
    * Update test_linear4bit.py
    
    * Update test_ops.py
    
    * Lint
    
    * Update pythonInterface.cpp
    
    * lint fix
    
    * lint
    
    * Update pythonInterface.cpp
    
    * revert permissions change
    
    * Fix indentation
    
    * Update kernels_hip.cuh
    
    * Update kernels.hip
    
    * Update ops.hip
    
    * Update ops_hip.cuh
    
    * Update kernels_hip.cuh
    
    * Update kernels.hip
    
    * Update kernels.hip
    
    * Update ops.hip
    
    * Update ops_hip.cuh
    
    * Update ops.hip
    
    * Update CMakeLists.txt
    
    * Update functional.py
    
    * Update cextension.py
    
    * Update cextension.py
    
    ---------
    Co-authored-by: MISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com>
    Co-authored-by: MISHANMAUYRA <mishanmaurya31081@gmail.com>
    Co-authored-by: amcamd <andrew.chapman@amd.com>
    Co-authored-by: Prasanth Nunna <root@banff-cyxtera-s78-1.amd.com>
kernels.hip 121 KB