1. 23 Sep, 2025 1 commit
    • Matthew Douglas's avatar
      Add CUDA 13.0 Support (#1761) · bdb8b2b7
      Matthew Douglas authored
      * CUDA 13 build enablement
      
      * Try to fix Windows build workflow
      
      * Add torch 2.9+cu130 to tests
      
      * Fix python version
      
      * Update test workflow
      
      * Don't test CPU on torch 2.9 yet
      
      * Update doc
      bdb8b2b7
  2. 16 Sep, 2025 1 commit
  3. 15 Sep, 2025 1 commit
  4. 09 Sep, 2025 1 commit
  5. 11 Aug, 2025 2 commits
  6. 01 Jul, 2025 1 commit
  7. 30 Jun, 2025 1 commit
  8. 27 Jun, 2025 1 commit
    • Matthew Douglas's avatar
      Add CUDA 12.9 build (#1689) · 1abd5e78
      Matthew Douglas authored
      * Add CUDA 12.9 to build/test workflows
      
      * Downgrade Jimver/cuda-toolkit to v0.2.24
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update tests.yml
      
      * Update tests.yml
      1abd5e78
  9. 20 Jun, 2025 1 commit
    • pnunna93's avatar
      Enable ROCm backend with custom ops integration (#1683) · 888788d7
      pnunna93 authored
      
      
      * Port ROCm changes from multi-backend-refactor branch
      
      * Update ops.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update test_ops.py
      
      * Update test_functional.py
      
      * Update test_ops.py
      
      * Update test_functional.py
      
      * Update test_functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update test_functional.py
      
      * Update test_functional.py
      
      * Update cextension.py
      
      * Update cuda_specs.py
      
      * Update cuda_specs.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_cuda_setup_evaluator.py
      
      * Update test_functional.py
      
      * Update modules.py
      
      * Update modules.py
      
      * Update ops.py
      
      * Update test_linear4bit.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update test_linear4bit.py
      
      * Update test_linear4bit.py
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Create build-rocm.sh
      
      * Update cuda_specs.py
      
      * Fix trailing whitespace
      
      * Remove conflicts.diff
      
      * update for hipblasVersionMajor >=3
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Update main.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Update test_linear4bit.py
      
      * Lint
      
      * Lint
      
      * Update helpers.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Lint
      
      * Update pythonInterface.cpp
      
      * lint fix
      
      * lint
      
      * Update pythonInterface.cpp
      
      * revert permissions change
      
      * Fix indentation
      
      * Update kernels_hip.cuh
      
      * Update kernels.hip
      
      * Update ops.hip
      
      * Update ops_hip.cuh
      
      * Update kernels_hip.cuh
      
      * Update kernels.hip
      
      * Update kernels.hip
      
      * Update ops.hip
      
      * Update ops_hip.cuh
      
      * Update ops.hip
      
      * Update CMakeLists.txt
      
      * Update functional.py
      
      * Update cextension.py
      
      * Update cextension.py
      
      ---------
      Co-authored-by: default avatarMISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com>
      Co-authored-by: default avatarMISHANMAUYRA <mishanmaurya31081@gmail.com>
      Co-authored-by: default avataramcamd <andrew.chapman@amd.com>
      Co-authored-by: default avatarPrasanth Nunna <root@banff-cyxtera-s78-1.amd.com>
      888788d7
  10. 17 Jun, 2025 1 commit
    • Matthew Douglas's avatar
      CI: Setup HPU nightly tests (#1681) · 29564ad6
      Matthew Douglas authored
      * Setup XPU CI
      
      * CI: expand XPU matrix
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * skip some fp4 tests on hpu
      
      * skip some fp4 tests on hpu
      
      * skip gemv tests on hpu
      
      * test
      
      * Additional test patches for HPU
      
      * HPU test update
      
      * HPU test update
      
      * HPU test update
      
      * HPU test update
      
      * Format
      29564ad6
  11. 05 Jun, 2025 1 commit
  12. 02 Jun, 2025 1 commit
  13. 24 May, 2025 2 commits
    • Matthew Douglas's avatar
      Add torch.compile tests (#1648) · 9f858294
      Matthew Douglas authored
      * Add torch.compile tests
      
      * Tests: WA aarch64 CPU regressions for torch 2.6.0; add Windows torch==2.7.0+cu118 test config
      
      * Tests: skip torch.compile for cuda on windows
      9f858294
    • Matthew Douglas's avatar
      General cleanup & test improvements (#1646) · 503d243e
      Matthew Douglas authored
      * General cleanup & test improvements
      
      * Tests: WA numpy 2 compat issue for torch<2.3
      
      * Tests: update aarch64 cpu min torch version
      
      * Tests: update aarch64 cpu min torch version
      
      * Tests: update aarch64 cpu min torch version
      503d243e
  14. 19 May, 2025 3 commits
  15. 16 May, 2025 5 commits
  16. 15 May, 2025 1 commit
  17. 14 May, 2025 1 commit
    • Matthew Douglas's avatar
      Additional CI runners (#1639) · 98eed131
      Matthew Douglas authored
      * Improvements for testing suite
      
      * Add workflow for macOS arm64 CPU tests
      
      * Update tests.yml
      
      * Update tests.yml
      
      Use new L4 and CPU runners for testing.
      
      * Update tests.yml
      98eed131
  18. 13 May, 2025 1 commit
  19. 08 May, 2025 4 commits
  20. 05 May, 2025 2 commits
  21. 02 May, 2025 2 commits
  22. 29 Apr, 2025 1 commit
    • Matthew Douglas's avatar
      Set up nightly CI for unit tests (#1619) · a5dd01bb
      Matthew Douglas authored
      * Run unit tests on GH Actions
      
      * fix
      
      * fix
      
      * trigger workflow
      
      * Update
      
      * Update
      
      * Update
      
      * Run tests nightly
      
      * Disable paged optimizer test on Windows
      
      * Skip unit tests on Windows for CUDA 12.x (driver on runner is too old)
      a5dd01bb
  23. 22 Apr, 2025 1 commit
  24. 07 Apr, 2025 1 commit
  25. 27 Mar, 2025 2 commits
  26. 25 Feb, 2025 1 commit