1. 23 Oct, 2025 1 commit
  2. 29 Sep, 2025 1 commit
  3. 24 Sep, 2025 1 commit
    • pnunna93's avatar
      Fix for warpSize deprecation in ROCm 7.0 (#1762) · b72b766e
      pnunna93 authored
      
      
      * Port ROCm changes from multi-backend-refactor branch
      
      * Update ops.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update test_ops.py
      
      * Update test_functional.py
      
      * Update test_ops.py
      
      * Update test_functional.py
      
      * Update test_functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update test_functional.py
      
      * Update test_functional.py
      
      * Update cextension.py
      
      * Update cuda_specs.py
      
      * Update cuda_specs.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_cuda_setup_evaluator.py
      
      * Update test_functional.py
      
      * Update modules.py
      
      * Update modules.py
      
      * Update ops.py
      
      * Update test_linear4bit.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update test_linear4bit.py
      
      * Update test_linear4bit.py
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Create build-rocm.sh
      
      * Update cuda_specs.py
      
      * Fix trailing whitespace
      
      * Remove conflicts.diff
      
      * update for hipblasVersionMajor >=3
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Update main.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Update test_linear4bit.py
      
      * Lint
      
      * Lint
      
      * Update helpers.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Lint
      
      * Update pythonInterface.cpp
      
      * lint fix
      
      * lint
      
      * Update pythonInterface.cpp
      
      * revert permissions change
      
      * Fix indentation
      
      * Update kernels_hip.cuh
      
      * Update kernels.hip
      
      * Update ops.hip
      
      * Update ops_hip.cuh
      
      * Update kernels_hip.cuh
      
      * Update kernels.hip
      
      * Update kernels.hip
      
      * Update ops.hip
      
      * Update ops_hip.cuh
      
      * Update ops.hip
      
      * Update CMakeLists.txt
      
      * Update functional.py
      
      * Update cextension.py
      
      * Update cextension.py
      
      * warpSize is being made non constexpr in ROCm 7.0
      
      * Merge pull request #90 from ROCm/IFU-rocm_enabled-09-23-2025
      
      Ifu rocm enabled 09 23 2025
      
      * Fix typo
      
      * unskip test_4bit_quant
      
      ---------
      Co-authored-by: default avatarMISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com>
      Co-authored-by: default avatarMISHANMAUYRA <mishanmaurya31081@gmail.com>
      Co-authored-by: default avataramcamd <andrew.chapman@amd.com>
      Co-authored-by: default avatarPrasanth Nunna <root@banff-cyxtera-s78-1.amd.com>
      Co-authored-by: default avatarsstamenk <strahinja.stamenkovic@amd.com>
      b72b766e
  4. 16 Sep, 2025 1 commit
  5. 15 Sep, 2025 3 commits
  6. 09 Sep, 2025 1 commit
  7. 08 Sep, 2025 2 commits
  8. 02 Sep, 2025 1 commit
  9. 02 Aug, 2025 2 commits
  10. 14 Jul, 2025 6 commits
  11. 20 Jun, 2025 1 commit
    • pnunna93's avatar
      Enable ROCm backend with custom ops integration (#1683) · 888788d7
      pnunna93 authored
      
      
      * Port ROCm changes from multi-backend-refactor branch
      
      * Update ops.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update test_ops.py
      
      * Update test_functional.py
      
      * Update test_ops.py
      
      * Update test_functional.py
      
      * Update test_functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update test_functional.py
      
      * Update test_functional.py
      
      * Update cextension.py
      
      * Update cuda_specs.py
      
      * Update cuda_specs.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_cuda_setup_evaluator.py
      
      * Update test_functional.py
      
      * Update modules.py
      
      * Update modules.py
      
      * Update ops.py
      
      * Update test_linear4bit.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update test_linear4bit.py
      
      * Update test_linear4bit.py
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Create build-rocm.sh
      
      * Update cuda_specs.py
      
      * Fix trailing whitespace
      
      * Remove conflicts.diff
      
      * update for hipblasVersionMajor >=3
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Update main.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Update test_linear4bit.py
      
      * Lint
      
      * Lint
      
      * Update helpers.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Lint
      
      * Update pythonInterface.cpp
      
      * lint fix
      
      * lint
      
      * Update pythonInterface.cpp
      
      * revert permissions change
      
      * Fix indentation
      
      * Update kernels_hip.cuh
      
      * Update kernels.hip
      
      * Update ops.hip
      
      * Update ops_hip.cuh
      
      * Update kernels_hip.cuh
      
      * Update kernels.hip
      
      * Update kernels.hip
      
      * Update ops.hip
      
      * Update ops_hip.cuh
      
      * Update ops.hip
      
      * Update CMakeLists.txt
      
      * Update functional.py
      
      * Update cextension.py
      
      * Update cextension.py
      
      ---------
      Co-authored-by: default avatarMISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com>
      Co-authored-by: default avatarMISHANMAUYRA <mishanmaurya31081@gmail.com>
      Co-authored-by: default avataramcamd <andrew.chapman@amd.com>
      Co-authored-by: default avatarPrasanth Nunna <root@banff-cyxtera-s78-1.amd.com>
      888788d7
  12. 18 Jun, 2025 1 commit
  13. 17 Jun, 2025 1 commit
    • Matthew Douglas's avatar
      CI: Setup HPU nightly tests (#1681) · 29564ad6
      Matthew Douglas authored
      * Setup XPU CI
      
      * CI: expand XPU matrix
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * skip some fp4 tests on hpu
      
      * skip some fp4 tests on hpu
      
      * skip gemv tests on hpu
      
      * test
      
      * Additional test patches for HPU
      
      * HPU test update
      
      * HPU test update
      
      * HPU test update
      
      * HPU test update
      
      * Format
      29564ad6
  14. 16 Jun, 2025 1 commit
  15. 11 Jun, 2025 2 commits
  16. 08 Jun, 2025 1 commit
  17. 06 Jun, 2025 1 commit
  18. 04 Jun, 2025 1 commit
    • Matthew Douglas's avatar
      Deprecation cleanup (#1669) · 849d9449
      Matthew Douglas authored
      * Deprecation cleanup: remove histogram_scatter_add_2d
      
      * Deprecation cleanup: vectorwise_mm_dequant
      
      * Deprecation cleanup: vectorwise_quant
      
      * Remove unused test
      
      * Optimizer test cleanup
      
      * Deprecations: remove estimate_quantiles, create_quantile_map
      
      * Move deprecated test
      849d9449
  19. 03 Jun, 2025 1 commit
  20. 02 Jun, 2025 2 commits
    • Matthew Douglas's avatar
      Add CPU + IPEX to nightly CI (#1667) · 318a86e3
      Matthew Douglas authored
      * Tests: add linux x64 cpu+ipex to nightly CI workflow
      
      * typo
      
      * Tests: guard linear8bit compile test for ipex cpu issue
      318a86e3
    • Matthew Douglas's avatar
      Fix CI regression (#1666) · 945f7c1d
      Matthew Douglas authored
      * Tests: xfail opcheck for 4bit quantization with floating storage dtypes
      
      * Tests: xfail opcheck for 4bit quantization with floating storage dtypes
      
      * Tests: skip test_gemv_eye_4bit on CPU with bf16 when not supported by torch
      
      * Tests: skip test_gemv_eye_4bit on CPU with bf16 when not supported by torch
      945f7c1d
  21. 28 May, 2025 1 commit
  22. 24 May, 2025 2 commits
    • Matthew Douglas's avatar
      Add torch.compile tests (#1648) · 9f858294
      Matthew Douglas authored
      * Add torch.compile tests
      
      * Tests: WA aarch64 CPU regressions for torch 2.6.0; add Windows torch==2.7.0+cu118 test config
      
      * Tests: skip torch.compile for cuda on windows
      9f858294
    • Matthew Douglas's avatar
      General cleanup & test improvements (#1646) · 503d243e
      Matthew Douglas authored
      * General cleanup & test improvements
      
      * Tests: WA numpy 2 compat issue for torch<2.3
      
      * Tests: update aarch64 cpu min torch version
      
      * Tests: update aarch64 cpu min torch version
      
      * Tests: update aarch64 cpu min torch version
      503d243e
  23. 21 May, 2025 1 commit
  24. 19 May, 2025 1 commit
    • Matthew Douglas's avatar
      CI runner updates (#1643) · cdcae8d3
      Matthew Douglas authored
      * Test g5g runner
      
      * Switch L4 to L40S runner; swap GitHub Linux T4 runner for AWS g4dn
      
      * Run tests on last 2 pytorch stable releases
      
      * Run tests on last 2 pytorch stable releases
      cdcae8d3
  25. 13 May, 2025 1 commit
  26. 29 Apr, 2025 1 commit
    • Matthew Douglas's avatar
      Set up nightly CI for unit tests (#1619) · a5dd01bb
      Matthew Douglas authored
      * Run unit tests on GH Actions
      
      * fix
      
      * fix
      
      * trigger workflow
      
      * Update
      
      * Update
      
      * Update
      
      * Run tests nightly
      
      * Disable paged optimizer test on Windows
      
      * Skip unit tests on Windows for CUDA 12.x (driver on runner is too old)
      a5dd01bb
  27. 28 Apr, 2025 1 commit
  28. 22 Apr, 2025 1 commit
    • Matthew Douglas's avatar
      Updates for device agnosticism (#1601) · 1088ec52
      Matthew Douglas authored
      * Include device support tags for transformers multi-backend compatability; add xpu() and cpu() to Params4bit
      
      * Make test suite more device-agnostic
      
      * Additional device agnostic tests
      
      * Additional device agnosticism for tests
      
      * Add BNB_TEST_DEVICE env var to manually select device for unit tests
      
      * Include device support tags for transformers multi-backend compatability; add xpu() and cpu() to Params4bit
      
      * Make test suite more device-agnostic
      
      * Additional device agnostic tests
      
      * Additional device agnosticism for tests
      
      * Add BNB_TEST_DEVICE env var to manually select device for unit tests
      
      * Small bugfix for int8 test
      
      * Exclude backward() from code coverage reports
      
      * Params4bit: don't try to quantize when moving to meta device
      1088ec52