Commits · bdb8b2b7f1bd1a56d20889e87d56302e46000ad8 · OpenDAS / bitsandbytes

23 Sep, 2025 1 commit

Add CUDA 13.0 Support (#1761) · bdb8b2b7

Matthew Douglas authored Sep 23, 2025

* CUDA 13 build enablement

* Try to fix Windows build workflow

* Add torch 2.9+cu130 to tests

* Fix python version

* Update test workflow

* Don't test CPU on torch 2.9 yet

* Update doc

bdb8b2b7

16 Sep, 2025 1 commit

Bump minimum PyTorch to 2.3 (#1754) · c9bce2b4

Matthew Douglas authored Sep 16, 2025

* Bump minimum PyTorch to 2.3

* Tests: Fix Windows numpy<2 compatibility for torch<2.4.1

c9bce2b4

15 Sep, 2025 1 commit

Add SYCL Kernels for XPU backend (#1679) · 1813b058

Liu Xiaoli authored Sep 15, 2025



* Add SYCL Kernels for XPU backend

* fix transpose
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix log and format
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* revert cpu changes
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* clean ipex_xpu
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* clean ipex import
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix ipex cpu import
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix comments
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* refine gemv_4bit kernel

* enable FP4 for dequant_4bit and gemv_4bit

* refine FP4 dequantization performance

* remove check for better performance
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* clean code

* fix tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* rm comments
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix memory issue

* fix ut failure

* adjust threshold
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix xpu check
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* change test_functional check
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix test_module
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix device check
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* Enable Windows build and refine code

* fix xpu log
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* remove ipex entirely
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix cpu int8 CB
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix lint
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix logs (#12)

* fix logs
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* Fix sycl lint error and tests (#13)

* fix sycl nd
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* skip typo check for xpu kernel codes (#14)

* skip test for xpu ops
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix lint
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* skip typo for xpu
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* skip
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* skip
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* register triton kernel for quantization (#15)
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* Fix version comparison issue (#18)

# Description

The version comparison expression miss reference the .release property from the version object. This lead to compare between the tuple and the string

# Error message
```
The 8-bit optimizer is not available on your device, only available on CUDA for now.
🦥 Unsloth: Will patch your computer to enable 2x faster free finetuning.
Traceback (most recent call last):
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/unsloth_validation/run.py", line 1, in <module>
    import unsloth
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/__init__.py", line 235, in <module>
    from .models import *
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/models/__init__.py", line 15, in <module>
    from .llama     import FastLlamaModel
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/models/llama.py", line 23, in <module>
    from ._utils import *
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/models/_utils.py", line 89, in <module>
    from unsloth_zoo.patching_utils import (
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth_zoo/patching_utils.py", line 629, in <module>
    import transformers.integrations.bitsandbytes
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/transformers/integrations/bitsandbytes.py", line 20, in <module>
    import bitsandbytes as bnb
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/bitsandbytes/bitsandbytes/__init__.py", line 39, in <module>
    from .backends.xpu import ops as xpu_ops
  File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/bitsandbytes/bitsandbytes/backends/xpu/ops.py", line 17, in <module>
    if version.parse(torch.__version__).release >= version.parse("2.9"):
TypeError: '>=' not supported between instances of 'tuple' and 'Version'
```

---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: Er-Xin (Edwin) Shang <shangerxin@hotmail.com>

1813b058

09 Sep, 2025 1 commit

Test improvements (#1750) · 6a07ffe0

Matthew Douglas authored Sep 09, 2025

* Test suite improvements for MPS/XPU/HPU

* Skip test on torch==2.8.0+cpu for Windows regression

6a07ffe0

11 Aug, 2025 2 commits
- Restore temporary changes from release · 7bfe923c
  Matthew Douglas authored Aug 11, 2025
  
  7bfe923c
- Temporary updates for release · 59593890
  Matthew Douglas authored Aug 11, 2025
  
  59593890
01 Jul, 2025 1 commit
- CI: Test with PyTorch 2.8.0 RC (#1693) · ed398d28
  Matthew Douglas authored Jul 01, 2025
```
* Add torch 2.8 rc / 2.9 nightly to tests

* Update tests.yml

* Update tests.yml
```
  ed398d28
30 Jun, 2025 1 commit
- Temporarily disable HPU tests · 6d0a5cd2
  Matthew Douglas authored Jun 30, 2025
  
  6d0a5cd2
27 Jun, 2025 1 commit

Add CUDA 12.9 build (#1689) · 1abd5e78

Matthew Douglas authored Jun 27, 2025

* Add CUDA 12.9 to build/test workflows

* Downgrade Jimver/cuda-toolkit to v0.2.24

* Update python-package.yml

* Update python-package.yml

* Update python-package.yml

* Update tests.yml

* Update tests.yml

1abd5e78

20 Jun, 2025 1 commit

Enable ROCm backend with custom ops integration (#1683) · 888788d7

pnunna93 authored Jun 20, 2025



* Port ROCm changes from multi-backend-refactor branch

* Update ops.py

* Update functional.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update functional.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update functional.py

* Update functional.py

* Update functional.py

* Update functional.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update ops.py

* Update functional.py

* Update functional.py

* Update functional.py

* Update test_ops.py

* Update test_functional.py

* Update test_ops.py

* Update test_functional.py

* Update test_functional.py

* Update functional.py

* Update functional.py

* Update ops.py

* Update ops.py

* Update test_functional.py

* Update test_functional.py

* Update cextension.py

* Update cuda_specs.py

* Update cuda_specs.py

* Update test_functional.py

* Update test_linear4bit.py

* Update test_cuda_setup_evaluator.py

* Update test_functional.py

* Update modules.py

* Update modules.py

* Update ops.py

* Update test_linear4bit.py

* Update ops.py

* Update ops.py

* Update test_linear4bit.py

* Update test_linear4bit.py

* Update python-package.yml

* Update python-package.yml

* Update python-package.yml

* Update python-package.yml

* Create build-rocm.sh

* Update cuda_specs.py

* Fix trailing whitespace

* Remove conflicts.diff

* update for hipblasVersionMajor >=3

* Update test_functional.py

* Update test_linear4bit.py

* Update test_ops.py

* Update main.py

* Update test_functional.py

* Update test_linear4bit.py

* Update test_ops.py

* Update test_linear4bit.py

* Lint

* Lint

* Update helpers.py

* Update test_functional.py

* Update test_linear4bit.py

* Update test_ops.py

* Lint

* Update pythonInterface.cpp

* lint fix

* lint

* Update pythonInterface.cpp

* revert permissions change

* Fix indentation

* Update kernels_hip.cuh

* Update kernels.hip

* Update ops.hip

* Update ops_hip.cuh

* Update kernels_hip.cuh

* Update kernels.hip

* Update kernels.hip

* Update ops.hip

* Update ops_hip.cuh

* Update ops.hip

* Update CMakeLists.txt

* Update functional.py

* Update cextension.py

* Update cextension.py

---------
Co-authored-by: MISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com>
Co-authored-by: MISHANMAUYRA <mishanmaurya31081@gmail.com>
Co-authored-by: amcamd <andrew.chapman@amd.com>
Co-authored-by: Prasanth Nunna <root@banff-cyxtera-s78-1.amd.com>

888788d7

17 Jun, 2025 1 commit

CI: Setup HPU nightly tests (#1681) · 29564ad6

Matthew Douglas authored Jun 17, 2025

* Setup XPU CI

* CI: expand XPU matrix

* test

* test

* test

* test

* test

* test

* test

* test

* test

* test

* skip some fp4 tests on hpu

* skip some fp4 tests on hpu

* skip gemv tests on hpu

* test

* Additional test patches for HPU

* HPU test update

* HPU test update

* HPU test update

* HPU test update

* Format

29564ad6

05 Jun, 2025 1 commit
- CI workflow: bump torch 2.7.0 to 2.7.1 (#1670) · e9fc96a2
  Matthew Douglas authored Jun 05, 2025
  
  e9fc96a2
02 Jun, 2025 1 commit

Add CPU + IPEX to nightly CI (#1667) · 318a86e3

Matthew Douglas authored Jun 02, 2025

* Tests: add linux x64 cpu+ipex to nightly CI workflow

* typo

* Tests: guard linear8bit compile test for ipex cpu issue

318a86e3

24 May, 2025 2 commits

Add torch.compile tests (#1648) · 9f858294

Matthew Douglas authored May 23, 2025

* Add torch.compile tests

* Tests: WA aarch64 CPU regressions for torch 2.6.0; add Windows torch==2.7.0+cu118 test config

* Tests: skip torch.compile for cuda on windows

9f858294

General cleanup & test improvements (#1646) · 503d243e

Matthew Douglas authored May 23, 2025

* General cleanup & test improvements

* Tests: WA numpy 2 compat issue for torch<2.3

* Tests: update aarch64 cpu min torch version

* Tests: update aarch64 cpu min torch version

* Tests: update aarch64 cpu min torch version

503d243e

19 May, 2025 3 commits
- CI runner updates (#1643) · cdcae8d3
  Matthew Douglas authored May 19, 2025
```
* Test g5g runner

* Switch L4 to L40S runner; swap GitHub Linux T4 runner for AWS g4dn

* Run tests on last 2 pytorch stable releases

* Run tests on last 2 pytorch stable releases
```
  cdcae8d3
- continuous release: tweak + docs · 513e69be
  Titus von Koeller authored May 19, 2025
  
  513e69be
- continuous release: tweaks · 3047ab97
  Titus von Koeller authored May 19, 2025
  
  3047ab97
16 May, 2025 5 commits
- continuous release: tweaks · 31762776
  Titus von Koeller authored May 16, 2025
  
  31762776
- continuous release: tweaks · 4011273a
  Titus von Koeller authored May 16, 2025
  
  4011273a
- continuous release: tweaks · 66c0c454
  Titus von Koeller authored May 16, 2025
  
  66c0c454
- continuous release: tweaks · 90f38acc
  Titus von Koeller authored May 16, 2025
  
  90f38acc
- continuous release: tweaks · 18ead193
  Titus von Koeller authored May 16, 2025
  
  18ead193
15 May, 2025 1 commit
- continuous build: refine + make sure release is always fresh · 5eb35ec9
  Titus von Koeller authored May 15, 2025
  
  5eb35ec9
14 May, 2025 1 commit

Additional CI runners (#1639) · 98eed131

Matthew Douglas authored May 14, 2025

* Improvements for testing suite

* Add workflow for macOS arm64 CPU tests

* Update tests.yml

* Update tests.yml

Use new L4 and CPU runners for testing.

* Update tests.yml

98eed131

13 May, 2025 1 commit
- Improvements to test suite (#1636) · 42bc7291
  Matthew Douglas authored May 13, 2025
```
* Improvements for testing suite

* Add workflow for macOS arm64 CPU tests
```
  42bc7291
08 May, 2025 4 commits
- Update tests.yml · 544c203d
  Matthew Douglas authored May 08, 2025
```
Show slow test durations.
```
  544c203d
- Update python-package.yml · a02c4ad8
  Matthew Douglas authored May 08, 2025
```
Fix trailing whitespace.
```
  a02c4ad8
- continuous release: fix for stable download link · f3adf4f6
  Titus von Koeller authored May 08, 2025
  
  f3adf4f6
- continuous release: fix for stable download link · a8a42651
  Titus von Koeller authored May 08, 2025
  
  a8a42651
05 May, 2025 2 commits
- Update nightly workflow · 8b858e4e
  Matthew Douglas authored May 05, 2025
  
  8b858e4e
- Update nightly workflow · 84517609
  Matthew Douglas authored May 05, 2025
  
  84517609
02 May, 2025 2 commits

Linux aarch64 CI updates (#1622) · 49c044b1

Matthew Douglas authored May 02, 2025

* Add aarch64 cpu tests and CUDA build to nightly workflow

* aarch64: limit CUDA targets to sm75, sm80, sm90, sm100

* aarch64: limit CUDA targets to sm75, sm80, sm90, sm100

* Update build cpu script

* fix

* Update auditwheel for aarch64

49c044b1

Use ARM runners to build for Linux aarch64 (#1539) · 8a31eadf

Johnny authored May 02, 2025



* Update python-package.yml

* Update python-package.yml

* Update python-package.yml

* Cleanup

* Matrix update

---------
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

8a31eadf

29 Apr, 2025 1 commit

Set up nightly CI for unit tests (#1619) · a5dd01bb

Matthew Douglas authored Apr 29, 2025

* Run unit tests on GH Actions

* fix

* fix

* trigger workflow

* Update

* Update

* Update

* Run tests nightly

* Disable paged optimizer test on Windows

* Skip unit tests on Windows for CUDA 12.x (driver on runner is too old)

a5dd01bb

22 Apr, 2025 1 commit

Stop building for CUDA toolkit < 11.8 (#1605) · 53daa0e2

Matthew Douglas authored Apr 22, 2025

* Stop building for CUDA toolkit < 11.8

* Simplify

* Drop sm70 from cu128 build targets to align with pytorch

53daa0e2

07 Apr, 2025 1 commit
- fix for missing cpu lib (#1585) · 55b84eea
  Titus authored Apr 07, 2025
  
  55b84eea
27 Mar, 2025 2 commits
- Drop Python 3.8 support. (#1574) · 677ff400
  Matthew Douglas authored Mar 27, 2025
```
* Drop Python 3.8 support.

* Formatting
```
  677ff400
- Bump CUDA 12.8.0 build to CUDA 12.8.1 (#1575) · 9b339952
  Matthew Douglas authored Mar 27, 2025
  
  9b339952
25 Feb, 2025 1 commit
- Build: use ubuntu-22.04 instead of 24.04 for CPU build (glibc compat) (#1538) · b8223fed
  Matthew Douglas authored Feb 25, 2025
  
  b8223fed