Commits · 3047ab97ef858ef0faeeaf6e9f43f40b87f0e5fc · OpenDAS / bitsandbytes

19 May, 2025 1 commit
- continuous release: tweaks · 3047ab97
  Titus von Koeller authored May 19, 2025
  
  3047ab97
16 May, 2025 5 commits
- continuous release: tweaks · 31762776
  Titus von Koeller authored May 16, 2025
  
  31762776
- continuous release: tweaks · 4011273a
  Titus von Koeller authored May 16, 2025
  
  4011273a
- continuous release: tweaks · 66c0c454
  Titus von Koeller authored May 16, 2025
  
  66c0c454
- continuous release: tweaks · 90f38acc
  Titus von Koeller authored May 16, 2025
  
  90f38acc
- continuous release: tweaks · 18ead193
  Titus von Koeller authored May 16, 2025
  
  18ead193
15 May, 2025 1 commit
- continuous build: refine + make sure release is always fresh · 5eb35ec9
  Titus von Koeller authored May 15, 2025
  
  5eb35ec9
14 May, 2025 1 commit

Additional CI runners (#1639) · 98eed131

Matthew Douglas authored May 14, 2025

* Improvements for testing suite

* Add workflow for macOS arm64 CPU tests

* Update tests.yml

* Update tests.yml

Use new L4 and CPU runners for testing.

* Update tests.yml

98eed131

13 May, 2025 2 commits
- Improvements to test suite (#1636) · 42bc7291
  Matthew Douglas authored May 13, 2025
```
* Improvements for testing suite

* Add workflow for macOS arm64 CPU tests
```
  42bc7291
- Switch CUDA builds to use Rocky Linux 8 container (#1638) · d870f9c5
  Matthew Douglas authored May 13, 2025
  
  d870f9c5
08 May, 2025 4 commits
- Update tests.yml · 544c203d
  Matthew Douglas authored May 08, 2025
```
Show slow test durations.
```
  544c203d
- Update python-package.yml · a02c4ad8
  Matthew Douglas authored May 08, 2025
```
Fix trailing whitespace.
```
  a02c4ad8
- continuous release: fix for stable download link · f3adf4f6
  Titus von Koeller authored May 08, 2025
  
  f3adf4f6
- continuous release: fix for stable download link · a8a42651
  Titus von Koeller authored May 08, 2025
  
  a8a42651
05 May, 2025 2 commits
- Update nightly workflow · 8b858e4e
  Matthew Douglas authored May 05, 2025
  
  8b858e4e
- Update nightly workflow · 84517609
  Matthew Douglas authored May 05, 2025
  
  84517609
02 May, 2025 2 commits

Linux aarch64 CI updates (#1622) · 49c044b1

Matthew Douglas authored May 02, 2025

* Add aarch64 cpu tests and CUDA build to nightly workflow

* aarch64: limit CUDA targets to sm75, sm80, sm90, sm100

* aarch64: limit CUDA targets to sm75, sm80, sm90, sm100

* Update build cpu script

* fix

* Update auditwheel for aarch64

49c044b1

Use ARM runners to build for Linux aarch64 (#1539) · 8a31eadf

Johnny authored May 02, 2025



* Update python-package.yml

* Update python-package.yml

* Update python-package.yml

* Cleanup

* Matrix update

---------
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

8a31eadf

29 Apr, 2025 1 commit

Set up nightly CI for unit tests (#1619) · a5dd01bb

Matthew Douglas authored Apr 29, 2025

* Run unit tests on GH Actions

* fix

* fix

* trigger workflow

* Update

* Update

* Update

* Run tests nightly

* Disable paged optimizer test on Windows

* Skip unit tests on Windows for CUDA 12.x (driver on runner is too old)

a5dd01bb

22 Apr, 2025 1 commit

Stop building for CUDA toolkit < 11.8 (#1605) · 53daa0e2

Matthew Douglas authored Apr 22, 2025

* Stop building for CUDA toolkit < 11.8

* Simplify

* Drop sm70 from cu128 build targets to align with pytorch

53daa0e2

07 Apr, 2025 1 commit
- fix for missing cpu lib (#1585) · 55b84eea
  Titus authored Apr 07, 2025
  
  55b84eea
27 Mar, 2025 2 commits
- Drop Python 3.8 support. (#1574) · 677ff400
  Matthew Douglas authored Mar 27, 2025
```
* Drop Python 3.8 support.

* Formatting
```
  677ff400
- Bump CUDA 12.8.0 build to CUDA 12.8.1 (#1575) · 9b339952
  Matthew Douglas authored Mar 27, 2025
  
  9b339952
13 Mar, 2025 1 commit
- disable dependabot for now until CI revamp · e772a9e8
  Titus authored Mar 13, 2025
  
  e772a9e8
25 Feb, 2025 1 commit
- Build: use ubuntu-22.04 instead of 24.04 for CPU build (glibc compat) (#1538) · b8223fed
  Matthew Douglas authored Feb 25, 2025
  
  b8223fed
24 Feb, 2025 2 commits
- Update build-cuda.sh · fc6d8b24
  Matthew Douglas authored Feb 24, 2025
  
  fc6d8b24
- Update build-cuda.sh · e4a9a94c
  Matthew Douglas authored Feb 24, 2025
  
  e4a9a94c
28 Jan, 2025 1 commit
- Blackwell binaries! (#1491) · f3e8cbb2
  Johnny authored Jan 28, 2025
```
* blackwell

* blackwell

* Update python-package.yml
```
  f3e8cbb2
23 Jan, 2025 1 commit
- (build) include Ada/Hopper targets in cu118 build (#1487) · b4172770
  Matthew Douglas authored Jan 23, 2025
  
  b4172770
22 Jan, 2025 1 commit

Initial support blackwell (#1481) · db90effe

Johnny authored Jan 22, 2025



* initial support blackwell

* Update CHANGELOG.md
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

* Update CMakeLists.txt

* Update CHANGELOG.md

* fix build-cuda.sh

* fix build-cuda.sh

* fix cuda 12.7 build-cuda.sh

* Update build-cuda.sh

* Update cuda from 12.6.2 to 12.6.3

* Update .github/workflows/python-package.yml
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

* Update install_cuda.py
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

* Update install_cuda.sh
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

* Update .github/scripts/build-cuda.sh

* Update install_cuda.sh

---------
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

db90effe

17 Dec, 2024 1 commit

chore: migrate config files to `pyproject.toml` (#1373) · 5b015890

Saurav Maheshkar authored Dec 17, 2024



* chore: move configs to pyproject.toml

* fix: drop file from CI workflow

* feat: reorder pytest markers

* chore: retain comments

* chore(build): migrate build data to pyproject
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Aarni Koskela <akx@iki.fi>

* chore: move configs to pyproject.toml

* Apply suggestions from code review
Co-authored-by: Aarni Koskela <akx@iki.fi>

* bump ruff

---------
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
Co-authored-by: Aarni Koskela <akx@iki.fi>

5b015890

05 Dec, 2024 1 commit

LLM.int8() Refactoring: Part 1 (#1401) · 81e6345d

Matthew Douglas authored Dec 05, 2024



* Start of int8 refactor: remove col32/col_ampere/col_turing transforms in new igemmlt implementation

* Fix unintended change

* New naive mm_dequant kernel for row-major; cleanup

* fix

* int8 refactor: initial sparse decomp, cleanup

* Int8 refactoring: remove separate NO_CUBLASLT build; more cleanup

* int8: inference optimizations, some cleanup

* int8: more tests passing, cleanup

* int8 - more cleanup, most tests passing

* int8: specify CUDA stream for int8 ops

* perf: reduce overhead from getting cudaStream ptr

* Mark some functions for deprecation.

* int8 sparse decomp: small perf improvement

* update setup.py

* Update bitsandbytes/autograd/_functions.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* Update bitsandbytes/functional.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* Update bitsandbytes/functional.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* Update bitsandbytes/research/autograd/_functions.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* int8 - perf improvement for sparse decomposition inference; deprecate get_tensor_stream() in favor of new private fn

* int8 cleanup

* Ignore ruff rule ISC001 (incompatible with formatter)

* add comment

* int8 more cleanup

* Update bitsandbytes/functional.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* int8: rename / deprecate old fn signatures

* Update bitsandbytes/functional.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* type annotation

* format update

* Update bitsandbytes/research/autograd/_functions.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* cleanup

* Add comment to explain division optimization

* more cleanup

* Update bitsandbytes/functional.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* Update bitsandbytes/functional.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* Update bitsandbytes/functional.py
Co-authored-by: Aarni Koskela <akx@iki.fi>

* cleanup

* Type annotations, cleanup

* remove unused kernels; improved type annotations

* small perf optimization for single-GPU systems

* small perf optimization for single-GPU systems

* update docstrings

* Improve docs and tests

* Update docstring

* Update test

* add benchmarking script

* test cleanup: add deprecated marker, move benchmarks out

* Add int8 dequant function; misc improvements

* int8 matmul fallback for inner dims not divisible by 4

* improve register usage of kInt8VectorQuant - especially for A100/H100

* disable fail-fast for package build

* maxwell compat

* ptxas verbose

* docs update

* doc update

* backward fix

* Bugfix sparse decomp

* Int8 fix for PEFT OLoRA init

* Fix test for deprecated spmm_coo

* test improvement

* doc update

* typo

* doc cleanup

* docs

* add inference benchmark script

* Add benchmarks, doc update

---------
Co-authored-by: Aarni Koskela <akx@iki.fi>

81e6345d

02 Dec, 2024 1 commit

[Build] Add CUDA 12.6.2 build; update 12.5.0 to 12.5.1 (#1431) · 7dca7004

Matthew Douglas authored Dec 02, 2024

* [Build] Add CUDA 12.6.2 build; update 12.5.0 to 12.5.1

* bump cuda-toolkit action version

* Update docs for cuda versions

7dca7004

30 Sep, 2024 3 commits
- omit macos wheels for now · d873fb34
  Titus von Koeller authored Sep 30, 2024
  
  d873fb34
- more descriptive continuous release name · 2a1ff2c0
  Titus von Koeller authored Sep 30, 2024
  
  2a1ff2c0
- tweak continuous release of `main` · 4f198988
  Titus von Koeller authored Sep 30, 2024
  
  4f198988
24 Sep, 2024 1 commit

Add workflow to publish tagged releases to PyPI (#1369) · bdf381c8

Matthew Douglas authored Sep 23, 2024

* CI/CD: Add step to publish wheels on tag creation

* Remove file

* Restrict pre-release workflow branches

* Update PyPI publishing

* Update PyPI publishing

* Update package workflow name

* continuous pre-release only on main

bdf381c8

30 Aug, 2024 1 commit
- actions: update permissions for pr docs publishing · e4674531
  Titus von Koeller authored Aug 30, 2024
  
  e4674531
31 Jul, 2024 1 commit
- packaging: bump permissions for continuous release step · 4be18838
  Titus authored Jul 31, 2024
  
  4be18838
29 Jul, 2024 1 commit
- add job to upload wheels to continuous pre-release (#1282) · b64cbe32
  Titus authored Jul 29, 2024
  
  b64cbe32