Commits · 2336a45cedde1a7b9909c586aa8793b4eb8d00c4 · OpenDAS / bitsandbytes

01 Feb, 2024 2 commits

Aarni Koskela authored Feb 01, 2024

* test_nvidia_transform: fix variable reference

`out_order` is the global parametrization list, not the test fixture argument

* Make `parametrize` use more idiomatic

* Use a more deterministic helper for `dim*` determination

* Convert NO_CUBLASLT errors into skips too

* Mark slow and benchmark tests as such (allows `-k "not benchmark"`)
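
The last two bullets can be pictured with a small pytest sketch. It is illustrative only: the sentinel string, the decorator name `skip_if_no_cublaslt`, and the sample test are assumptions, not code taken from this commit. It shows an error that mentions the sentinel being turned into a skip, and a test tagged so that `-k "not benchmark"` deselects it.

```python
import functools

import pytest

# Assumption: the error raised when cuBLASLt is unavailable mentions this text.
NO_CUBLASLT_SENTINEL = "NO_CUBLASLT"


def skip_if_no_cublaslt(test_fn):
    """Hypothetical decorator: report sentinel errors as skips, not failures."""

    @functools.wraps(test_fn)
    def wrapper(*args, **kwargs):
        try:
            return test_fn(*args, **kwargs)
        except Exception as exc:
            if NO_CUBLASLT_SENTINEL in str(exc):
                pytest.skip("cuBLASLt is not available on this device")
            raise  # unrelated errors still fail the test

    return wrapper


@pytest.mark.slow
@pytest.mark.benchmark
@skip_if_no_cublaslt
def test_transform_benchmark():
    # Placeholder body; a real benchmark would exercise the kernel here.
    pass
```

Marker names become test keywords, so `pytest -k "not benchmark"` leaves out tests marked this way. Custom markers such as `slow` and `benchmark` should also be registered in the pytest configuration to avoid unknown-marker warnings.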

2336a45c

test_nvidia_transform: fix variable reference (#1000) · 1a0dc5c3
Aarni Koskela authored Feb 01, 2024
```
`out_order` is the global parametrization list, not the test fixture argument
```

30 Jan, 2024 1 commit

Ruff fixes (#984) · 706ec24d

Aarni Koskela authored Jan 30, 2024



* Adjust Ruff configuration

* do not autofix always
* be less strict around tests and benchmarks
* adjust ignores for now

* Ruff: autofix I and F401

* Apply ruff autofixes

* Fix RUF013 complaint

* Fix mutable default in replace_linear

* Don't use bare except

* Wrap bitsandbytes.__main__ entrypoint in function; fix "sensible" typo

* Fix ruff B008 (function call in arguments)

* Add ruff noqas as suitable

* Fix RUF005 (splat instead of concatenating)

* Fix B018 (useless expression)

* Add pre-commit configuration + GitHub Actions lint workflow

* Fix unused `e` in bitsandbytes/__main__.py

* fix merge conflict resolution error

* run pre-commit hook

---------
Co-authored-by: Titus <9048635+Titus-von-Koeller@users.noreply.github.com>
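
For readers unfamiliar with the rules named above, the following sketch shows the kind of change that a mutable-default fix (flake8-bugbear B006), RUF013 (implicit `Optional`) and RUF005 (unpacking instead of concatenation) typically involve. The function names are hypothetical and this is not the actual `replace_linear` signature.

```python
from typing import List, Optional


def collect_skipped_bad(name, skip_modules=["lm_head"]):
    # The default list is created once and shared across calls,
    # so every call that omits skip_modules mutates the same object.
    skip_modules.append(name)
    return skip_modules


def collect_skipped(name: str, skip_modules: Optional[List[str]] = None) -> List[str]:
    # Explicit Optional[...] satisfies RUF013; None stands in for the default.
    if skip_modules is None:
        skip_modules = ["lm_head"]
    # RUF005 prefers unpacking over `skip_modules + [name]`; it also
    # builds a new list instead of mutating the caller's argument.
    return [*skip_modules, name]
```

Calling `collect_skipped_bad` twice without a second argument returns a list that keeps growing, while `collect_skipped` starts from a fresh default each time.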


24 Jan, 2024 1 commit

Tests: improve CUDA support detection (#985) · f1c75741

Aarni Koskela authored Jan 24, 2024

* implicitly skip any test that implicitly uses CUDA on a non-CUDA box
* add a `requires_cuda` fixture
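
A minimal sketch of what a `requires_cuda` fixture could look like follows. Only the fixture name comes from the commit message; the helper `cuda_available` and the sample test are assumptions, and the real implementation may differ.

```python
import importlib.util

import pytest


def cuda_available() -> bool:
    """Return True only if torch is importable and reports a CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch

    return torch.cuda.is_available()


@pytest.fixture
def requires_cuda() -> bool:
    # Skip the requesting test (rather than fail it) on a non-CUDA box.
    if not cuda_available():
        pytest.skip("CUDA is required for this test")
    return True


def test_needs_gpu(requires_cuda):
    # Placeholder body; listing the fixture as an argument is what gates the test.
    pass
```

A test opts in by naming the fixture as an argument, which keeps the CUDA requirement visible in the test signature.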


17 Jan, 2024 1 commit

Initial FSDP Support for QLoRA Finetuning (#970) · dcfb6f81

Benjamin Warner authored Jan 16, 2024



This PR adds initial FSDP support for training QLoRA models. It enables basic FSDP and CPU offload support; low-memory training via the `FSDP.sync_module_states` option is not supported.

This PR builds off of #840 commit 8278fca and BNB FSDP by @TimDettmers and @Titus-von-Koeller.

An example of using this PR to finetune QLoRA models with FSDP can be found in the demo repo: AnswerDotAi/fsdp_qlora.

* Minimal changes for fp32 4bit storage from BNB commit 8278fca

* Params4bit with selectable storage dtype

* possible fix for double quantizing linear weight & quant storage dtype

* minor fixes in Params4bit for peft tests

* remove redundant

* add float16

* update test

* Remove float16 quant cast as there are fp32, bf16, & fp16 quant kernels

---------
Co-authored-by: Kerem Turgutlu <keremturgutlu@gmail.com>
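
The idea of a selectable storage dtype can be illustrated without any GPU code. The sketch below is a standard-library analogy, not the `Params4bit` implementation: it packs 4-bit codes two per byte (the nibble order is an arbitrary choice here) and then reinterprets the same bytes as 32-bit float storage without changing a single bit. Presumably this is what lets packed weights sit in a buffer whose dtype matches the other parameters being sharded. All function names are hypothetical.

```python
def pack_4bit(codes):
    """Pack integers in 0..15 two per byte, high nibble first."""
    codes = list(codes)
    if any(not 0 <= c <= 15 for c in codes):
        raise ValueError("4-bit codes must be in the range 0..15")
    if len(codes) % 2:
        codes.append(0)  # pad to an even count
    return bytes((hi << 4) | lo for hi, lo in zip(codes[::2], codes[1::2]))


def unpack_4bit(packed, count):
    """Recover the first `count` 4-bit codes from packed bytes."""
    out = []
    for byte in packed:
        out.extend((byte >> 4, byte & 0x0F))
    return out[:count]


def view_as_storage(packed, fmt="f"):
    """Reinterpret packed bytes as another element type; bits are untouched."""
    # The byte length must be a multiple of the target element size.
    return memoryview(packed).cast(fmt)


codes = [1, 15, 0, 7, 9, 3, 12, 4]
packed = pack_4bit(codes)          # 4 bytes holding 8 codes
storage = view_as_storage(packed)  # the same 4 bytes seen as one float32
restored = unpack_4bit(storage.tobytes(), len(codes))
```

The float value seen through `storage` is meaningless; only the underlying bytes matter, and unpacking them returns the original codes.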


08 Jan, 2024 1 commit
- Fixed bnb input in setup.py. Bumped version for release. · 4870580f
  Tim Dettmers authored Jan 07, 2024
03 Dec, 2023 1 commit
- chore: update dev setup · 2c605d03
  Titus von Koeller authored Dec 03, 2023
10 Nov, 2023 1 commit
- test comment removed · 45864262
  Ruslan Svirschevski authored Nov 10, 2023
09 Nov, 2023 1 commit
- fixes for init and tests · ffd46ce1
  Ruslan Svirschevski authored Nov 10, 2023
08 Nov, 2023 1 commit
- partially reverted 76b40a5c · 781fcd5b
  Ruslan Svirschevski authored Nov 08, 2023
02 Nov, 2023 5 commits
- save/load via state_dict now · 76b40a5c
  Ruslan Svirschevski authored Oct 25, 2023
- test update · 965fd5d5
  Ruslan Svirschevski authored Sep 20, 2023
- reverted fn signatures in functional() · 4c11d6dc
  Ruslan Svirschevski authored Sep 20, 2023
- save/load 4bit squashed · 5bcc1ddc
  Ruslan Svirschevski authored Sep 11, 2023
- use QuantState class for quant_state · 61a4a20d
  Ruslan Svirschevski authored Sep 11, 2023
04 Aug, 2023 1 commit
- Fixed two bugs in dynamic data type creation. · 3c9aca91
  Tim Dettmers authored Aug 03, 2023
22 Jul, 2023 1 commit
- Added better default compute_dtype handling for Linear4bit layers. · 412fd0e7
  Tim Dettmers authored Jul 22, 2023
19 Jul, 2023 1 commit
- Increased occupancy. · c82f51c0
  Tim Dettmers authored Jul 19, 2023
17 Jul, 2023 1 commit
- Fix typo in test_optim.py · 87816e4e
  Ikko Eltociear Ashimine authored Jul 18, 2023
```
paramters -> parameters
```
14 Jul, 2023 1 commit
- Changed CUDA setup to use PyTorch default; added a weak test. · 1ab6758b
  Tim Dettmers authored Jul 13, 2023
12 Jul, 2023 1 commit
- Fixed missing bias in bnb.matmul_4bit for inference; more tests. · 90b0ac57
  Tim Dettmers authored Jul 11, 2023
11 Jul, 2023 2 commits
- Test for bloom that fails with inference kernels. · dc96e9e7
  Tim Dettmers authored Jul 11, 2023
- Added more extensive gemv tests; blocksize guard for gemv. · ba51d95d
  Tim Dettmers authored Jul 11, 2023
10 Jul, 2023 5 commits
- Removed debugging statement. · a26a321e
  Tim Dettmers authored Jul 10, 2023
- Fixed accidental deletion of limits in kernel. · 306f6b23
  Tim Dettmers authored Jul 10, 2023
- Added generation tests. · 490153b2
  Tim Dettmers authored Jul 10, 2023
- Added fp32 compute type for gemv_4bit. · 5fab6734
  Tim Dettmers authored Jul 09, 2023
- Added test for Param4bit.to() and fixed double quant behavior. · cef519c8
  Tim Dettmers authored Jul 09, 2023
09 Jul, 2023 3 commits
- Added double quantization support and tests. · 0f0390ac
  Tim Dettmers authored Jul 09, 2023
- Added FP4 fast inference support. · 94168d79
  Tim Dettmers authored Jul 09, 2023
- Added arbitrary data types; fixed a bug for small matrices. · 4b88d69d
  Tim Dettmers authored Jul 09, 2023
08 Jul, 2023 2 commits
- Turning optimization (float accumulation). 185 vs 50. · eefbf602
  Tim Dettmers authored Jul 08, 2023
- Added warp_shuffle indexing 185 vs 54. · 7e49b5b9
  Tim Dettmers authored Jul 08, 2023
05 Jul, 2023 1 commit
- Added bfloat16 quantizations and tests. · 02fd80cb
  Tim Dettmers authored Jul 04, 2023
04 Jul, 2023 2 commits
- Vectorized loads, conflict free NF4; 52 vs 172. · dfe6900b
  Tim Dettmers authored Jul 04, 2023
- Initial 4-bit naive batch size 1, 81 vs 185. · f89ff93e
  Tim Dettmers authored Jul 03, 2023
31 May, 2023 2 commits
- Added debugging functions. · e54d2730
  Tim Dettmers authored May 30, 2023
- Added lookup table. · b7f04e2a
  Tim Dettmers authored May 30, 2023
24 May, 2023 2 commits
- Added PagedLion and bf16 Lion. · 1b8772a8
  Tim Dettmers authored May 23, 2023
- Fixed Makefile. · 2bce175d
  Tim Dettmers authored May 23, 2023