1. 01 Feb, 2024 2 commits
    • Aarni Koskela's avatar
      Test improvements (#1001) · 2336a45c
      Aarni Koskela authored
      * test_nvidia_transform: fix variable reference
      
      `out_order` is the global parametrization list, not the test fixture argument
      
      * Make `parametrize` use more idiomatic
      
      * Use a more deterministic helper for `dim*` determination
      
      * Convert NO_CUBLASLT errors into skips too
      
      * Mark slow and benchmark tests as such (allows `-k "not benchmark"`)
      2336a45c
    • Aarni Koskela's avatar
      test_nvidia_transform: fix variable reference (#1000) · 1a0dc5c3
      Aarni Koskela authored
      `out_order` is the global parametrization list, not the test fixture argument
      1a0dc5c3
  2. 30 Jan, 2024 1 commit
    • Aarni Koskela's avatar
      Ruff fixes (#984) · 706ec24d
      Aarni Koskela authored
      
      
      * Adjust Ruff configuration
      
      * do not autofix always
      * be less strict around tests and benchmarks
      * adjust ignores for now
      
      * Ruff: autofix I and F401
      
      * Apply ruff autofixes
      
      * Fix RUF013 complaint
      
      * Fix mutable default in replace_linear
      
      * Don't use bare except
      
      * Wrap bitsandbytes.__main__ entrypoint in function; fix "sensible" typo
      
      * Fix ruff B008 (function call in arguments)
      
      * Add ruff noqas as suitable
      
      * Fix RUF005 (splat instead of concatenating)
      
      * Fix B018 (useless expression)
      
      * Add pre-commit configuration + GitHub Actions lint workflow
      
      * Fix unused `e` in bitsandbytes/__main__.py
      
      * fix merge conflict resolution error
      
      * run pre-commit hook
      
      ---------
      Co-authored-by: default avatarTitus <9048635+Titus-von-Koeller@users.noreply.github.com>
      706ec24d
  3. 24 Jan, 2024 1 commit
  4. 17 Jan, 2024 1 commit
    • Benjamin Warner's avatar
      Initial FSDP Support for QLoRA Finetuning (#970) · dcfb6f81
      Benjamin Warner authored
      
      
      This PR adds initial FSDP support for training QLoRA models. It enables basic FSDP and CPU Offload support, with low memory training via FSDP.sync_module_states option unsupported.
      
      This PR builds off of #840 commit 8278fca and BNB FSDP by @TimDettmers and @Titus-von-Koeller.
      
      An example of using this PR to finetune QLoRA models with FSDP can be found in the demo repo: AnswerDotAi/fsdp_qlora.
      
      * Minimal changes for fp32 4bit storage from BNB commit 8278fca
      
      * Params4bit with selectable storage dtype
      
      * possible fix for double quantizing linear weight & quant storage dtype
      
      * minor fixes in Params4bit for peft tests
      
      * remove redundant
      
      * add float16
      
      * update test
      
      * Remove float16 quant cast as there are fp32, bf16, & fp16 quant kernels
      
      ---------
      Co-authored-by: default avatarKerem Turgutlu <keremturgutlu@gmail.com>
      dcfb6f81
  5. 08 Jan, 2024 1 commit
  6. 03 Dec, 2023 1 commit
  7. 10 Nov, 2023 1 commit
  8. 09 Nov, 2023 1 commit
  9. 08 Nov, 2023 1 commit
  10. 02 Nov, 2023 5 commits
  11. 04 Aug, 2023 1 commit
  12. 22 Jul, 2023 1 commit
  13. 19 Jul, 2023 1 commit
  14. 17 Jul, 2023 1 commit
  15. 14 Jul, 2023 1 commit
  16. 12 Jul, 2023 1 commit
  17. 11 Jul, 2023 2 commits
  18. 10 Jul, 2023 5 commits
  19. 09 Jul, 2023 3 commits
  20. 08 Jul, 2023 2 commits
  21. 05 Jul, 2023 1 commit
  22. 04 Jul, 2023 2 commits
  23. 31 May, 2023 2 commits
  24. 24 May, 2023 2 commits