1. 29 Apr, 2025 1 commit
    • Matthew Douglas's avatar
      Set up nightly CI for unit tests (#1619) · a5dd01bb
      Matthew Douglas authored
      * Run unit tests on GH Actions
      
      * fix
      
      * fix
      
      * trigger workflow
      
      * Update
      
      * Update
      
      * Update
      
      * Run tests nightly
      
      * Disable paged optimizer test on Windows
      
      * Skip unit tests on Windows for CUDA 12.x (driver on runner is too old)
      a5dd01bb
  2. 28 Apr, 2025 4 commits
  3. 22 Apr, 2025 2 commits
    • Matthew Douglas's avatar
      Stop building for CUDA toolkit < 11.8 (#1605) · 53daa0e2
      Matthew Douglas authored
      * Stop building for CUDA toolkit < 11.8
      
      * Simplify
      
      * Drop sm70 from cu128 build targets to align with pytorch
      53daa0e2
    • Matthew Douglas's avatar
      Updates for device agnosticism (#1601) · 1088ec52
      Matthew Douglas authored
      * Include device support tags for transformers multi-backend compatability; add xpu() and cpu() to Params4bit
      
      * Make test suite more device-agnostic
      
      * Additional device agnostic tests
      
      * Additional device agnosticism for tests
      
      * Add BNB_TEST_DEVICE env var to manually select device for unit tests
      
      * Include device support tags for transformers multi-backend compatability; add xpu() and cpu() to Params4bit
      
      * Make test suite more device-agnostic
      
      * Additional device agnostic tests
      
      * Additional device agnosticism for tests
      
      * Add BNB_TEST_DEVICE env var to manually select device for unit tests
      
      * Small bugfix for int8 test
      
      * Exclude backward() from code coverage reports
      
      * Params4bit: don't try to quantize when moving to meta device
      1088ec52
  4. 17 Apr, 2025 2 commits
  5. 15 Apr, 2025 2 commits
  6. 14 Apr, 2025 1 commit
  7. 10 Apr, 2025 2 commits
  8. 07 Apr, 2025 1 commit
  9. 01 Apr, 2025 1 commit
  10. 31 Mar, 2025 1 commit
  11. 27 Mar, 2025 3 commits
  12. 25 Mar, 2025 3 commits
    • Matthew Douglas's avatar
      Bump dev version · b86ff64b
      Matthew Douglas authored
      b86ff64b
    • Matthew Douglas's avatar
      PyTorch Custom Operator Integration (#1544) · e82f72b3
      Matthew Douglas authored
      
      
      * Sketch out first custom op registration
      
      * Add note
      
      * Initial int8 op registration
      
      * Cleanup some deprecated functions.
      
      * Int8 ops updates; tests
      
      * Implement 4bit quant/dequant ops
      
      * Fix nested quant
      
      * cleanup
      
      * Test improvements
      
      * Clean up and improve tests
      
      * Add higher level custom op for int8 matmul + dequant + bias
      
      * Add gemv 4bit custom op
      
      * Cleanup
      
      * Implement out kwarg overloads for custom ops
      
      * Update PyTorch minimum to 2.1
      
      * Deprecation updates
      
      * Deprecation updates
      
      * Cleanup; rename int8_linear_dequant -> int8_scaled_mm
      
      * Bump min pytorch to 2.2
      
      * cleanup
      
      * Test reorganization
      
      * Remove deprecated supports_igemmlt
      
      * More cleanup
      
      * Cleanup obsolete C++/CUDA code
      
      * Cleanup
      
      * Create 'default' backend for fallback op implementations; initial CPU nf4 work
      
      * Stub out for multi-platform
      
      * Fix serialization tests for torch>=2.6.0
      
      * Add example for torch.compile e2e inference
      
      * Test update
      
      ---------
      Co-authored-by: default avatarTitus von Koeller <9048635+Titus-von-Koeller@users.noreply.github.com>
      e82f72b3
    • Matthew Douglas's avatar
      Release 0.45.4 · f0735f95
      Matthew Douglas authored
      f0735f95
  13. 19 Mar, 2025 1 commit
  14. 13 Mar, 2025 1 commit
  15. 07 Mar, 2025 1 commit
  16. 25 Feb, 2025 2 commits
  17. 24 Feb, 2025 4 commits
  18. 20 Feb, 2025 1 commit
  19. 19 Feb, 2025 4 commits
  20. 06 Feb, 2025 3 commits