- 06 May, 2025 3 commits
-
-
Titus von Koeller authored
-
Titus von Koeller authored
-
Titus von Koeller authored
-
- 05 May, 2025 2 commits
-
-
Titus von Koeller authored
-
Titus von Koeller authored
-
- 30 Apr, 2025 2 commits
-
-
Titus von Koeller authored
-
Titus von Koeller authored
-
- 29 Apr, 2025 2 commits
-
-
Titus von Koeller authored
-
Titus von Koeller authored
-
- 28 Apr, 2025 2 commits
-
-
Titus von Koeller authored
-
jiqing-feng authored
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
-
- 22 Apr, 2025 2 commits
-
-
Matthew Douglas authored
* Stop building for CUDA toolkit < 11.8
* Simplify
* Drop sm70 from cu128 build targets to align with PyTorch
-
Matthew Douglas authored
* Include device support tags for transformers multi-backend compatibility; add xpu() and cpu() to Params4bit
* Make test suite more device-agnostic
* Additional device agnostic tests
* Additional device agnosticism for tests
* Add BNB_TEST_DEVICE env var to manually select device for unit tests
* Small bugfix for int8 test
* Exclude backward() from code coverage reports
* Params4bit: don't try to quantize when moving to meta device
-
- 17 Apr, 2025 2 commits
-
-
Matthew Douglas authored
-
Vedant authored
* fix: Improve CUDA version detection and error handling
* lint fix
* lint fix
-
- 15 Apr, 2025 2 commits
-
-
Vedant authored
-
youngrok cha authored
* [fix] define blocksize: showing just a bare number is a bit confusing, so define blocksize explicitly
* [fix] match code with ademamix: also define blocksize, as in the previous commit
-
- 14 Apr, 2025 1 commit
-
-
Matthew Douglas authored
Co-authored-by: Titus von Koeller <9048635+Titus-von-Koeller@users.noreply.github.com>
-
- 10 Apr, 2025 2 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 07 Apr, 2025 1 commit
-
-
Titus authored
-
- 01 Apr, 2025 1 commit
-
-
Matthew Douglas authored
-
- 31 Mar, 2025 1 commit
-
-
DevKimbob authored
Changed return type from map object to tuple in get_cuda_version_tuple().
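As a rough illustration of why this kind of change matters (a minimal sketch, not the library's actual code): in Python 3, `map()` returns a lazy, single-use iterator, whereas a tuple can be re-read and compared against another version tuple. The helper names and the `"12.4"` version string below are hypothetical.

```python
# Hypothetical helpers for illustration only; not the bitsandbytes implementation.

def parse_version_as_map(version: str):
    # "Before" style: returns a lazy map object that can be consumed only once.
    return map(int, version.split("."))


def parse_version_as_tuple(version: str):
    # "After" style: materialize the parsed parts into an immutable tuple.
    return tuple(map(int, version.split(".")))


lazy = parse_version_as_map("12.4")
first_pass = list(lazy)   # consumes the iterator
second_pass = list(lazy)  # iterator is already exhausted here

eager = parse_version_as_tuple("12.4")
is_new_enough = eager >= (11, 8)  # tuples compare element by element

print(first_pass, second_pass, eager, is_new_enough)
```

A map object also does not support ordering comparisons such as `>=` against a tuple, so callers that compare versions need the materialized form.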
-
- 27 Mar, 2025 3 commits
-
-
Matthew Douglas authored
* Testing cleanup
* More test cleanup
* Additional deprecations/removals.
* Skip benchmark, deprecated, slow tests by default
-
Matthew Douglas authored
* Drop Python 3.8 support.
* Formatting
-
Matthew Douglas authored
-
- 25 Mar, 2025 3 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
* Sketch out first custom op registration
* Add note
* Initial int8 op registration
* Cleanup some deprecated functions.
* Int8 ops updates; tests
* Implement 4bit quant/dequant ops
* Fix nested quant
* cleanup
* Test improvements
* Clean up and improve tests
* Add higher level custom op for int8 matmul + dequant + bias
* Add gemv 4bit custom op
* Cleanup
* Implement out kwarg overloads for custom ops
* Update PyTorch minimum to 2.1
* Deprecation updates
* Deprecation updates
* Cleanup; rename int8_linear_dequant -> int8_scaled_mm
* Bump min pytorch to 2.2
* cleanup
* Test reorganization
* Remove deprecated supports_igemmlt
* More cleanup
* Cleanup obsolete C++/CUDA code
* Cleanup
* Create 'default' backend for fallback op implementations; initial CPU nf4 work
* Stub out for multi-platform
* Fix serialization tests for torch>=2.6.0
* Add example for torch.compile e2e inference
* Test update
---------
Co-authored-by: Titus von Koeller <9048635+Titus-von-Koeller@users.noreply.github.com>
-
Matthew Douglas authored
-
- 19 Mar, 2025 1 commit
-
-
Titus authored
-
- 13 Mar, 2025 1 commit
-
-
Titus authored
-
- 07 Mar, 2025 1 commit
-
-
Ethan Kiang authored
-
- 25 Feb, 2025 2 commits
-
-
Matthew Douglas authored
-
dependabot[bot] authored
Bumps the minor-patch group with 1 update in the / directory: [ruff](https://github.com/astral-sh/ruff).
Updates `ruff` from 0.6.9 to 0.9.6
  - [Release notes](https://github.com/astral-sh/ruff/releases)
  - [Changelog](https://github.com/astral-sh/ruff/blob/main/CHANGELOG.md)
  - [Commits](https://github.com/astral-sh/ruff/compare/0.6.9...0.9.6)
---
updated-dependencies:
  - dependency-name: ruff
    dependency-type: direct:production
    update-type: version-update:semver-minor
    dependency-group: minor-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
- 24 Feb, 2025 4 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 20 Feb, 2025 1 commit
-
-
Titus authored
-
- 19 Feb, 2025 1 commit
-
-
Matthew Douglas authored
-