- 27 Jun, 2025 1 commit
-
-
Matthew Douglas authored
* Add CUDA 12.9 to build/test workflows * Downgrade Jimver/cuda-toolkit to v0.2.24 * Update python-package.yml * Update python-package.yml * Update python-package.yml * Update tests.yml * Update tests.yml
-
- 24 Jun, 2025 1 commit
-
-
Aman Gupta authored
-
- 23 Jun, 2025 1 commit
-
-
Aman Gupta authored
-
- 20 Jun, 2025 1 commit
-
-
pnunna93 authored
* Port ROCm changes from multi-backend-refactor branch * Update ops.py * Update functional.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update functional.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update functional.py * Update functional.py * Update functional.py * Update functional.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update functional.py * Update functional.py * Update functional.py * Update test_ops.py * Update test_functional.py * Update test_ops.py * Update test_functional.py * Update test_functional.py * Update functional.py * Update functional.py * Update ops.py * Update ops.py * Update test_functional.py * Update test_functional.py * Update cextension.py * Update cuda_specs.py * Update cuda_specs.py * Update test_functional.py * Update test_linear4bit.py * Update test_cuda_setup_evaluator.py * Update test_functional.py * Update modules.py * Update modules.py * Update ops.py * Update test_linear4bit.py * Update ops.py * Update ops.py * Update test_linear4bit.py * Update test_linear4bit.py * Update python-package.yml * Update python-package.yml * Update python-package.yml * Update python-package.yml * Create build-rocm.sh * Update cuda_specs.py * Fix trailing whitespace * Remove conflicts.diff * update for hipblasVersionMajor >=3 * Update test_functional.py * Update test_linear4bit.py * Update test_ops.py * Update main.py * Update test_functional.py * Update test_linear4bit.py * Update test_ops.py * Update test_linear4bit.py * Lint * Lint * Update helpers.py * Update test_functional.py * Update test_linear4bit.py * Update test_ops.py * Lint * Update pythonInterface.cpp * lint fix * lint * Update pythonInterface.cpp * revert permissions change * Fix indentation * Update kernels_hip.cuh * Update kernels.hip * Update ops.hip * Update ops_hip.cuh * Update kernels_hip.cuh * Update kernels.hip * Update kernels.hip * Update ops.hip * Update ops_hip.cuh * Update ops.hip * Update CMakeLists.txt * Update functional.py * Update cextension.py * Update cextension.py --------- Co-authored-by:
MISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com> Co-authored-by:
MISHANMAUYRA <mishanmaurya31081@gmail.com> Co-authored-by:
amcamd <andrew.chapman@amd.com> Co-authored-by:
Prasanth Nunna <root@banff-cyxtera-s78-1.amd.com>
-
- 19 Jun, 2025 1 commit
-
-
Matthew Douglas authored
-
- 18 Jun, 2025 1 commit
-
-
Chetan Kumar Verma authored
-
- 17 Jun, 2025 1 commit
-
-
Matthew Douglas authored
* Setup XPU CI * CI: expand XPU matrix * test * test * test * test * test * test * test * test * test * test * skip some fp4 tests on hpu * skip some fp4 tests on hpu * skip gemv tests on hpu * test * Additional test patches for HPU * HPU test update * HPU test update * HPU test update * HPU test update * Format
-
- 16 Jun, 2025 1 commit
-
-
Chetan Kumar Verma authored
-
- 13 Jun, 2025 3 commits
-
-
Matthew Douglas authored
* Add clang-format rules * Update clang-format
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 11 Jun, 2025 3 commits
-
-
वेदांत authored
* doc fix signature for 8-bit optim * required changes * precommit
-
Egor authored
-
Dmitrii Makarenko authored
* [xpu/triton] Add trtion dequantization kernel This PR adds xpu backend and trtion kernel for dequantization nf4 dtype. Trtion is an optional import. Tests: tests/test_functional.py::TestQuantize4BitFunctional supported nf4/fp4 cases tests/test_functional.py::Test8BitBlockwiseQuantizeFunctional implemented quantize_blockwise with binary search that works faster for XPU tests/test_linear4bit.py Signed-off-by:Dmitrii Makarenko <dmitrii.makarenko@intel.com> * align with ipex code * enable test for ipex * test_kbit_backprop: skip no longer needed * remove unused --------- Signed-off-by:
Dmitrii Makarenko <dmitrii.makarenko@intel.com>
-
- 08 Jun, 2025 1 commit
-
-
Matthew Douglas authored
-
- 06 Jun, 2025 2 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 05 Jun, 2025 2 commits
-
-
Ruheena Suhani Shaik authored
* supports hpu backend in main branch * Update bitsandbytes/backends/hpu/ops.py updates the assertion message Co-authored-by:
Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> * Update bitsandbytes/backends/hpu/ops.py Co-authored-by:
Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> * Update ops.py Fix lint issue * Update ops.py --------- Co-authored-by:
Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
-
Matthew Douglas authored
-
- 04 Jun, 2025 1 commit
-
-
Matthew Douglas authored
* Deprecation cleanup: remove histogram_scatter_add_2d * Deprecation cleanup: vectorwise_mm_dequant * Deprecation cleanup: vectorwise_quant * Remove unused test * Optimizer test cleanup * Deprecations: remove estimate_quantiles, create_quantile_map * Move deprecated test
-
- 03 Jun, 2025 3 commits
-
-
Matthew Douglas authored
-
mklabunde authored
-
Matthew Douglas authored
-
- 02 Jun, 2025 3 commits
-
-
Matthew Douglas authored
* Tests: add linux x64 cpu+ipex to nightly CI workflow * typo * Tests: guard linear8bit compile test for ipex cpu issue
-
Matthew Douglas authored
* Tests: xfail opcheck for 4bit quantization with floating storage dtypes * Tests: xfail opcheck for 4bit quantization with floating storage dtypes * Tests: skip test_gemv_eye_4bit on CPU with bf16 when not supported by torch * Tests: skip test_gemv_eye_4bit on CPU with bf16 when not supported by torch
-
Matthew Douglas authored
-
- 28 May, 2025 1 commit
-
-
jiqing-feng authored
* enable ipex Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix cpu 8bit quantization Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix int8 and nf4 cpu inference Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * add cpu fp4 and rem Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix dequantize nf4 xpu Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix ipex op Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix dequantize nf4 name Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix dequantize nf4 ipex Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix matmul8bitfp Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * enable cpu tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix quantize blockwise output shape Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix quant_storage bf16 and gemv cpu Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix cpu tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix xpu tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix lib Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * skip xpu dequantize blockwise op check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix matmul8bit Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * skip not used function teests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix matmul8bit fp Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * check ipex before MatMul8bitFp Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * update ipex install guide Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * update install guide Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix error log Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix error lof Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * update comment Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * move torch op to default Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * revert ipex check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix code tabledevice Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix code table device Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix xpu ops Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by:
jiqing-feng <jiqing.feng@intel.com>
-
- 27 May, 2025 2 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 24 May, 2025 4 commits
-
-
वेदांत authored
-
Matthew Douglas authored
* Start cleaning up docs * Remove page * Minor update * correction * Minor doc revisions * Update installation.mdx * Update _toctree.yml
-
Matthew Douglas authored
* Add torch.compile tests * Tests: WA aarch64 CPU regressions for torch 2.6.0; add Windows torch==2.7.0+cu118 test config * Tests: skip torch.compile for cuda on windows
-
Matthew Douglas authored
* General cleanup & test improvements * Tests: WA numpy 2 compat issue for torch<2.3 * Tests: update aarch64 cpu min torch version * Tests: update aarch64 cpu min torch version * Tests: update aarch64 cpu min torch version
-
- 23 May, 2025 1 commit
-
-
Matthew Douglas authored
-
- 21 May, 2025 2 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 19 May, 2025 3 commits
-
-
Matthew Douglas authored
* Test g5g runner * Switch L4 to L40S runner; swap GitHub Linux T4 runner for AWS g4dn * Run tests on last 2 pytorch stable releases * Run tests on last 2 pytorch stable releases
-
Titus von Koeller authored
-
Titus von Koeller authored
-
- 16 May, 2025 1 commit
-
-
Titus von Koeller authored
-