- 14 Jul, 2025 2 commits
-
-
Egor Krivov authored
-
Egor Krivov authored
-
- 11 Jul, 2025 2 commits
-
-
Egor Krivov authored
-
Egor Krivov authored
-
- 08 Jul, 2025 2 commits
-
-
Matthew Douglas authored
[XPU] Add inference benchmark for XPU
-
Matthew Douglas authored
fix log
-
- 03 Jul, 2025 1 commit
-
-
jiqing-feng authored
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
-
- 02 Jul, 2025 1 commit
-
-
Egor Krivov authored
-
- 01 Jul, 2025 4 commits
-
-
Michał Górny authored
* Automatically call CMake as part of PEP 517 build

  Call CMake and build the CPU extension when invoking the build via a PEP 517 backend, to ensure that at least some extension is built when users are building from source. This improves consistency with other Python packages, and reduces the risk of accidents.

  We are using `scikit-build-core` setuptools plugin to take care of CMake dependencies and call into CMake. However, we need to modify the `build_py` command to ensure that CMake is called prior to the setuptools command, as otherwise the newly built shared library won't be picked up by `build_py`.

  Since setuptools is still responsible for collecting the Python package, it also collects all other shared libraries that were built earlier, for example via manual CMake calls as done in the CI pipeline.

  Furthermore, if the user does not have `scikit-build-core` installed and calls `setup.py` directly, we output a warning but continue working as before.

  The logic can be further extended in the future, for example to detect the best COMPUTE_BACKEND default.

  Fixes #1511
* Include C sources and build files in source distribution
* Fix formatting
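The ordering constraint described in this commit (CMake must finish before `build_py` collects files) can be illustrated with a minimal sketch. This is not the code from the commit: it swaps the `scikit-build-core` plugin for plain `subprocess` calls to `cmake`, and the names `cmake_commands` and `CMakeFirstMixin`, the `build` directory, and the `cpu` default are all assumptions made for illustration.

```python
import subprocess


def cmake_commands(source_dir=".", build_dir="build", compute_backend="cpu"):
    """Return the configure and build invocations as argv lists (hypothetical helper)."""
    configure = [
        "cmake", "-S", source_dir, "-B", build_dir,
        f"-DCOMPUTE_BACKEND={compute_backend}",
    ]
    build = ["cmake", "--build", build_dir]
    return [configure, build]


class CMakeFirstMixin:
    """Mixin for a setuptools command: run CMake first, then the wrapped command.

    Running CMake before the parent `run()` means the freshly built shared
    library already exists when the parent command collects package files.
    """

    # Replaceable so the ordering can be exercised without invoking CMake.
    runner = staticmethod(subprocess.check_call)

    def run(self):
        for argv in cmake_commands():
            self.runner(argv)
        super().run()


# Hypothetical wiring in setup.py (assumes setuptools is installed):
#
#   from setuptools import setup
#   from setuptools.command.build_py import build_py
#
#   class BuildPyWithCMake(CMakeFirstMixin, build_py):
#       pass
#
#   setup(cmdclass={"build_py": BuildPyWithCMake})
```

Keeping the CMake step in a mixin leaves setuptools in charge of collecting the package, which matches the commit's note that libraries built earlier by manual CMake calls are still picked up.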
-
Matthew Douglas authored
* Add torch 2.8 rc / 2.9 nightly to tests * Update tests.yml * Update tests.yml
-
Matthew Douglas authored
-
jiqing-feng authored
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
-
- 30 Jun, 2025 1 commit
-
-
Matthew Douglas authored
-
- 27 Jun, 2025 1 commit
-
-
Matthew Douglas authored
* Add CUDA 12.9 to build/test workflows * Downgrade Jimver/cuda-toolkit to v0.2.24 * Update python-package.yml * Update python-package.yml * Update python-package.yml * Update tests.yml * Update tests.yml
-
- 24 Jun, 2025 1 commit
-
-
Aman Gupta authored
-
- 23 Jun, 2025 1 commit
-
-
Aman Gupta authored
-
- 20 Jun, 2025 1 commit
-
-
pnunna93 authored
* Port ROCm changes from multi-backend-refactor branch * Update ops.py * Update functional.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update functional.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update functional.py * Update functional.py * Update functional.py * Update functional.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update functional.py * Update functional.py * Update functional.py * Update test_ops.py * Update test_functional.py * Update test_ops.py * Update test_functional.py * Update test_functional.py * Update functional.py * Update functional.py * Update ops.py * Update ops.py * Update test_functional.py * Update test_functional.py * Update cextension.py * Update cuda_specs.py * Update cuda_specs.py * Update test_functional.py * Update test_linear4bit.py * Update test_cuda_setup_evaluator.py * Update test_functional.py * Update modules.py * Update modules.py * Update ops.py * Update test_linear4bit.py * Update ops.py * Update ops.py * Update test_linear4bit.py * Update test_linear4bit.py * Update python-package.yml * Update python-package.yml * Update python-package.yml * Update python-package.yml * Create build-rocm.sh * Update cuda_specs.py * Fix trailing whitespace * Remove conflicts.diff * update for hipblasVersionMajor >=3 * Update test_functional.py * Update test_linear4bit.py * Update test_ops.py * Update main.py * Update test_functional.py * Update test_linear4bit.py * Update test_ops.py * Update test_linear4bit.py * Lint * Lint * Update helpers.py * Update test_functional.py * Update test_linear4bit.py * Update test_ops.py * Lint * Update pythonInterface.cpp * lint fix * lint * Update pythonInterface.cpp * revert permissions change * Fix indentation * Update kernels_hip.cuh * Update kernels.hip * Update ops.hip * Update ops_hip.cuh * Update kernels_hip.cuh * Update kernels.hip * Update 
kernels.hip * Update ops.hip * Update ops_hip.cuh * Update ops.hip * Update CMakeLists.txt * Update functional.py * Update cextension.py * Update cextension.py --------- Co-authored-by:
MISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com> Co-authored-by:
MISHANMAUYRA <mishanmaurya31081@gmail.com> Co-authored-by:
amcamd <andrew.chapman@amd.com> Co-authored-by:
Prasanth Nunna <root@banff-cyxtera-s78-1.amd.com>
-
- 19 Jun, 2025 1 commit
-
-
Matthew Douglas authored
-
- 18 Jun, 2025 1 commit
-
-
Chetan Kumar Verma authored
-
- 17 Jun, 2025 1 commit
-
-
Matthew Douglas authored
* Setup XPU CI * CI: expand XPU matrix * test * test * test * test * test * test * test * test * test * test * skip some fp4 tests on hpu * skip some fp4 tests on hpu * skip gemv tests on hpu * test * Additional test patches for HPU * HPU test update * HPU test update * HPU test update * HPU test update * Format
-
- 16 Jun, 2025 1 commit
-
-
Chetan Kumar Verma authored
-
- 13 Jun, 2025 3 commits
-
-
Matthew Douglas authored
* Add clang-format rules * Update clang-format
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 11 Jun, 2025 3 commits
-
-
वेदांत authored
* doc fix signature for 8-bit optim * required changes * precommit
-
Egor authored
-
Dmitrii Makarenko authored
* [xpu/triton] Add Triton dequantization kernel This PR adds an XPU backend and a Triton kernel for dequantizing the nf4 dtype. Triton is an optional import. Tests: tests/test_functional.py::TestQuantize4BitFunctional (supported nf4/fp4 cases); tests/test_functional.py::Test8BitBlockwiseQuantizeFunctional (quantize_blockwise implemented with a binary search, which runs faster on XPU); tests/test_linear4bit.py Signed-off-by: Dmitrii Makarenko <dmitrii.makarenko@intel.com> * align with ipex code * enable test for ipex * test_kbit_backprop: skip no longer needed * remove unused --------- Signed-off-by:
Dmitrii Makarenko <dmitrii.makarenko@intel.com>
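The idea of blockwise quantization with a binary search over a sorted code table can be sketched in plain Python. This is an illustration only, not the Triton kernel from the PR: the function names, the list-based data layout, and the tiny five-entry code table in the usage lines are assumptions, and the real implementation operates on tensors.

```python
from bisect import bisect_left


def nearest_code_index(code, x):
    """Index of the entry in the sorted list `code` closest to x, via binary search."""
    i = bisect_left(code, x)
    if i == 0:
        return 0
    if i == len(code):
        return len(code) - 1
    # x lies between code[i - 1] and code[i]; pick the nearer neighbour.
    return i if code[i] - x < x - code[i - 1] else i - 1


def quantize_blockwise(values, code, blocksize=64):
    """Scale each block by its absolute maximum, then map values to code indices."""
    indices, absmax = [], []
    for start in range(0, len(values), blocksize):
        block = values[start:start + blocksize]
        scale = max(abs(v) for v in block) or 1.0  # avoid dividing by zero
        absmax.append(scale)
        indices.extend(nearest_code_index(code, v / scale) for v in block)
    return indices, absmax


def dequantize_blockwise(indices, absmax, code, blocksize=64):
    """Look up each code value and rescale it by its block's absolute maximum."""
    return [code[q] * absmax[i // blocksize] for i, q in enumerate(indices)]


# Usage with a hypothetical 5-entry code table (real tables are larger).
code = [-1.0, -0.5, 0.0, 0.5, 1.0]
idx, scales = quantize_blockwise([2.0, -1.0, 0.1, 0.9], code, blocksize=4)
restored = dequantize_blockwise(idx, scales, code, blocksize=4)
```

Because the code table is sorted, each lookup costs O(log n) comparisons instead of a linear scan over all entries, which is the general reason a binary search can be the faster choice.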
-
- 08 Jun, 2025 1 commit
-
-
Matthew Douglas authored
-
- 06 Jun, 2025 2 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 05 Jun, 2025 2 commits
-
-
Ruheena Suhani Shaik authored
* supports hpu backend in main branch * Update bitsandbytes/backends/hpu/ops.py updates the assertion message Co-authored-by:
Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> * Update bitsandbytes/backends/hpu/ops.py Co-authored-by:
Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> * Update ops.py Fix lint issue * Update ops.py --------- Co-authored-by:
Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
-
Matthew Douglas authored
-
- 04 Jun, 2025 1 commit
-
-
Matthew Douglas authored
* Deprecation cleanup: remove histogram_scatter_add_2d * Deprecation cleanup: vectorwise_mm_dequant * Deprecation cleanup: vectorwise_quant * Remove unused test * Optimizer test cleanup * Deprecations: remove estimate_quantiles, create_quantile_map * Move deprecated test
-
- 03 Jun, 2025 3 commits
-
-
Matthew Douglas authored
-
mklabunde authored
-
Matthew Douglas authored
-
- 02 Jun, 2025 3 commits
-
-
Matthew Douglas authored
* Tests: add linux x64 cpu+ipex to nightly CI workflow * typo * Tests: guard linear8bit compile test for ipex cpu issue
-
Matthew Douglas authored
* Tests: xfail opcheck for 4bit quantization with floating storage dtypes * Tests: xfail opcheck for 4bit quantization with floating storage dtypes * Tests: skip test_gemv_eye_4bit on CPU with bf16 when not supported by torch * Tests: skip test_gemv_eye_4bit on CPU with bf16 when not supported by torch
-
Matthew Douglas authored
-
- 28 May, 2025 1 commit
-
-
jiqing-feng authored
* enable ipex Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix cpu 8bit quantization Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix int8 and nf4 cpu inference Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * add cpu fp4 and rem Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix dequantize nf4 xpu Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix ipex op Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix dequantize nf4 name Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix dequantize nf4 ipex Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix matmul8bitfp Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * enable cpu tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix quantize blockwise output shape Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix quant_storage bf16 and gemv cpu Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix cpu tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix xpu tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix lib Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * skip xpu dequantize blockwise op check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix matmul8bit Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * skip unused function tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix matmul8bit fp Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * check ipex before MatMul8bitFp Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * update ipex install guide Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * update install guide Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix error log Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix error log Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * update comment Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * move torch op to default Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * revert ipex check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix code table device Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix code table device Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix xpu ops Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by:
jiqing-feng <jiqing.feng@intel.com>
-