1. 14 Jul, 2025 2 commits
  2. 11 Jul, 2025 2 commits
  3. 08 Jul, 2025 2 commits
  4. 03 Jul, 2025 1 commit
  5. 02 Jul, 2025 1 commit
  6. 01 Jul, 2025 4 commits
    • Michał Górny's avatar
      Automatically call CMake as part of PEP 517 build (#1512) · ed9c8fca
      Michał Górny authored
      * Automatically call CMake as part of PEP 517 build
      
      Call CMake and build the CPU extension when invoking the build
      via a PEP 517 backend, to ensure that at least some extension is built
      when users are building from source.  This improves consistency with
      other Python packages, and reduces the risk of accidents.
      
      We are using `scikit-build-core` setuptools plugin to take care of CMake
      dependencies and call into CMake.  However, we need to modify
      the `build_py` command to ensure that CMake is called prior to
      the setuptools command, as otherwise the newly built shared library
      won't be picked up by `build_py`.
      
      Since setuptools is still responsible for collecting the Python package,
      it also collects all other shared libraries that were built earlier,
      for example via manual CMake calls as done in the CI pipeline.
      Furthermore, if the user does not have `scikit-build-core` installed
      and calls `setup.py` directly, we output a warning but continue working
      as before.
      
      The logic can be further extended in the future, for example to detect
      the best COMPUTE_BACKEND default.
      
      Fixes #1511
      
      * Include C sources and build files in source distribution
      
      * Fix formatting
      ed9c8fca
    • Matthew Douglas's avatar
      CI: Test with PyTorch 2.8.0 RC (#1693) · ed398d28
      Matthew Douglas authored
      * Add torch 2.8 rc / 2.9 nightly to tests
      
      * Update tests.yml
      
      * Update tests.yml
      ed398d28
    • Matthew Douglas's avatar
      Update README.md · e28d4d91
      Matthew Douglas authored
      e28d4d91
    • jiqing-feng's avatar
      bdcee0ff
  7. 30 Jun, 2025 1 commit
  8. 27 Jun, 2025 1 commit
    • Matthew Douglas's avatar
      Add CUDA 12.9 build (#1689) · 1abd5e78
      Matthew Douglas authored
      * Add CUDA 12.9 to build/test workflows
      
      * Downgrade Jimver/cuda-toolkit to v0.2.24
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update tests.yml
      
      * Update tests.yml
      1abd5e78
  9. 24 Jun, 2025 1 commit
  10. 23 Jun, 2025 1 commit
  11. 20 Jun, 2025 1 commit
    • pnunna93's avatar
      Enable ROCm backend with custom ops integration (#1683) · 888788d7
      pnunna93 authored
      
      
      * Port ROCm changes from multi-backend-refactor branch
      
      * Update ops.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update test_ops.py
      
      * Update test_functional.py
      
      * Update test_ops.py
      
      * Update test_functional.py
      
      * Update test_functional.py
      
      * Update functional.py
      
      * Update functional.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update test_functional.py
      
      * Update test_functional.py
      
      * Update cextension.py
      
      * Update cuda_specs.py
      
      * Update cuda_specs.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_cuda_setup_evaluator.py
      
      * Update test_functional.py
      
      * Update modules.py
      
      * Update modules.py
      
      * Update ops.py
      
      * Update test_linear4bit.py
      
      * Update ops.py
      
      * Update ops.py
      
      * Update test_linear4bit.py
      
      * Update test_linear4bit.py
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Update python-package.yml
      
      * Create build-rocm.sh
      
      * Update cuda_specs.py
      
      * Fix trailing whitespace
      
      * Remove conflicts.diff
      
      * update for hipblasVersionMajor >=3
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Update main.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Update test_linear4bit.py
      
      * Lint
      
      * Lint
      
      * Update helpers.py
      
      * Update test_functional.py
      
      * Update test_linear4bit.py
      
      * Update test_ops.py
      
      * Lint
      
      * Update pythonInterface.cpp
      
      * lint fix
      
      * lint
      
      * Update pythonInterface.cpp
      
      * revert permissions change
      
      * Fix indentation
      
      * Update kernels_hip.cuh
      
      * Update kernels.hip
      
      * Update ops.hip
      
      * Update ops_hip.cuh
      
      * Update kernels_hip.cuh
      
      * Update kernels.hip
      
      * Update kernels.hip
      
      * Update ops.hip
      
      * Update ops_hip.cuh
      
      * Update ops.hip
      
      * Update CMakeLists.txt
      
      * Update functional.py
      
      * Update cextension.py
      
      * Update cextension.py
      
      ---------
      Co-authored-by: default avatarMISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com>
      Co-authored-by: default avatarMISHANMAUYRA <mishanmaurya31081@gmail.com>
      Co-authored-by: default avataramcamd <andrew.chapman@amd.com>
      Co-authored-by: default avatarPrasanth Nunna <root@banff-cyxtera-s78-1.amd.com>
      888788d7
  12. 19 Jun, 2025 1 commit
  13. 18 Jun, 2025 1 commit
  14. 17 Jun, 2025 1 commit
    • Matthew Douglas's avatar
      CI: Setup HPU nightly tests (#1681) · 29564ad6
      Matthew Douglas authored
      * Setup XPU CI
      
      * CI: expand XPU matrix
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * test
      
      * skip some fp4 tests on hpu
      
      * skip some fp4 tests on hpu
      
      * skip gemv tests on hpu
      
      * test
      
      * Additional test patches for HPU
      
      * HPU test update
      
      * HPU test update
      
      * HPU test update
      
      * HPU test update
      
      * Format
      29564ad6
  15. 16 Jun, 2025 1 commit
  16. 13 Jun, 2025 3 commits
  17. 11 Jun, 2025 3 commits
  18. 08 Jun, 2025 1 commit
  19. 06 Jun, 2025 2 commits
  20. 05 Jun, 2025 2 commits
  21. 04 Jun, 2025 1 commit
    • Matthew Douglas's avatar
      Deprecation cleanup (#1669) · 849d9449
      Matthew Douglas authored
      * Deprecation cleanup: remove histogram_scatter_add_2d
      
      * Deprecation cleanup: vectorwise_mm_dequant
      
      * Deprecation cleanup: vectorwise_quant
      
      * Remove unused test
      
      * Optimizer test cleanup
      
      * Deprecations: remove estimate_quantiles, create_quantile_map
      
      * Move deprecated test
      849d9449
  22. 03 Jun, 2025 3 commits
  23. 02 Jun, 2025 3 commits
    • Matthew Douglas's avatar
      Add CPU + IPEX to nightly CI (#1667) · 318a86e3
      Matthew Douglas authored
      * Tests: add linux x64 cpu+ipex to nightly CI workflow
      
      * typo
      
      * Tests: guard linear8bit compile test for ipex cpu issue
      318a86e3
    • Matthew Douglas's avatar
      Fix CI regression (#1666) · 945f7c1d
      Matthew Douglas authored
      * Tests: xfail opcheck for 4bit quantization with floating storage dtypes
      
      * Tests: xfail opcheck for 4bit quantization with floating storage dtypes
      
      * Tests: skip test_gemv_eye_4bit on CPU with bf16 when not supported by torch
      
      * Tests: skip test_gemv_eye_4bit on CPU with bf16 when not supported by torch
      945f7c1d
    • Matthew Douglas's avatar
      Bump dev version · a2a74ede
      Matthew Douglas authored
      a2a74ede
  24. 28 May, 2025 1 commit