- 23 Sep, 2025 1 commit
-
-
Matthew Douglas authored
* CUDA 13 build enablement * Try to fix Windows build workflow * Add torch 2.9+cu130 to tests * Fix python version * Update test workflow * Don't test CPU on torch 2.9 yet * Update doc
-
- 16 Sep, 2025 1 commit
-
-
Matthew Douglas authored
* Bump minimum PyTorch to 2.3 * Tests: Fix Windows numpy<2 compatibility for torch<2.4.1
-
- 15 Sep, 2025 1 commit
-
-
Liu Xiaoli authored
* Add SYCL Kernels for XPU backend * fix transpose Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix log and format Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * revert cpu changes Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * clean ipex_xpu Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * clean ipex import Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix ipex cpu import Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix typo Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix comments Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * refine gemv_4bit kernel * enable FP4 for dequant_4bit and gemv_4bit * refine FP4 dequantization performance * remove check for better performance Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix doc Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * clean code * fix tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * rm comments Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix memory issue * fix ut failure * adjust threshold Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix xpu check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * change test_functional check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix test_module Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix device check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * Enable Windows build and refine code * fix xpu log Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * remove ipex entirely Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix cpu int8 CB Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix lint Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix logs (#12) * fix logs Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * Fix sycl lint error and tests (#13) * fix sycl nd Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * skip typo check for xpu kernel codes (#14) * skip test for xpu ops Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix lint Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * skip typo for xpu Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * skip Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * skip Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * register triton kernel for quantization (#15) Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * Fix version comparison issue (#18) # Description The version comparison expression miss reference the .release property from the version object. This lead to compare between the tuple and the string # Error message ``` The 8-bit optimizer is not available on your device, only available on CUDA for now. 🦥 Unsloth: Will patch your computer to enable 2x faster free finetuning. Traceback (most recent call last): File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/unsloth_validation/run.py", line 1, in <module> import unsloth File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/__init__.py", line 235, in <module> from .models import * File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/models/__init__.py", line 15, in <module> from .llama import FastLlamaModel File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/models/llama.py", line 23, in <module> from ._utils import * File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth/models/_utils.py", line 89, in <module> from unsloth_zoo.patching_utils import ( File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/unsloth_zoo/patching_utils.py", line 629, in <module> import transformers.integrations.bitsandbytes File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/v/lib/python3.10/site-packages/transformers/integrations/bitsandbytes.py", line 20, in <module> import bitsandbytes as bnb File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/bitsandbytes/bitsandbytes/__init__.py", line 39, in <module> from .backends.xpu import ops as xpu_ops File "/home/erxin/jenkins/workspace/Unsloth_Benchmark/bitsandbytes/bitsandbytes/backends/xpu/ops.py", line 17, in <module> if version.parse(torch.__version__).release >= version.parse("2.9"): TypeError: '>=' not supported between instances of 'tuple' and 'Version' ``` --------- Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> Co-authored-by:
jiqing-feng <jiqing.feng@intel.com> Co-authored-by:
Er-Xin (Edwin) Shang <shangerxin@hotmail.com>
-
- 09 Sep, 2025 1 commit
-
-
Matthew Douglas authored
* Test suite improvements for MPS/XPU/HPU * Skip test on torch==2.8.0+cpu for Windows regression
-
- 11 Aug, 2025 2 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 01 Jul, 2025 1 commit
-
-
Matthew Douglas authored
* Add torch 2.8 rc / 2.9 nightly to tests * Update tests.yml * Update tests.yml
-
- 30 Jun, 2025 1 commit
-
-
Matthew Douglas authored
-
- 27 Jun, 2025 1 commit
-
-
Matthew Douglas authored
* Add CUDA 12.9 to build/test workflows * Downgrade Jimver/cuda-toolkit to v0.2.24 * Update python-package.yml * Update python-package.yml * Update python-package.yml * Update tests.yml * Update tests.yml
-
- 20 Jun, 2025 1 commit
-
-
pnunna93 authored
* Port ROCm changes from multi-backend-refactor branch * Update ops.py * Update functional.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update functional.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update functional.py * Update functional.py * Update functional.py * Update functional.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update ops.py * Update functional.py * Update functional.py * Update functional.py * Update test_ops.py * Update test_functional.py * Update test_ops.py * Update test_functional.py * Update test_functional.py * Update functional.py * Update functional.py * Update ops.py * Update ops.py * Update test_functional.py * Update test_functional.py * Update cextension.py * Update cuda_specs.py * Update cuda_specs.py * Update test_functional.py * Update test_linear4bit.py * Update test_cuda_setup_evaluator.py * Update test_functional.py * Update modules.py * Update modules.py * Update ops.py * Update test_linear4bit.py * Update ops.py * Update ops.py * Update test_linear4bit.py * Update test_linear4bit.py * Update python-package.yml * Update python-package.yml * Update python-package.yml * Update python-package.yml * Create build-rocm.sh * Update cuda_specs.py * Fix trailing whitespace * Remove conflicts.diff * update for hipblasVersionMajor >=3 * Update test_functional.py * Update test_linear4bit.py * Update test_ops.py * Update main.py * Update test_functional.py * Update test_linear4bit.py * Update test_ops.py * Update test_linear4bit.py * Lint * Lint * Update helpers.py * Update test_functional.py * Update test_linear4bit.py * Update test_ops.py * Lint * Update pythonInterface.cpp * lint fix * lint * Update pythonInterface.cpp * revert permissions change * Fix indentation * Update kernels_hip.cuh * Update kernels.hip * Update ops.hip * Update ops_hip.cuh * Update kernels_hip.cuh * Update kernels.hip * Update kernels.hip * Update ops.hip * Update ops_hip.cuh * Update ops.hip * Update CMakeLists.txt * Update functional.py * Update cextension.py * Update cextension.py --------- Co-authored-by:
MISHANMAURYA <118961433+MISHANMAURYA@users.noreply.github.com> Co-authored-by:
MISHANMAUYRA <mishanmaurya31081@gmail.com> Co-authored-by:
amcamd <andrew.chapman@amd.com> Co-authored-by:
Prasanth Nunna <root@banff-cyxtera-s78-1.amd.com>
-
- 17 Jun, 2025 1 commit
-
-
Matthew Douglas authored
* Setup XPU CI * CI: expand XPU matrix * test * test * test * test * test * test * test * test * test * test * skip some fp4 tests on hpu * skip some fp4 tests on hpu * skip gemv tests on hpu * test * Additional test patches for HPU * HPU test update * HPU test update * HPU test update * HPU test update * Format
-
- 05 Jun, 2025 1 commit
-
-
Matthew Douglas authored
-
- 02 Jun, 2025 1 commit
-
-
Matthew Douglas authored
* Tests: add linux x64 cpu+ipex to nightly CI workflow * typo * Tests: guard linear8bit compile test for ipex cpu issue
-
- 24 May, 2025 2 commits
-
-
Matthew Douglas authored
* Add torch.compile tests * Tests: WA aarch64 CPU regressions for torch 2.6.0; add Windows torch==2.7.0+cu118 test config * Tests: skip torch.compile for cuda on windows
-
Matthew Douglas authored
* General cleanup & test improvements * Tests: WA numpy 2 compat issue for torch<2.3 * Tests: update aarch64 cpu min torch version * Tests: update aarch64 cpu min torch version * Tests: update aarch64 cpu min torch version
-
- 19 May, 2025 3 commits
-
-
Matthew Douglas authored
* Test g5g runner * Switch L4 to L40S runner; swap GitHub Linux T4 runner for AWS g4dn * Run tests on last 2 pytorch stable releases * Run tests on last 2 pytorch stable releases
-
Titus von Koeller authored
-
Titus von Koeller authored
-
- 16 May, 2025 5 commits
-
-
Titus von Koeller authored
-
Titus von Koeller authored
-
Titus von Koeller authored
-
Titus von Koeller authored
-
Titus von Koeller authored
-
- 15 May, 2025 1 commit
-
-
Titus von Koeller authored
-
- 14 May, 2025 1 commit
-
-
Matthew Douglas authored
* Improvements for testing suite * Add workflow for macOS arm64 CPU tests * Update tests.yml * Update tests.yml Use new L4 and CPU runners for testing. * Update tests.yml
-
- 13 May, 2025 1 commit
-
-
Matthew Douglas authored
* Improvements for testing suite * Add workflow for macOS arm64 CPU tests
-
- 08 May, 2025 4 commits
-
-
Matthew Douglas authored
Show slow test durations.
-
Matthew Douglas authored
Fix trailing whitespace.
-
Titus von Koeller authored
-
Titus von Koeller authored
-
- 05 May, 2025 2 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 02 May, 2025 2 commits
-
-
Matthew Douglas authored
* Add aarch64 cpu tests and CUDA build to nightly workflow * aarch64: limit CUDA targets to sm75, sm80, sm90, sm100 * aarch64: limit CUDA targets to sm75, sm80, sm90, sm100 * Update build cpu script * fix * Update auditwheel for aarch64
-
Johnny authored
* Update python-package.yml * Update python-package.yml * Update python-package.yml * Cleanup * Matrix update --------- Co-authored-by:Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
-
- 29 Apr, 2025 1 commit
-
-
Matthew Douglas authored
* Run unit tests on GH Actions * fix * fix * trigger workflow * Update * Update * Update * Run tests nightly * Disable paged optimizer test on Windows * Skip unit tests on Windows for CUDA 12.x (driver on runner is too old)
-
- 22 Apr, 2025 1 commit
-
-
Matthew Douglas authored
* Stop building for CUDA toolkit < 11.8 * Simplify * Drop sm70 from cu128 build targets to align with pytorch
-
- 07 Apr, 2025 1 commit
-
-
Titus authored
-
- 27 Mar, 2025 2 commits
-
-
Matthew Douglas authored
* Drop Python 3.8 support. * Formatting
-
Matthew Douglas authored
-
- 25 Feb, 2025 1 commit
-
-
Matthew Douglas authored
-