- 08 Sep, 2025 2 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
* Add parametrize util for targeting parameters outside of nn.Linear modules
* Parametrize 4bit: replace existing prequantized weight
* cleanup
* Add caching for parametrization
* Add tests
* Fix tests
* Guard for torch < 2.5
* Guard for torch < 2.5
* Another test guard for torch >= 2.5
-
- 03 Sep, 2025 2 commits
-
-
kaixuanliu authored
* For the Intel XPU case, use MatMul8bitFp even when IPEX is not used
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
* fix lint issue
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
---------
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
-
jiqing-feng authored
* add int mm for xpu after torch 2.9
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* add packaging on pyproject
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
-
- 02 Sep, 2025 1 commit
-
-
Yuanyuan Chen authored
* Fix unused variable warnings and other ruff warnings
Signed-off-by: cyy <cyyever@outlook.com>
* Fix format
Signed-off-by: cyy <cyyever@outlook.com>
---------
Signed-off-by: cyy <cyyever@outlook.com>
-
- 25 Aug, 2025 1 commit
-
-
Yuanyuan Chen authored
Signed-off-by: cyy <cyyever@outlook.com>
-
- 11 Aug, 2025 4 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
-
Matthew Douglas authored
-
Matthew Douglas authored
-
- 06 Aug, 2025 2 commits
-
-
Matthew Douglas authored
[CUDA] Fixing quantization uint8 packing bug for NF4 and FP4
-
Matthew Douglas authored
Fix Params4bit tensor subclass handling
-
- 04 Aug, 2025 1 commit
-
-
ved1beta authored
-
- 02 Aug, 2025 2 commits
-
-
ved1beta authored
-
Mohamed Hisham authored
-
- 31 Jul, 2025 1 commit
-
-
ved1beta authored
-
- 21 Jul, 2025 4 commits
-
-
Matthew Douglas authored
Add Volta support in cu128/cu129 builds
-
Matthew Douglas authored
-
Matthew Douglas authored
Create FUNDING.yml
-
Matthew Douglas authored
-
- 14 Jul, 2025 11 commits
-
-
Matthew Douglas authored
-
Matthew Douglas authored
Add kernel registration for 8bit and 32bit optimizers
-
Egor Krivov authored
-
Egor Krivov authored
-
Egor Krivov authored
-
Egor Krivov authored
-
Egor Krivov authored
-
Egor Krivov authored
-
Egor Krivov authored
-
Egor Krivov authored
-
Egor Krivov authored
-
- 11 Jul, 2025 2 commits
-
-
Egor Krivov authored
-
Egor Krivov authored
-
- 08 Jul, 2025 2 commits
-
-
Matthew Douglas authored
[XPU] Add inference benchmark for XPU
-
Matthew Douglas authored
fix log
-
- 03 Jul, 2025 1 commit
-
-
jiqing-feng authored
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
-
- 02 Jul, 2025 1 commit
-
-
Egor Krivov authored
-
- 01 Jul, 2025 3 commits
-
-
Michał Górny authored
* Automatically call CMake as part of PEP 517 build

Call CMake and build the CPU extension when invoking the build via a PEP 517 backend, to ensure that at least some extension is built when users are building from source. This improves consistency with other Python packages, and reduces the risk of accidents.

We are using `scikit-build-core` setuptools plugin to take care of CMake dependencies and call into CMake. However, we need to modify the `build_py` command to ensure that CMake is called prior to the setuptools command, as otherwise the newly built shared library won't be picked up by `build_py`.

Since setuptools is still responsible for collecting the Python package, it also collects all other shared libraries that were built earlier, for example via manual CMake calls as done in the CI pipeline.

Furthermore, if the user does not have `scikit-build-core` installed and calls `setup.py` directly, we output a warning but continue working as before.

The logic can be further extended in the future, for example to detect the best COMPUTE_BACKEND default.

Fixes #1511

* Include C sources and build files in source distribution
* Fix formatting
-
Matthew Douglas authored
* Add torch 2.8 rc / 2.9 nightly to tests
* Update tests.yml
* Update tests.yml
-
Matthew Douglas authored
-