- 24 May, 2025 1 commit
-
-
Matthew Douglas authored
* Start cleaning up docs * Remove page * Minor update * correction * Minor doc revisions * Update installation.mdx * Update _toctree.yml
-
- 05 Dec, 2024 1 commit
-
-
Matthew Douglas authored
* Start of int8 refactor: remove col32/col_ampere/col_turing transforms in new igemmlt implementation * Fix unintended change * New naive mm_dequant kernel for row-major; cleanup * fix * int8 refactor: initial sparse decomp, cleanup * Int8 refactoring: remove separate NO_CUBLASLT build; more cleanup * int8: inference optimizations, some cleanup * int8: more tests passing, cleanup * int8 - more cleanup, most tests passing * int8: specify CUDA stream for int8 ops * perf: reduce overhead from getting cudaStream ptr * Mark some functions for deprecation. * int8 sparse decomp: small perf improvement * update setup.py * Update bitsandbytes/autograd/_functions.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * Update bitsandbytes/functional.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * Update bitsandbytes/functional.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * Update bitsandbytes/research/autograd/_functions.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * int8 - perf improvement for sparse decomposition inference; deprecate get_tensor_stream() in favor of new private fn * int8 cleanup * Ignore ruff rule ISC001 (incompatible with formatter) * add comment * int8 more cleanup * Update bitsandbytes/functional.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * int8: rename / deprecate old fn signatures * Update bitsandbytes/functional.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * type annotation * format update * Update bitsandbytes/research/autograd/_functions.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * cleanup * Add comment to explain division optimization * more cleanup * Update bitsandbytes/functional.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * Update bitsandbytes/functional.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * Update bitsandbytes/functional.py Co-authored-by:
Aarni Koskela <akx@iki.fi> * cleanup * Type annotations, cleanup * remove unused kernels; improved type annotations * small perf optimization for single-GPU systems * small perf optimization for single-GPU systems * update docstrings * Improve docs and tests * Update docstring * Update test * add benchmarking script * test cleanup: add deprecated marker, move benchmarks out * Add int8 dequant function; misc improvements * int8 matmul fallback for inner dims not divisible by 4 * improve register usage of kInt8VectorQuant - especially for A100/H100 * disable fail-fast for package build * maxwell compat * ptxas verbose * docs update * doc update * backward fix * Bugfix sparse decomp * Int8 fix for PEFT OLoRA init * Fix test for deprecated spmm_coo * test improvement * doc update * typo * doc cleanup * docs * add inference benchmark script * Add benchmarks, doc update --------- Co-authored-by:
Aarni Koskela <akx@iki.fi>
-
- 20 Sep, 2024 1 commit
-
-
Matthew Douglas authored
* Add AdEMAMix optimizer * Add PagedAdEMAMix32bit, AdEMAMix32bit * Add PagedAdEMAMix32bit, AdEMAMix32bit * AdEMAMix: add support for alpha/beta3 scheduling * Update paged AdEMAMix
-
- 29 Aug, 2024 1 commit
-
-
Titus von Koeller authored
-
- 26 Mar, 2024 1 commit
-
-
Steven Liu authored
-
- 15 Mar, 2024 1 commit
-
-
Steven Liu authored
* optim, integration * toctree * feedback
-
- 07 Mar, 2024 1 commit
-
-
Steven Liu authored
* optims * fix path * fix path * mdx * fix path * toctree * fix * optimizer, adagrad * add init * add * more apis * params * clarify * run pre-commit hooks --------- Co-authored-by:Titus von Koeller <9048635+Titus-von-Koeller@users.noreply.github.com>
-
- 21 Feb, 2024 1 commit
-
-
Steven Liu authored
-
- 04 Feb, 2024 1 commit
-
-
Titus authored
* add optional dependency for preview to environment.yml * Add additional sections, first optimizers, MacOS WIP * drafting + refactoring new docs * some changes * run pre-commit hooks * add mention of pre-commit to contributing * fix * test autodoc * new additions * add subtilte * add some content * add more methods * fix * further docs updates * Update _toctree.yml * fix link * run pre-commit hooks * refactor + further docs * Update README.md with new docs link Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * list of blog posts Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * list of blog posts Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * accept change suggestion Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * accept suggestion Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * accept suggestion Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/integrations.mdx Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * index instead of intro * fixup README, add docs link * add instructions for creating docstrings * final polish (except integrations) * fill out integrations section --------- Co-authored-by:
younesbelkada <younesbelkada@gmail.com> Co-authored-by:
Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
-
- 01 Feb, 2024 1 commit
-
-
Aarni Koskela authored
-
- 12 Jan, 2024 1 commit
-
-
Titus authored
-