- 21 Sep, 2023 1 commit
-
-
Casper Hansen authored
-
- 20 Sep, 2023 30 commits
-
-
Casper Hansen authored
-
Casper authored
-
Casper authored
-
Casper authored
Refactor quantization code
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
-
Casper authored
-
Casper authored
-
Casper authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
Support kv_heads
-
Casper authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
2x faster context processing with GEMV
-
- 19 Sep, 2023 3 commits
-
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
- 15 Sep, 2023 6 commits
-
-
Casper authored
Safetensors and model sharding
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
Allow user to use custom calibration data for quantization
-
Casper authored
-
Casper authored
-