- 21 Sep, 2023 9 commits
-
-
Casper Hansen authored
-
Casper authored
-
Casper authored
Improve model loading
-
Casper authored
support windows
-
Casper authored
-
Casper Hansen authored
-
Casper Hansen authored
-
qwopqwop200 authored
-
Casper authored
-
- 20 Sep, 2023 27 commits
-
-
Casper authored
Refactor quantization code
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
-
Casper authored
-
Casper authored
-
Casper authored
-
Casper authored
-
Casper authored
-
Casper authored
Support kv_heads
-
Casper authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
2x faster context processing with GEMV
-
- 19 Sep, 2023 3 commits
-
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
- 15 Sep, 2023 1 commit
-
-
Casper authored
Safetensors and model sharding
-