- 06 Sep, 2023 13 commits
-
-
Casper authored
-
Casper authored
YaRN support for LLaMa models
-
Casper authored
[BUG] Fix illegal memory access + Quantized Multi-GPU support
-
Casper authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
- 05 Sep, 2023 10 commits
-
-
Casper authored
Implement batch size for speed test
-
Casper Hansen authored
-
Casper Hansen authored
This reverts commit 4fe9974a.
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
support speedtest to benchmark FP16 model
-
Casper Hansen authored
-
Zhen Wan authored
-
- 04 Sep, 2023 4 commits
-
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
fuse_layers bug fix
-
qwopqwop200 authored
-
- 02 Sep, 2023 6 commits
-
-
Casper authored
Refactor fused modules
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
- 01 Sep, 2023 7 commits