- 08 Sep, 2023 9 commits
-
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
- 07 Sep, 2023 2 commits
- 06 Sep, 2023 15 commits
-
-
Casper authored
-
Casper authored
-
Casper authored
-
Casper authored
YaRN support for LLaMa models
-
Casper authored
[BUG] Fix illegal memory access + Quantized Multi-GPU support
-
Casper authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
- 05 Sep, 2023 10 commits
-
-
Casper authored
Implement batch size for speed test
-
Casper Hansen authored
-
Casper Hansen authored
This reverts commit 4fe9974a.
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
support speedtest to benchmark FP16 model
-
Casper Hansen authored
-
Zhen Wan authored
-
- 04 Sep, 2023 4 commits
-
-
Casper Hansen authored
-
Casper Hansen authored
-
Casper authored
fuse_layers bug fix
-
qwopqwop200 authored
-