- 13 Mar, 2025 5 commits
-
-
Michael Yang authored
fix: error on models that don't support embeddings
-
Michael Yang authored
-
Michael Yang authored
ollama-debug.c: correct mistype
-
Parth Sareen authored
-
shane.xb.qian authored
* macOS has different definition per info from @mxyng
-
- 12 Mar, 2025 8 commits
-
-
ParthSareen authored
-
ParthSareen authored
-
ParthSareen authored
-
Bruce MacDonald authored
Softcap isn't in the whitepaper/implementation for the language model so we should remove it. There is no discernible difference in output with it removed.
-
Shane-XB-Qian authored
Signed-off-by:shane.xb.qian <shane.qian@foxmail.com>
-
shane.xb.qian authored
Signed-off-by:shane.xb.qian <shane.qian@foxmail.com>
-
frob authored
Co-authored-by:Richard Lyons <frob@cloudstaff.com>
-
Michael authored
-
- 11 Mar, 2025 27 commits
-
-
Michael Yang authored
engine: add gemma support
-
jmorganca authored
-
jmorganca authored
-
jmorganca authored
-
jmorganca authored
-
Michael Yang authored
-
Daniel Hiltgen authored
-
jmorganca authored
-
jmorganca authored
-
jmorganca authored
This reverts commit c7eae586b899083acebcd9b3847b89ea78c2850c.
-
Jesse Gross authored
This is useful for a few things: - Work around bugs, such as having 2 images in one batch - Keep the image in a single batch for fully connected attention - Improve performance by not evaluating embeddings multiple times
-
Jesse Gross authored
Currently we are using positions, which are relative to a sequence and may not be unique.
-
Jesse Gross authored
-
Jesse Gross authored
-
Michael Yang authored
-
Michael Yang authored
-
Patrick Devine authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Patrick Devine authored
-
Michael Yang authored
-
Michael Yang authored
-
Jesse Gross authored
-