- 26 Nov, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 24 Nov, 2023 2 commits
-
-
Jing Zhang authored
* Support CUDA build on Windows
* Enable dynamic NumGPU allocation for Windows
-
Jongwook Choi authored
When CUDA peer access is enabled, multi-GPU inference produces garbage output. This is a known bug in llama.cpp (or NVIDIA). Until the upstream bug is fixed, we can temporarily disable CUDA peer access to ensure correct output. See #961.
-
- 22 Nov, 2023 2 commits
-
-
Jeffrey Morgan authored
-
Michael Yang authored
-
- 21 Nov, 2023 2 commits
-
-
Michael Yang authored
-
Jeffrey Morgan authored
-
- 20 Nov, 2023 3 commits
-
-
Michael Yang authored
-
Purinda Gunasekara authored
-
Jeffrey Morgan authored
-
- 19 Nov, 2023 2 commits
-
-
Jeffrey Morgan authored
-
Bruce MacDonald authored
-
- 17 Nov, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 10 Nov, 2023 1 commit
-
-
Jeffrey Morgan authored
* add `"format": "json"` as an API parameter
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
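As a rough illustration of the parameter this commit describes, the sketch below builds a request body that includes `"format": "json"`. The helper name `build_generate_request` is hypothetical, and the `model` and `prompt` fields are assumed from the public generate API rather than taken from this commit.

```python
import json


def build_generate_request(model, prompt, json_mode=False):
    """Build a JSON request body (hypothetical helper, not from the commit).

    Assumption: the body carries "model" and "prompt" fields; only the
    "format": "json" parameter is named in the commit message.
    """
    body = {"model": model, "prompt": prompt}
    if json_mode:
        # The parameter added by this commit.
        body["format"] = "json"
    return json.dumps(body)


if __name__ == "__main__":
    print(build_generate_request("llama2", "List three colors as JSON.", json_mode=True))
```

Leaving `json_mode` off omits the field entirely, so existing callers would be unaffected under this assumption.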
-
- 09 Nov, 2023 2 commits
-
-
Bruce MacDonald authored
-
Michael Yang authored
instead of static number of parameters for each model family, get the real number from the tensors (#1022)
* parse tensor info
* refactor decoder
* return actual parameter count
* explicit rounding
* s/Human/HumanNumber/
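The idea in this commit, counting parameters from the tensors and then rounding for display, can be sketched as follows. This is not the project's code: `parameter_count` and `human_number` are hypothetical names, and the K/M/B thresholds and rounding rule are assumptions made for illustration.

```python
from math import prod


def parameter_count(tensor_shapes):
    """Sum the number of elements across all tensor shapes."""
    return sum(prod(shape) for shape in tensor_shapes)


def human_number(n):
    """Format a count with a K/M/B suffix, rounding explicitly.

    Assumption: round to the nearest whole unit; the real formatter
    may keep decimals or use different thresholds.
    """
    if n >= 1_000_000_000:
        return f"{round(n / 1_000_000_000)}B"
    if n >= 1_000_000:
        return f"{round(n / 1_000_000)}M"
    if n >= 1_000:
        return f"{round(n / 1_000)}K"
    return str(n)


if __name__ == "__main__":
    shapes = [(4096, 4096), (4096,), (32000, 4096)]
    total = parameter_count(shapes)
    print(total, human_number(total))
```

Deriving the count from the tensor shapes means the reported size follows the actual file contents instead of a fixed per-family table.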
-
- 04 Nov, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 03 Nov, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 02 Nov, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 31 Oct, 2023 1 commit
-
-
Michael Yang authored
-
- 27 Oct, 2023 3 commits
-
-
Jeffrey Morgan authored
-
Bruce MacDonald authored
-
Bruce MacDonald authored
-
- 24 Oct, 2023 3 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 23 Oct, 2023 3 commits
-
-
Michael Yang authored
pin to 9e70cc03229df19ca2d28ce23cc817198f897278 for now since 438c2ca83045a00ef244093d27e9ed41a8cb4ea9 is breaking
-
Michael Yang authored
-
Michael Yang authored
ggufv3 adds support for big endianness, mainly for the s390x architecture. while that's not currently supported for ollama, the change is simple. loosen the version check to be more forward compatible. unless specified, gguf versions other than v1 will be decoded as v2.
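A minimal sketch of the loosened version check described above, assuming a little-endian header that starts with the `GGUF` magic followed by a 32-bit version. The function names are hypothetical and the real decoder is not shown in this log.

```python
import struct


def read_gguf_version(header: bytes) -> int:
    """Read the container version from the first 8 bytes of a file.

    Assumption: little-endian layout, 4-byte magic then uint32 version.
    Big-endian files (the v3 addition) are not handled in this sketch.
    """
    magic, version = struct.unpack("<4sI", header[:8])
    if magic != b"GGUF":
        raise ValueError("not a GGUF file")
    return version


def decoder_version(version: int) -> int:
    """Pick the decoder layout: v1 stays v1, anything else uses v2."""
    return 1 if version == 1 else 2


if __name__ == "__main__":
    header = b"GGUF" + struct.pack("<I", 3)
    print(decoder_version(read_gguf_version(header)))
```

Mapping every version other than v1 to the v2 decoder is what makes the check forward compatible: a new version number no longer causes an immediate rejection.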
-
- 19 Oct, 2023 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
add error for falcon and starcoder vocab compatibility
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-
- 18 Oct, 2023 6 commits
-
-
Arne Müller authored
-
Arne Müller authored
-
Arne Müller authored
-
Bruce MacDonald authored
-
Arne Müller authored
-
Arne Müller authored
-
- 17 Oct, 2023 3 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Bruce MacDonald authored
-