Commits · dca66862736dff75809bc0649ae1aff68c077c1c · OpenDAS / ollama

15 Oct, 2023 1 commit
- add steps for creating a Modelfile and more example commands to `import.md` · dca66862
  Jeffrey Morgan authored Oct 15, 2023
14 Oct, 2023 3 commits
- add push script for docker images · 598621af
  Jeffrey Morgan authored Oct 14, 2023
- Merge pull request #773 from jmorganca/mattw/howtoquant · 6479f49c
  Matt Williams authored Oct 14, 2023
```
add how to quantize doc
```
- applied mikes comments · b2974a70
  Matt Williams authored Oct 14, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
13 Oct, 2023 10 commits
- Use correct url for auto updates · 832b4db9
  Jeffrey Morgan authored Oct 13, 2023
- check update response (#785) · c43873f3
  Bruce MacDonald authored Oct 13, 2023
- Merge pull request #783 from jmorganca/mxyng/fix-gpu-offloading · d790bf99
  Michael Yang authored Oct 13, 2023
```
fix: offloading on low end GPUs
```
- do not use gpu binary when num_gpu == 0 · 35afac09
  Michael Yang authored Oct 13, 2023
- no gpu if vram < 2GB · 811c3d19
  Michael Yang authored Oct 13, 2023
- check for newer updates (#784) · 3553d107
  Bruce MacDonald authored Oct 13, 2023
```
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
```
- improve api error handling (#781) · 6fe17813
  Bruce MacDonald authored Oct 13, 2023
```
- remove new lines from llama.cpp error messages relayed to client
- check api option types and return error on wrong type
- change num layers from 95% VRAM to 92% VRAM
```
- use lower glibc versions in `Dockerfile.build` · d890890f
  Jeffrey Morgan authored Oct 13, 2023
- use Go `1.21.3` in `Dockerfile` · 89ba19fe
  Jeffrey Morgan authored Oct 12, 2023
- update `Dockerfile.build` for linux binary builds · 6f58c776
  Jeffrey Morgan authored Oct 12, 2023
12 Oct, 2023 11 commits
- update doc to refer to docker image · 3c975f89
  Matt Williams authored Oct 12, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
- add how to quantize doc · 9245c8a1
  Matt Williams authored Oct 12, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
- Merge pull request #770 from jmorganca/mxyng/fix-download · 7a537cdc
  Michael Yang authored Oct 12, 2023
```
fix download
```
- fix download · 257ffeb9
  Michael Yang authored Oct 12, 2023
- Merge pull request #753 from jmorganca/mattw/examplereorg · 9b513bb6
  Matt Williams authored Oct 12, 2023
```
rename the examples to be more descriptive
```
- final rename · 042100f7
  Matt Williams authored Oct 12, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
- validate api options fields from map (#711) · 7804b8fa
  Bruce MacDonald authored Oct 12, 2023
- relay model runner error message to client (#720) · 56497663
  Bruce MacDonald authored Oct 12, 2023
```
* give direction to user when runner fails
* also relay errors from timeout
* increase timeout to 3 minutes
```
- simple gen to simple · e1afcb8a
  Matt Williams authored Oct 11, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
- remove with · 385eeea3
  Matt Williams authored Oct 11, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
- add golang gen · 8a41b244
  Matt Williams authored Oct 11, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
11 Oct, 2023 14 commits
- fix relative links in `README.md` · 92578798
  Jeffrey Morgan authored Oct 11, 2023
- Merge pull request #760 from jmorganca/mxyng/more-downloads · 78863791
  Michael Yang authored Oct 11, 2023
```
Mxyng/more downloads
```
- download: handle inner errors · c413a550
  Michael Yang authored Oct 11, 2023
- dynamically size download parts based on file size · 630bb75d
  Michael Yang authored Oct 10, 2023
- update download · a2055a1e
  Michael Yang authored Oct 09, 2023
- add format bytes · b599946b
  Michael Yang authored Oct 11, 2023
- Merge pull request #757 from jmorganca/mxyng/format-time · aca2d65b
  Michael Yang authored Oct 11, 2023
```
cleanup format time
```
- cleanup format time · b5e08e33
  Michael Yang authored Oct 11, 2023
- optional parameter to not stream response (#639) · 274d5a5f
  Bruce MacDonald authored Oct 11, 2023
```
* update streaming request accept header
* add optional stream param to request bodies
```
- add ts alternate to python langchain simplegen · fc6b49be
  Matt Williams authored Oct 11, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
- prevent waiting on exited command (#752) · 77295f71
  Bruce MacDonald authored Oct 11, 2023
```
* prevent waiting on exited command
* close llama runner once
```
- cleanup readme. · 615f7d1d
  Matt Williams authored Oct 11, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
- rename dirs · cdf5e106
  Matt Williams authored Oct 11, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
- rename the models to be more descriptive · a85329f5
  Matt Williams authored Oct 10, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
10 Oct, 2023 1 commit
- improve vram safety with 5% vram memory buffer (#724) · f2ba1311
  Bruce MacDonald authored Oct 10, 2023
```
* check free memory not total
* wait for subprocess to exit
```