Commits · 4f1afd575d1dfd803b0d9abb995862d61e8d0734 · OpenDAS / ollama

22 Jul, 2024 2 commits

host · 4f1afd57
Michael Yang authored Jul 03, 2024

4f1afd57

Remove no longer supported max vram var · cc269ba0

Daniel Hiltgen authored Jul 22, 2024

The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM
scenarios. With Concurrency this was no longer wired up, and the simplistic
value doesn't map to multi-GPU setups. Users can still set `num_gpu`
to limit memory usage to avoid OOM if we get our predictions wrong.

cc269ba0

14 Jul, 2024 1 commit
- remove template (#5655) · 057d3186
  Patrick Devine authored Jul 13, 2024
  
  057d3186
28 Jun, 2024 2 commits
- Include Show Info in Interactive (#5342) · 5f034f5b
  royjhan authored Jun 28, 2024
  
  5f034f5b
- Ollama Show: Check for Projector Type (#5307) · b910fa90
  royjhan authored Jun 28, 2024
```
* Check exists projtype

* Maintain Ordering
```
  b910fa90
27 Jun, 2024 1 commit
- zip: prevent extracting files into parent dirs (#5314) · 123a722a
  Michael Yang authored Jun 26, 2024
  
  123a722a
25 Jun, 2024 1 commit

cmd: defer stating model info until necessary (#5248) · 2aa91a93

Blake Mizerany authored Jun 24, 2024

This commit changes the 'ollama run' command to defer fetching model
information until it really needs it. That is, when in interactive mode.

It also removes one such case where the model information is fetch in
duplicate, just before calling generateInteractive and then again, first
thing, in generateInteractive.

This positively impacts the performance of the command:

    ; time ./before run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.168 total
    ; time ./before run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.220 total
    ; time ./before run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.217 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./after run llama3 'hi'  0.02s user 0.01s system 4% cpu 0.652 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./after run llama3 'hi'  0.01s user 0.01s system 5% cpu 0.498 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with or would you like to chat?

    ./after run llama3 'hi'  0.01s user 0.01s system 3% cpu 0.479 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./after run llama3 'hi'  0.02s user 0.01s system 5% cpu 0.507 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./after run llama3 'hi'  0.02s user 0.01s system 5% cpu 0.507 total

2aa91a93

19 Jun, 2024 1 commit

Extend api/show and ollama show to return more model info (#4881) · fedf7163

royjhan authored Jun 19, 2024



* API Show Extended

* Initial Draft of Information
Co-Authored-By: Patrick Devine <pdevine@sonic.net>

* Clean Up

* Descriptive arg error messages and other fixes

* Second Draft of Show with Projectors Included

* Remove Chat Template

* Touches

* Prevent wrapping from files

* Verbose functionality

* Docs

* Address Feedback

* Lint

* Resolve Conflicts

* Function Name

* Tests for api/show model info

* Show Test File

* Add Projector Test

* Clean routes

* Projector Check

* Move Show Test

* Touches

* Doc update

---------
Co-authored-by: Patrick Devine <pdevine@sonic.net>

fedf7163

12 Jun, 2024 1 commit
- move OLLAMA_HOST to envconfig (#5009) · c69bc19e
  Patrick Devine authored Jun 12, 2024
  
  c69bc19e
04 Jun, 2024 4 commits
- nolintlint · 201d853f
  Michael Yang authored May 22, 2024
  
  201d853f
- lint · e40145a3
  Michael Yang authored May 21, 2024
  
  e40145a3
- nolintlint · 8ffb5174
  Michael Yang authored May 21, 2024
  
  8ffb5174
- replace x/exp/slices with slices · 04f3c12b
  Michael Yang authored May 21, 2024
  
  04f3c12b
30 May, 2024 3 commits

replaced duplicate call with variable · 914f68f0
Josh Yan authored May 30, 2024

914f68f0
fixed japanese characters deleted at end of line · bd1d119b
Josh Yan authored May 30, 2024

bd1d119b

Fix OLLAMA_LLM_LIBRARY with wrong map name and add more env vars to help message (#4663) · a03be181

Lei Jitang authored May 31, 2024



* envconfig/config.go: Fix wrong description of OLLAMA_LLM_LIBRARY
Signed-off-by: Lei Jitang <leijitang@outlook.com>

* serve: Add more env to help message of ollama serve

Add more enviroment variables to `ollama serve --help`
to let users know what can be configurated.
Signed-off-by: Lei Jitang <leijitang@outlook.com>

---------
Signed-off-by: Lei Jitang <leijitang@outlook.com>

a03be181

24 May, 2024 1 commit
- Move envconfig and consolidate env vars (#4608) · 4cc3be30
  Patrick Devine authored May 24, 2024
  
  4cc3be30
20 May, 2024 2 commits
- add fixes for llama · d355d202
  Patrick Devine authored May 08, 2024
  
  d355d202
- Move the parser back + handle utf16 files (#4533) · ccdf0b2a
  Patrick Devine authored May 20, 2024
  
  ccdf0b2a
18 May, 2024 1 commit
- add OLLAMA_NOHISTORY to turn off history in interactive mode (#4508) · 105186aa
  Patrick Devine authored May 18, 2024
  
  105186aa
16 May, 2024 3 commits
- removed comment · 3d90156e
  Josh Yan authored May 16, 2024
  
  3d90156e
- go fmt'd cmd.go · 26bfc1c4
  Josh Yan authored May 15, 2024
  
  26bfc1c4
- go fmt'd cmd.go · 799aa988
  Josh Yan authored May 15, 2024
  
  799aa988
15 May, 2024 2 commits
- updated double-width display · c9e584fb
  Josh Yan authored May 15, 2024
  
  c9e584fb
- fixed width and word count for double spacing · 17b1e81c
  Josh Yan authored May 15, 2024
  
  17b1e81c
14 May, 2024 2 commits
- fix keepalive for non-interactive mode (#4438) · c344da4c
  Patrick Devine authored May 14, 2024
  
  c344da4c
- Ollama `ps` command for showing currently loaded models (#4327) · 68459888
  Patrick Devine authored May 13, 2024
  
  68459888
13 May, 2024 2 commits
- removed inconsistencies · f8464785
  Josh Yan authored May 13, 2024
  
  f8464785
- removed inconsistent punctuation · 91a090a4
  Josh Yan authored May 13, 2024
  
  91a090a4
11 May, 2024 1 commit
- fix `ollama create`'s usage string (#4362) · 8080fbce
  todashuta authored May 12, 2024
  
  8080fbce
10 May, 2024 1 commit

Use `--quantize` flag and `quantize` api parameter (#4321) · 6602e793

Jeffrey Morgan authored May 10, 2024



* rename `--quantization` to `--quantize`

* backwards

* Update api/types.go
Co-authored-by: Michael Yang <mxyng@pm.me>

---------
Co-authored-by: Michael Yang <mxyng@pm.me>

6602e793

06 May, 2024 1 commit
- close server on receiving signal (#4213) · 39d9d22c
  Jeffrey Morgan authored May 06, 2024
  
  39d9d22c
01 May, 2024 4 commits
- server: target invalid · 45b6a12e
  Michael Yang authored May 01, 2024
  
  45b6a12e
- rename parser to model/file · 119589fc
  Michael Yang authored Apr 30, 2024
  
  119589fc
- cmd: import regexp · 5ea84496
  Michael Yang authored May 01, 2024
  
  5ea84496
- parser: add commands format · 176ad3aa
  Michael Yang authored Apr 24, 2024
  
  176ad3aa
30 Apr, 2024 1 commit

prompt to display and add local ollama keys to account (#3717) · 0a7fdbe5

Bruce MacDonald authored Apr 30, 2024

- return descriptive error messages when unauthorized to create blob or push a model
- display the local public key associated with the request that was denied

0a7fdbe5

29 Apr, 2024 1 commit
- better checking for OLLAMA_HOST variable (#3661) · 9009bedf
  Patrick Devine authored Apr 29, 2024
  
  9009bedf
26 Apr, 2024 1 commit
- check file type before zip · 41e03ede
  Michael Yang authored Apr 25, 2024
  
  41e03ede
24 Apr, 2024 1 commit
- only replace if it matches command · ac0801ec
  Michael Yang authored Apr 24, 2024
  
  ac0801ec