Commits · 123a722a6f541e300bc8e34297ac378ebe23f527 · OpenDAS / ollama

27 Jun, 2024 1 commit
- zip: prevent extracting files into parent dirs (#5314) · 123a722a
  Michael Yang authored Jun 26, 2024
  
25 Jun, 2024 1 commit

cmd: defer stating model info until necessary (#5248) · 2aa91a93

Blake Mizerany authored Jun 24, 2024

This commit changes the 'ollama run' command to defer fetching model
information until it is actually needed, that is, in interactive mode.

It also removes one such case where the model information is fetched in
duplicate: once just before calling generateInteractive and then again,
first thing, in generateInteractive.

This roughly halves the command's wall-clock time, from about 1.2s to about 0.5s in the runs below:

    ; time ./before run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.168 total
    ; time ./before run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.220 total
    ; time ./before run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.217 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./after run llama3 'hi'  0.02s user 0.01s system 4% cpu 0.652 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./after run llama3 'hi'  0.01s user 0.01s system 5% cpu 0.498 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with or would you like to chat?

    ./after run llama3 'hi'  0.01s user 0.01s system 3% cpu 0.479 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./after run llama3 'hi'  0.02s user 0.01s system 5% cpu 0.507 total
    ; time ./after run llama3 'hi'
    Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?

    ./after run llama3 'hi'  0.02s user 0.01s system 5% cpu 0.507 total
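
The deferral described in this commit message can be illustrated with a minimal Go sketch. It is not the actual change: `modelInfo`, `fetchModelInfo`, `fetchCount`, and `run` are hypothetical names invented for illustration, and `sync.OnceValues` (Go 1.21+) is just one way to make sure the lookup happens at most once and only on the interactive path.

```go
package main

import (
	"fmt"
	"sync"
)

// modelInfo is a hypothetical stand-in for the model details the CLI needs.
type modelInfo struct {
	Name string
}

// fetchCount records how many simulated server round trips were made.
var fetchCount int

// fetchModelInfo simulates the expensive request for model information.
func fetchModelInfo(name string) (modelInfo, error) {
	fetchCount++
	return modelInfo{Name: name}, nil
}

// run fetches model information lazily: never for a one-shot prompt,
// and at most once in interactive mode even if it is asked for twice.
func run(name string, interactive bool) error {
	info := sync.OnceValues(func() (modelInfo, error) {
		return fetchModelInfo(name)
	})

	if !interactive {
		// One-shot prompt: the model information is never requested.
		return nil
	}

	// The caller and the interactive loop both ask for the info,
	// but only the first call performs the fetch.
	if _, err := info(); err != nil {
		return err
	}
	_, err := info()
	return err
}

func main() {
	_ = run("llama3", false)
	fmt.Println(fetchCount) // prints 0

	_ = run("llama3", true)
	fmt.Println(fetchCount) // prints 1
}
```

In this sketch the non-interactive path skips the round trip entirely, which is the kind of saving the timings above reflect.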


19 Jun, 2024 1 commit

Extend api/show and ollama show to return more model info (#4881) · fedf7163

royjhan authored Jun 19, 2024



* API Show Extended

* Initial Draft of Information
Co-Authored-By: Patrick Devine <pdevine@sonic.net>

* Clean Up

* Descriptive arg error messages and other fixes

* Second Draft of Show with Projectors Included

* Remove Chat Template

* Touches

* Prevent wrapping from files

* Verbose functionality

* Docs

* Address Feedback

* Lint

* Resolve Conflicts

* Function Name

* Tests for api/show model info

* Show Test File

* Add Projector Test

* Clean routes

* Projector Check

* Move Show Test

* Touches

* Doc update

---------
Co-authored-by: Patrick Devine <pdevine@sonic.net>


12 Jun, 2024 1 commit
- move OLLAMA_HOST to envconfig (#5009) · c69bc19e
  Patrick Devine authored Jun 12, 2024
  
04 Jun, 2024 4 commits
- nolintlint · 201d853f
  Michael Yang authored May 22, 2024
  
- lint · e40145a3
  Michael Yang authored May 21, 2024
  
- nolintlint · 8ffb5174
  Michael Yang authored May 21, 2024
  
- replace x/exp/slices with slices · 04f3c12b
  Michael Yang authored May 21, 2024
  
30 May, 2024 3 commits

- replaced duplicate call with variable · 914f68f0
  Josh Yan authored May 30, 2024
- fixed japanese characters deleted at end of line · bd1d119b
  Josh Yan authored May 30, 2024

Fix OLLAMA_LLM_LIBRARY with wrong map name and add more env vars to help message (#4663) · a03be181

Lei Jitang authored May 31, 2024



* envconfig/config.go: Fix wrong description of OLLAMA_LLM_LIBRARY
Signed-off-by: Lei Jitang <leijitang@outlook.com>

* serve: Add more env to help message of ollama serve

Add more environment variables to `ollama serve --help`
to let users know what can be configured.
Signed-off-by: Lei Jitang <leijitang@outlook.com>

---------
Signed-off-by: Lei Jitang <leijitang@outlook.com>


24 May, 2024 1 commit
- Move envconfig and consolidate env vars (#4608) · 4cc3be30
  Patrick Devine authored May 24, 2024
  
20 May, 2024 2 commits
- add fixes for llama · d355d202
  Patrick Devine authored May 08, 2024
  
- Move the parser back + handle utf16 files (#4533) · ccdf0b2a
  Patrick Devine authored May 20, 2024
  
18 May, 2024 1 commit
- add OLLAMA_NOHISTORY to turn off history in interactive mode (#4508) · 105186aa
  Patrick Devine authored May 18, 2024
  
16 May, 2024 3 commits
- removed comment · 3d90156e
  Josh Yan authored May 16, 2024
  
- go fmt'd cmd.go · 26bfc1c4
  Josh Yan authored May 15, 2024
  
- go fmt'd cmd.go · 799aa988
  Josh Yan authored May 15, 2024
  
15 May, 2024 2 commits
- updated double-width display · c9e584fb
  Josh Yan authored May 15, 2024
  
- fixed width and word count for double spacing · 17b1e81c
  Josh Yan authored May 15, 2024
  
14 May, 2024 2 commits
- fix keepalive for non-interactive mode (#4438) · c344da4c
  Patrick Devine authored May 14, 2024
  
- Ollama `ps` command for showing currently loaded models (#4327) · 68459888
  Patrick Devine authored May 13, 2024
  
13 May, 2024 2 commits
- removed inconsistencies · f8464785
  Josh Yan authored May 13, 2024
  
- removed inconsistent punctuation · 91a090a4
  Josh Yan authored May 13, 2024
  
11 May, 2024 1 commit
- fix `ollama create`'s usage string (#4362) · 8080fbce
  todashuta authored May 12, 2024
  
10 May, 2024 1 commit

Use `--quantize` flag and `quantize` api parameter (#4321) · 6602e793

Jeffrey Morgan authored May 10, 2024



* rename `--quantization` to `--quantize`

* backwards

* Update api/types.go
Co-authored-by: Michael Yang <mxyng@pm.me>

---------
Co-authored-by: Michael Yang <mxyng@pm.me>


06 May, 2024 1 commit
- close server on receiving signal (#4213) · 39d9d22c
  Jeffrey Morgan authored May 06, 2024
  
01 May, 2024 4 commits
- server: target invalid · 45b6a12e
  Michael Yang authored May 01, 2024
  
- rename parser to model/file · 119589fc
  Michael Yang authored Apr 30, 2024
  
- cmd: import regexp · 5ea84496
  Michael Yang authored May 01, 2024
  
- parser: add commands format · 176ad3aa
  Michael Yang authored Apr 24, 2024
  
30 Apr, 2024 1 commit

prompt to display and add local ollama keys to account (#3717) · 0a7fdbe5

Bruce MacDonald authored Apr 30, 2024

- return descriptive error messages when unauthorized to create blob or push a model
- display the local public key associated with the request that was denied


29 Apr, 2024 1 commit
- better checking for OLLAMA_HOST variable (#3661) · 9009bedf
  Patrick Devine authored Apr 29, 2024
  
26 Apr, 2024 1 commit
- check file type before zip · 41e03ede
  Michael Yang authored Apr 25, 2024
  
24 Apr, 2024 2 commits
- only replace if it matches command · ac0801ec
  Michael Yang authored Apr 24, 2024
  
- split temp zip files · ad66e5b0
  Michael Yang authored Apr 22, 2024
  
15 Apr, 2024 2 commits
- Revert "cmd: provide feedback if OLLAMA_MODELS is set on non-serve command (#3470)" (#3662) · 949d7832
  Blake Mizerany authored Apr 15, 2024
```
This reverts commit 7d05a6ee.

This proved to be more painful than useful.

See: https://github.com/ollama/ollama/issues/3624
```
- Add llama2 / torch models for `ollama create` (#3607) · 9f8691c6
  Patrick Devine authored Apr 15, 2024
  
08 Apr, 2024 1 commit
- cgo quantize · 9502e566
  Michael Yang authored Apr 05, 2024
  
03 Apr, 2024 1 commit

cmd: provide feedback if OLLAMA_MODELS is set on non-serve command (#3470) · 7d05a6ee

Blake Mizerany authored Apr 02, 2024

This also moves the checkServerHeartbeat call out of the "RunE" Cobra
stuff (that's the only word I have for that) to on-site where it's after
the check for OLLAMA_MODELS, which allows the helpful error message to
be printed before the server heartbeat check. This also arguably makes
the code more readable without the magic/superfluous "pre" function
caller.
