1. 18 Oct, 2024 1 commit
  2. 10 Oct, 2024 1 commit
    • cli: Send all images in conversation history · 7fe39025
      Jesse Gross authored
      Currently the CLI only sends images from the most recent
      image-containing message. This prevents doing things like sending
      one message with an image, then a follow-up message with a second
      image, and asking for a comparison based on additional
      information not present in any text that was output.
      
      It's possible that some models have a problem with this, but the
      CLI is not the right place to compensate: any adjustments are
      model-specific and should apply to all clients.
      
      Both llava:34b and minicpm-v handle multiple images in the
      history reasonably; see the API-level sketch below.
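      A minimal sketch of what this enables, at the API level rather
      than in the CLI code itself. It assumes a local Ollama server,
      the /api/chat endpoint with base64-encoded images, and
      placeholder file names; every image-bearing message in the
      history is sent, not just the most recent one.

          package main

          import (
              "bytes"
              "encoding/base64"
              "encoding/json"
              "fmt"
              "net/http"
              "os"
          )

          // message mirrors the chat message shape: images travel as
          // base64-encoded strings alongside the text content.
          type message struct {
              Role    string   `json:"role"`
              Content string   `json:"content"`
              Images  []string `json:"images,omitempty"`
          }

          func encodeImage(path string) string {
              data, err := os.ReadFile(path)
              if err != nil {
                  panic(err)
              }
              return base64.StdEncoding.EncodeToString(data)
          }

          func main() {
              // Two image-bearing user messages; before this change the
              // CLI forwarded only the image from the most recent one.
              body, _ := json.Marshal(map[string]any{
                  "model": "llava:34b",
                  "messages": []message{
                      {Role: "user", Content: "Here is the first image.", Images: []string{encodeImage("first.png")}},
                      {Role: "assistant", Content: "Got it."},
                      {Role: "user", Content: "How does this one compare?", Images: []string{encodeImage("second.png")}},
                  },
                  "stream": false,
              })
              resp, err := http.Post("http://localhost:11434/api/chat", "application/json", bytes.NewReader(body))
              if err != nil {
                  panic(err)
              }
              defer resp.Body.Close()
              var out struct {
                  Message message `json:"message"`
              }
              json.NewDecoder(resp.Body).Decode(&out)
              fmt.Println(out.Message.Content)
          }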
  3. 01 Oct, 2024 1 commit
  4. 11 Sep, 2024 2 commits
  5. 05 Sep, 2024 2 commits
  6. 01 Sep, 2024 1 commit
  7. 23 Aug, 2024 1 commit
  8. 21 Aug, 2024 1 commit
  9. 14 Aug, 2024 1 commit
  10. 12 Aug, 2024 1 commit
  11. 02 Aug, 2024 1 commit
  12. 27 Jul, 2024 1 commit
  13. 26 Jul, 2024 3 commits
  14. 23 Jul, 2024 1 commit
  15. 22 Jul, 2024 3 commits
    • bool · 55cd3ddc
      Michael Yang authored
    • host · 4f1afd57
      Michael Yang authored
    • Remove no longer supported max vram var · cc269ba0
      Daniel Hiltgen authored
      The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM
      scenarios. With the concurrency changes it was no longer wired up,
      and a single value doesn't map cleanly to multi-GPU setups. Users
      can still set `num_gpu` to limit memory usage and avoid OOM if our
      predictions turn out to be wrong; see the sketch below.
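      A minimal sketch of the remaining knob, assuming a local Ollama
      server: capping GPU offload per request with the num_gpu option
      (the number of layers offloaded). The layer count here is an
      arbitrary example; the same parameter can also be set in a
      Modelfile.

          package main

          import (
              "bytes"
              "encoding/json"
              "fmt"
              "io"
              "net/http"
          )

          func main() {
              // num_gpu bounds how many layers are offloaded to the GPU,
              // which indirectly bounds VRAM use when the scheduler's
              // memory predictions are wrong.
              body, _ := json.Marshal(map[string]any{
                  "model":   "llama3",
                  "prompt":  "hi",
                  "stream":  false,
                  "options": map[string]any{"num_gpu": 20},
              })
              resp, err := http.Post("http://localhost:11434/api/generate", "application/json", bytes.NewReader(body))
              if err != nil {
                  panic(err)
              }
              defer resp.Body.Close()
              out, _ := io.ReadAll(resp.Body)
              fmt.Println(string(out))
          }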
  16. 14 Jul, 2024 1 commit
  17. 12 Jul, 2024 2 commits
  18. 28 Jun, 2024 2 commits
  19. 27 Jun, 2024 1 commit
  20. 25 Jun, 2024 1 commit
    • cmd: defer stating model info until necessary (#5248) · 2aa91a93
      Blake Mizerany authored
      This commit changes the 'ollama run' command to defer fetching
      model information until it is actually needed, that is, only when
      running in interactive mode.

      It also removes one case where the model information was fetched
      in duplicate: once just before calling generateInteractive and
      then again, first thing, inside generateInteractive. A sketch of
      the deferral pattern follows the timing output below.

      This measurably improves the command's performance:
      
          ; time ./before run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.168 total
          ; time ./before run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.220 total
          ; time ./before run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.217 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./after run llama3 'hi'  0.02s user 0.01s system 4% cpu 0.652 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./after run llama3 'hi'  0.01s user 0.01s system 5% cpu 0.498 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with or would you like to chat?
      
          ./after run llama3 'hi'  0.01s user 0.01s system 3% cpu 0.479 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./after run llama3 'hi'  0.02s user 0.01s system 5% cpu 0.507 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./after run llama3 'hi'  0.02s user 0.01s system 5% cpu 0.507 total
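      A minimal sketch of the deferral pattern, with illustrative names
      rather than the actual cmd package code: the info fetch happens
      only on the interactive path, once, and the result is passed down
      instead of being re-fetched.

          package main

          import "fmt"

          type modelInfo struct{ Details string }

          // fetchModelInfo stands in for the API round trip that made
          // the one-shot path slow when performed unconditionally.
          func fetchModelInfo(model string) modelInfo {
              fmt.Println("fetching info for", model)
              return modelInfo{Details: "..."}
          }

          func generate(model, prompt string) {
              fmt.Println("response to", prompt)
          }

          func generateInteractive(model string, info modelInfo) {
              fmt.Println("interactive session for", model, "with", info.Details)
          }

          func run(model, prompt string, interactive bool) {
              if !interactive {
                  generate(model, prompt) // one-shot: no info fetch at all
                  return
              }
              info := fetchModelInfo(model) // fetched once, then reused
              generateInteractive(model, info)
          }

          func main() {
              run("llama3", "hi", false) // fast path: info never fetched
              run("llama3", "hi", true)  // interactive: fetched exactly once
          }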
  21. 19 Jun, 2024 1 commit
    • Extend api/show and ollama show to return more model info (#4881) · fedf7163
      royjhan authored

      * API Show Extended
      
      * Initial Draft of Information
      Co-Authored-By: Patrick Devine <pdevine@sonic.net>
      
      * Clean Up
      
      * Descriptive arg error messages and other fixes
      
      * Second Draft of Show with Projectors Included
      
      * Remove Chat Template
      
      * Touches
      
      * Prevent wrapping from files
      
      * Verbose functionality
      
      * Docs
      
      * Address Feedback
      
      * Lint
      
      * Resolve Conflicts
      
      * Function Name
      
      * Tests for api/show model info
      
      * Show Test File
      
      * Add Projector Test
      
      * Clean routes
      
      * Projector Check
      
      * Move Show Test
      
      * Touches
      
      * Doc update
      
      ---------
      Co-authored-by: Patrick Devine <pdevine@sonic.net>
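      A minimal sketch of querying the extended endpoint, assuming a
      local Ollama server and the request shape from the API docs of
      the time; it simply prints the raw response, which now carries
      the additional model info fields.

          package main

          import (
              "bytes"
              "encoding/json"
              "fmt"
              "io"
              "net/http"
          )

          func main() {
              body, _ := json.Marshal(map[string]any{"name": "llama3"})
              resp, err := http.Post("http://localhost:11434/api/show", "application/json", bytes.NewReader(body))
              if err != nil {
                  panic(err)
              }
              defer resp.Body.Close()
              raw, _ := io.ReadAll(resp.Body)
              fmt.Println(string(raw)) // includes the extended model info
          }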
  22. 12 Jun, 2024 1 commit
  23. 04 Jun, 2024 4 commits
  24. 30 May, 2024 3 commits
  25. 24 May, 2024 1 commit
  26. 21 May, 2024 1 commit
  27. 20 May, 2024 1 commit