- 24 Apr, 2024 12 commits
-
Michael Yang authored
-
Michael Yang authored
-
Blake Mizerany authored
-
Daniel Hiltgen authored
AMD gfx patch rev is hex
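The fix above notes that the patch revision in an AMD gfx target is hexadecimal, not decimal. A minimal Go sketch of parsing under that assumption (the function name is illustrative, not the actual ollama API):

```go
package main

import (
	"fmt"
	"strconv"
)

// parsePatch parses the patch component of an AMD gfx target (e.g. the
// trailing "a" in "gfx90a") as hexadecimal rather than decimal.
func parsePatch(s string) (uint64, error) {
	return strconv.ParseUint(s, 16, 32)
}

func main() {
	p, err := parsePatch("a")
	if err != nil {
		panic(err)
	}
	fmt.Println(p) // "a" parses as 10 in hex; as decimal it would fail
}
```

Treating the field as decimal would reject targets like `gfx90a` outright, which is why the discovery fix matters.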
-
Daniel Hiltgen authored
Report errors on server lookup instead of path lookup failure
-
Daniel Hiltgen authored
Correctly handle gfx90a discovery
-
Patrick Devine authored
-
Patrick Devine authored
-
Patrick Devine authored
-
Blake Mizerany authored
This allows users of a valid Digest to know it has a minimum of 2 characters in the hash part for use when sharding. This is a reasonable restriction: the hash part is a SHA-256 hash, which is 64 characters long and the hash in common use, and there is no anticipation of using a hash with fewer than 2 characters. Also adds MustParseDigest, and replaces Digest.Type with Digest.Split for getting both the type and hash parts together, which is the most common case when asking for either.
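A sketch of how the guaranteed two-character minimum enables sharding. The `sha256-` separator, function names, and directory layout here are illustrative assumptions, not the actual ollama API:

```go
package main

import (
	"fmt"
	"path/filepath"
	"strings"
)

// splitDigest mirrors the described Digest.Split: it returns the type and
// hash parts of a digest string like "sha256-<64 hex chars>" together.
func splitDigest(d string) (typ, hash string) {
	typ, hash, _ = strings.Cut(d, "-")
	return typ, hash
}

// shardPath relies on the guaranteed minimum of 2 characters in the hash
// part to pick a two-character shard directory.
func shardPath(root, digest string) string {
	_, hash := splitDigest(digest)
	return filepath.Join(root, hash[:2], hash)
}

func main() {
	d := "sha256-0123456789abcdef0123456789abcdef0123456789abcdef0123456789abcdef"
	fmt.Println(shardPath("/var/lib/blobs", d))
}
```

Without the minimum-length guarantee, `hash[:2]` would need a bounds check on every call site.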
-
Daniel Hiltgen authored
Add back memory escape valve
-
Daniel Hiltgen authored
If we get our predictions wrong, this can be used to set a lower memory limit as a workaround. Recent multi-gpu refactoring accidentally removed it, so this adds it back.
-
- 23 Apr, 2024 26 commits
-
Daniel Hiltgen authored
Move nested payloads to installer and zip file on windows
-
Daniel Hiltgen authored
Give the go routine a moment to deliver the expired event
-
Daniel Hiltgen authored
Now that the llm runner is an executable and not just a DLL, more users are hitting security policy configurations on Windows that prevent writing to a directory and then executing binaries from that same location. This change removes payloads from the main executable on Windows and shifts them into the installer package, discovered relative to the executable's location. It also adds a new zip file for people who want to "roll their own" installation model.
-
Daniel Hiltgen authored
Detect and recover if runner removed
-
Michael authored
adding phi-3 mini to readme
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Tmp cleaners can nuke the file out from underneath us. This detects the missing runner and re-initializes the payloads.
-
Daniel Hiltgen authored
Adds support for customizing GPU build flags in llama.cpp
-
Michael Yang authored
fix: mixtral graph
-
Daniel Hiltgen authored
Request and model concurrency
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This change adds support for multiple concurrent requests, as well as loading multiple models by spawning multiple runners. The default settings are currently set at 1 concurrent request per model and only 1 loaded model at a time, but these can be adjusted by setting OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Trim spaces and quotes from llm lib override
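A sketch of the trimming described above, so an override set as `OLLAMA_LLM_LIBRARY="cpu"` (quotes included by the shell or a config file) still matches. The function name is illustrative:

```go
package main

import (
	"fmt"
	"strings"
)

// cleanOverride strips surrounding whitespace and single/double quote
// characters from an override value before comparing library names.
func cleanOverride(v string) string {
	return strings.Trim(strings.TrimSpace(v), `"'`)
}

func main() {
	fmt.Printf("%q\n", cleanOverride(` "cpu" `)) // quotes and padding removed
}
```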
-
Bruce MacDonald authored
- move some popular integrations to the top of the lists
-
Bruce MacDonald authored
This reverts commit fad00a85.
-
-
Michael Yang authored
-
Hao Wu authored
* add chat (web UI) for LLM: I have used chat with llama3 locally successfully and the code is MIT licensed.
* Update README.md

Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-
Maple Gao authored
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-
Võ Đình Đạt authored
-
Jonathan Smoley authored
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-
Eric Curtin authored
The goal of podman-ollama is to make AI even more boring. Signed-off-by: Eric Curtin <ecurtin@redhat.com>
-
Daniel Hiltgen authored
-
reid41 authored
* add qa-pilot link
* format the link
* add shell-pilot
-
Christian Neff authored
-
- 22 Apr, 2024 1 commit
-
Bruce MacDonald authored
-
- 21 Apr, 2024 1 commit
-
Jeremy authored
Fixed improper env references
-