Commits · 3ecae420ac3569f7feee6ab2577811ea01959d66 · OpenDAS / ollama

06 May, 2024 3 commits
- Update api.md (#3945) · 3ecae420
  Darinka authored May 07, 2024
```
* Update api.md

Changed the calculation of tps (token/s) in the documentation

* Update docs/api.md

---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
```
  3ecae420
- docs: pbcopy on mac (#3129) · aa93423f
  Adrien Brault authored May 06, 2024
  
  aa93423f
- chore: delete `HEAD` (#4194) · fb8ddc56
  Hyden Liu authored May 07, 2024
  
  fb8ddc56
05 May, 2024 1 commit
- Make maximum pending request configurable · 20f6c065
  Daniel Hiltgen authored May 03, 2024
```
This also bumps up the default to be 50 queued requests
instead of 10.
```
  20f6c065
04 May, 2024 1 commit
- Explain the 2 different windows download options · e006480e
  Daniel Hiltgen authored May 03, 2024
  
  e006480e
03 May, 2024 1 commit

Update 'llama2' -> 'llama3' in most places (#4116) · e8aaea03

Dr Nic Williams authored May 04, 2024



* Update 'llama2' -> 'llama3' in most places

---------
Co-authored-by: Patrick Devine <patrick@infrahq.com>

e8aaea03

02 May, 2024 1 commit
- fix line ending · 94c36909
  Michael Yang authored May 02, 2024
```
replace CRLF with LF
```
  94c36909
01 May, 2024 1 commit
- chore: fix typo in docs/development.md (#4073) · 68755f1f
  alwqx authored May 02, 2024
  
  68755f1f
30 Apr, 2024 1 commit
- Update langchainpy.md (#4037) · 5950c176
  Christian Frantzen authored Apr 30, 2024
```
Updated the code a bit
```
  5950c176
26 Apr, 2024 1 commit
- Update windows.md (#3855) · 2a80f55e
  Quinten van Buul authored Apr 26, 2024
```
Fixed a typo
```
  2a80f55e
24 Apr, 2024 1 commit
- add OLLAMA_KEEP_ALIVE env variable to FAQ (#3865) · 74d2a9ef
  Patrick Devine authored Apr 23, 2024
  
  74d2a9ef
20 Apr, 2024 1 commit
- Update api.md (#3705) · e6f9bfc0
  Sri Siddhaarth authored Apr 21, 2024
  
  e6f9bfc0
17 Apr, 2024 1 commit
- update jetson tutorial · 85bdf14b
  Jeremy authored Apr 17, 2024
  
  85bdf14b
15 Apr, 2024 2 commits
- Update langchainjs.md (#2030) · a27e419b
  Carlos Gamez authored Apr 16, 2024
```
Changed ollama.call() for ollama.invoke() as per deprecated documentation from langchain
```
  a27e419b
- Update modelfile.md · e54a3c7f
  Jeffrey Morgan authored Apr 15, 2024
```
Remove Modelfile parameters that are decided at runtime
```
  e54a3c7f
09 Apr, 2024 2 commits

Revert "build.go: introduce a friendlier way to build Ollama (#3548)" (#3564) · 1524f323
Blake Mizerany authored Apr 09, 2024

1524f323

build.go: introduce a friendlier way to build Ollama (#3548) · fccf3eec

Blake Mizerany authored Apr 09, 2024

This commit introduces a more friendly way to build Ollama dependencies
and the binary without abusing `go generate` and removing the
unnecessary extra steps it brings with it.

This script also provides nicer feedback to the user about what is
happening during the build process.

At the end, it prints a helpful message to the user about what to do
next (e.g. run the new local Ollama).

fccf3eec

06 Apr, 2024 1 commit
- Docs: Remove wrong parameter for Chat Completion (#3515) · cb03fc95
  Thomas Vitale authored Apr 06, 2024
```
Fixes gh-3514
Signed-off-by: Thomas Vitale <ThomasVitale@users.noreply.github.com>
```
  cb03fc95
01 Apr, 2024 1 commit

Safeguard for noexec · 0a74cb31

Daniel Hiltgen authored Mar 28, 2024

We may have users that run into problems with our current
payload model, so this gives us an escape valve.

0a74cb31

26 Mar, 2024 2 commits
- remove need for `$VSINSTALLDIR` since build will fail if `ninja` cannot be found (#3350) · 856b8ec1
  Jeffrey Morgan authored Mar 26, 2024
  
  856b8ec1
- change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347) · 1b272d5b
  Patrick Devine authored Mar 26, 2024
  
  1b272d5b
25 Mar, 2024 2 commits
- Fix ROCm link in `development.md` · f38b705d
  Jeffrey Morgan authored Mar 25, 2024
  
  f38b705d
- doc: specify ADAPTER is optional (#3333) · 22921a39
  Blake Mizerany authored Mar 25, 2024
  
  22921a39
21 Mar, 2024 2 commits
- Add docs for GPU selection and nvidia uvm workaround · d8fdbfd8
  Daniel Hiltgen authored Mar 21, 2024
  
  d8fdbfd8
- doc: faq gpu compatibility (#3142) · a5ba0fcf
  Bruce MacDonald authored Mar 21, 2024
  
  a5ba0fcf
20 Mar, 2024 1 commit
- Update faq.md · 3a30bf56
  Jeffrey Morgan authored Mar 20, 2024
  
  3a30bf56
18 Mar, 2024 2 commits
- Update faq.md · 7ed3e941
  Jeffrey Morgan authored Mar 18, 2024
  
  7ed3e941
- update `faq.md` · 2297ad39
  jmorganca authored Mar 16, 2024
  
  2297ad39
15 Mar, 2024 1 commit
- Add ROCm support to linux install script (#2966) · 6459377a
  Daniel Hiltgen authored Mar 14, 2024
  
  6459377a
14 Mar, 2024 1 commit
- Update README.md · 5ce997a7
  Jeffrey Morgan authored Mar 13, 2024
  
  5ce997a7
12 Mar, 2024 2 commits
- add more docs on for the modelfile message command (#3087) · ba7cf7fb
  Patrick Devine authored Mar 12, 2024
  
  ba7cf7fb
- Add docs explaining GPU selection env vars · b53229a2
  Daniel Hiltgen authored Mar 11, 2024
  
  b53229a2
11 Mar, 2024 1 commit
- Update troubleshooting.md · 6d3adfbe
  Jeffrey Morgan authored Mar 11, 2024
  
  6d3adfbe
09 Mar, 2024 3 commits
- Doc how to set up ROCm builds on windows · 0fdebb34
  Daniel Hiltgen authored Mar 09, 2024
  
  0fdebb34
- Finish unwinding idempotent payload logic · 4a5c9b80
  Daniel Hiltgen authored Mar 08, 2024
```
The recent ROCm change partially removed idempotent
payloads, but the ggml-metal.metal file for mac was still
idempotent.  This finishes switching to always extract
the payloads, and now that idempotentcy is gone, the
version directory is no longer useful.
```
  4a5c9b80
- Update docs `README.md` and table of contents · 6c0af259
  Jeffrey Morgan authored Mar 08, 2024
  
  6c0af259
08 Mar, 2024 1 commit
- Update api.md · b886bec3
  Jeffrey Morgan authored Mar 07, 2024
  
  b886bec3
07 Mar, 2024 3 commits

Refined ROCm troubleshooting docs · 69f02278
Daniel Hiltgen authored Mar 07, 2024

69f02278

Revamp ROCm support · 6c5ccb11

Daniel Hiltgen authored Feb 15, 2024

This refines where we extract the LLM libraries to by adding a new
OLLAMA_HOME env var, that defaults to `~/.ollama` The logic was already
idempotenent, so this should speed up startups after the first time a
new release is deployed. It also cleans up after itself.

We now build only a single ROCm version (latest major) on both windows
and linux. Given the large size of ROCms tensor files, we split the
dependency out. It's bundled into the installer on windows, and a
separate download on windows. The linux install script is now smart and
detects the presence of AMD GPUs and looks to see if rocm v6 is already
present, and if not, then downloads our dependency tar file.

For Linux discovery, we now use sysfs and check each GPU against what
ROCm supports so we can degrade to CPU gracefully instead of having
llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows
dynamic library loading logic to access the amdhip64.dll APIs to query
the GPU information.

6c5ccb11

update go to 1.22 in other places (#2975) · d481fb3c
Jeffrey Morgan authored Mar 07, 2024

d481fb3c