Commits · bfec2c6e1014e9053c727bcabd3f5079716c765b · OpenDAS / ollama

09 Mar, 2024 5 commits
- simplify host checks · bfec2c6e
  Jeffrey Morgan authored Mar 08, 2024
  
  bfec2c6e
- add additional allowed hosts · 5c143af7
  Jeffrey Morgan authored Mar 08, 2024
  
  5c143af7
- Update docs `README.md` and table of contents · 6c0af259
  Jeffrey Morgan authored Mar 08, 2024
  
  6c0af259
- add allowed host middleware and remove `workDir` middleware (#3018) · fc8c0445
  Jeffrey Morgan authored Mar 08, 2024
  
  fc8c0445
- Merge pull request #3014 from ollama/mxyng/decode-ggla · ecc133d8
  Michael Yang authored Mar 08, 2024
  
  ecc133d8
08 Mar, 2024 8 commits
- decode ggla · 76bdebba
  Michael Yang authored Mar 08, 2024
  
  76bdebba
- convert: fix default shape · 18979ad4
  Michael Yang authored Mar 08, 2024
  
  18979ad4
- Merge pull request #2990 from ollama/mxyng/default-term-size · 8e0ef931
  Michael Yang authored Mar 08, 2024
```
fix: default terminal width, height
```
  8e0ef931
- Merge pull request #2988 from dhiltgen/rocm_docs · 280da445
  Daniel Hiltgen authored Mar 08, 2024
```
Refined ROCm troubleshooting docs
```
  280da445
- fix: allow importing a model from name reference (#3005) · 0cebc79c
  Bruce MacDonald authored Mar 08, 2024
  
  0cebc79c
- update llama.cpp submodule to `6cdabe6` (#2999) · 0e4669b0
  Jeffrey Morgan authored Mar 08, 2024
  
  0e4669b0
- Update api.md · b886bec3
  Jeffrey Morgan authored Mar 07, 2024
  
  b886bec3
- Revert "adjust download and upload concurrency based on available bandwidth" (#2995) · fc062059
  Jeffrey Morgan authored Mar 07, 2024
  
  fc062059
07 Mar, 2024 17 commits

cmd: tighten up env var usage sections (#2962) · 2ada81e0

Blake Mizerany authored Mar 07, 2024

Also, document OLLAMA_HOST client semantics per command that honors it.
This looks nicer than having a general puprose environment variable
section in the root usage which was showing up after the "addition help
topics" section outputed by Cobra's default template.

It was decided this was easier to work with than using a custom template
for Cobra right now.

2ada81e0

default terminal width, height · b1e74d4f
Michael Yang authored Mar 07, 2024

b1e74d4f
Merge pull request #2991 from ollama/mxyng/fix-ci · f678f5c5
Michael Yang authored Mar 07, 2024
```
fix ci
```
f678f5c5
fix ci · 2cb74e23
Michael Yang authored Mar 07, 2024

2cb74e23
Refined ROCm troubleshooting docs · 69f02278
Daniel Hiltgen authored Mar 07, 2024

69f02278
Merge pull request #2885 from dhiltgen/rocm_v6_only · 3c8df380
Daniel Hiltgen authored Mar 07, 2024
```
Revamp ROCm support
```
3c8df380
Merge pull request #2985 from ollama/rm-empty-examples · 7d564835
Michael Yang authored Mar 07, 2024
```
remove empty examples
```
7d564835
no ci test on docs, examples · 72431031
Michael Yang authored Mar 07, 2024

72431031
remove empty examples · 6041abb5
Michael Yang authored Mar 07, 2024

6041abb5

Revamp ROCm support · 6c5ccb11

Daniel Hiltgen authored Feb 15, 2024

This refines where we extract the LLM libraries to by adding a new
OLLAMA_HOME env var, that defaults to `~/.ollama` The logic was already
idempotenent, so this should speed up startups after the first time a
new release is deployed. It also cleans up after itself.

We now build only a single ROCm version (latest major) on both windows
and linux. Given the large size of ROCms tensor files, we split the
dependency out. It's bundled into the installer on windows, and a
separate download on windows. The linux install script is now smart and
detects the presence of AMD GPUs and looks to see if rocm v6 is already
present, and if not, then downloads our dependency tar file.

For Linux discovery, we now use sysfs and check each GPU against what
ROCm supports so we can degrade to CPU gracefully instead of having
llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows
dynamic library loading logic to access the amdhip64.dll APIs to query
the GPU information.

6c5ccb11

Merge pull request #2221 from ollama/mxyng/up-down-ccy · 2e20110e
Michael Yang authored Mar 07, 2024
```
adjust download and upload concurrency based on available bandwidth
```
2e20110e
Merge pull request #2964 from dhiltgen/mem_limit_var · 82ddc3e4
Daniel Hiltgen authored Mar 07, 2024
```
Allow setting max vram for workarounds
```
82ddc3e4
update go to 1.22 in other places (#2975) · d481fb3c
Jeffrey Morgan authored Mar 07, 2024

d481fb3c
docs: Add LLM-X to Web Integration section (#2759) · 23ee6332
DJ Johnson authored Mar 07, 2024

23ee6332
fix some typos (#2973) · 23ebe8fe
John authored Mar 07, 2024
```
Signed-off-by: hishope <csqiye@126.com>
```
23ebe8fe
Convert Safetensors to an Ollama model (#2824) · 2c017ca4
Patrick Devine authored Mar 06, 2024

2c017ca4

Allow setting max vram for workarounds · be330174

Daniel Hiltgen authored Mar 06, 2024

Until we get all the memory calculations correct, this can provide
and escape valve for users to workaround out of memory crashes.

be330174

06 Mar, 2024 2 commits

cmd: document environment variables for serve command · 0ded7fdc
Blake Mizerany authored Mar 06, 2024
```
Updates #2944
```
0ded7fdc

Add Odin Runes, a Feature-Rich Java UI for Ollama, to README (#2440) · 2103a507

Leo authored Mar 07, 2024

* Add Odin Runes to README

Add Odin Runes to README

This commit adds Odin Runes to the "Community Integrations" section of the README. Odin Runes is a Java-based GPT client designed to provide seamless interaction with GPT models, enhancing productivity in prompt engineering and text generation tasks. This addition highlights the integration between Odin Runes and Ollama, offering users the flexibility to leverage large language models locally within their development workflow.

* Update README.md

this commit applies the comments of the reviewer.

2103a507

05 Mar, 2024 1 commit
- Update api.md · ce9f7c46
  Jeffrey Morgan authored Mar 05, 2024
  
  ce9f7c46
04 Mar, 2024 2 commits
- Add NotesOllama to Community Integrations (#2909) · e5596c19
  Anders Rex authored Mar 04, 2024
  
  e5596c19
- Added community link for Ollama Copilot (#2582) · 9bc3fee6
  Timothy Graupmann authored Mar 04, 2024
```
* Added community link for Ollama Copilot

* Update README.md

---------
Co-authored-by: Michael <mchiang0610@users.noreply.github.com>
```
  9bc3fee6
01 Mar, 2024 2 commits
- update llama.cpp submodule to `c29af7e` (#2868) · 21347e1e
  Jeffrey Morgan authored Mar 01, 2024
  
  21347e1e
- Fix embeddings load model behavior (#2848) · 3b4bab3d
  Jeffrey Morgan authored Feb 29, 2024
  
  3b4bab3d
29 Feb, 2024 3 commits
- Merge pull request #2838 from dhiltgen/opensuse · cbd6e3b3
  Daniel Hiltgen authored Feb 29, 2024
```
Add ollama user to video group
```
  cbd6e3b3
- Merge pull request #2837 from dhiltgen/podman_image_support · b830afa7
  Daniel Hiltgen authored Feb 29, 2024
```
Add env var so podman will map cuda GPUs
```
  b830afa7
- Merge pull request #2836 from bmwiedemann/gzip · bd1d8b0d
  Daniel Hiltgen authored Feb 29, 2024
```
Omit build date from gzip headers
```
  bd1d8b0d