Commits · c8059b4dcfad84ef85876d941a4e3d95bf5eb595 · OpenDAS / ollama

27 Jan, 2024 2 commits
- Merge pull request #2224 from jaglinux/fix_rocm_get_version_message · c8059b4d
  Daniel Hiltgen authored Jan 27, 2024
```
ROCm: Correct the response string in rocm_get_version function
```
- Update gpu_info_rocm.c · 59d87127
  Jagadish Krishnamoorthy authored Jan 26, 2024
26 Jan, 2024 17 commits
- add keep_alive to generate/chat/embedding api endpoints (#2146) · b5cf31b4
  Patrick Devine authored Jan 26, 2024
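The `keep_alive` field named in commit b5cf31b4 can be illustrated with a request body for the generate endpoint. This is a minimal Python sketch, not code from the repository: the helper name `build_generate_request` and the model name are hypothetical, and it assumes `keep_alive` accepts a duration string such as `"10m"`, as the Ollama API documentation describes. It only builds the JSON payload and sends nothing over the network.

```python
import json


def build_generate_request(model: str, prompt: str, keep_alive="5m") -> str:
    """Serialise a JSON body for POST /api/generate with a keep_alive hint.

    keep_alive tells the server how long to keep the model loaded after the
    request; the "5m" default used here is an assumption of this sketch.
    """
    body = {"model": model, "prompt": prompt, "keep_alive": keep_alive}
    return json.dumps(body)


# Hypothetical usage: ask the server to keep the model loaded for ten minutes.
payload = build_generate_request("llama2", "Why is the sky blue?", keep_alive="10m")
```

The same field would be added to the chat and embedding request bodies in the same way.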
- Merge pull request #2214 from dhiltgen/reject_cuda_without_avx · cc4915e2
  Daniel Hiltgen authored Jan 26, 2024
```
Detect lack of AVX and fallback to CPU mode
```
- Detect lack of AVX and fallback to CPU mode · 667a2ba1
  Daniel Hiltgen authored Jan 26, 2024
```
We build the GPU libraries with AVX enabled to ensure that if not all
layers fit on the GPU we get better performance in a mixed mode.
If the user is using a virtualization/emulation system that lacks AVX
this used to result in an illegal instruction error and crash before this
fix.  Now we will report a warning in the server log, and just use
CPU mode to ensure we don't crash.
```
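The fallback described in commit 667a2ba1 can be sketched as follows. This is an illustrative Python sketch, not the project's actual detection code: it assumes a Linux-style `/proc/cpuinfo` text as the source of CPU flags, and the function names `has_avx` and `select_mode` are hypothetical.

```python
import logging

logger = logging.getLogger("gpu-sketch")


def has_avx(cpuinfo_text: str) -> bool:
    """Return True if any 'flags' line in /proc/cpuinfo-style text lists avx."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        # Compare whole tokens so that "avx2" alone does not count as "avx".
        if key.strip() == "flags" and "avx" in value.split():
            return True
    return False


def select_mode(gpu_detected: bool, avx_available: bool) -> str:
    """Pick 'gpu' only when a GPU is present and the CPU supports AVX."""
    if gpu_detected and not avx_available:
        # The GPU libraries are built with AVX, so loading them would crash.
        logger.warning("CPU lacks AVX; GPU libraries disabled, using CPU mode")
        return "cpu"
    return "gpu" if gpu_detected else "cpu"
```

Checking the flag before loading any AVX-built library is what turns the illegal-instruction crash into a logged warning.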
- Merge pull request #2212 from ollama/mxyng/fix-build · e054ebe0
  Michael Yang authored Jan 26, 2024
```
fix build
```
- fix logging · 9d3dcfd0
  Michael Yang authored Jan 26, 2024
- Merge pull request #1916 from ollama/mxyng/inactivity-monitor · 6e0ea5ec
  Michael Yang authored Jan 26, 2024
```
download: add inactivity monitor
```
- Merge pull request #2197 from dhiltgen/remove_rocm_image · a47d8b25
  Daniel Hiltgen authored Jan 26, 2024
```
Add back ROCm container support
```
- Merge pull request #2195 from dhiltgen/rocm_real_gpus · 30c43c28
  Daniel Hiltgen authored Jan 26, 2024
```
Ignore AMD integrated GPUs
```
- Merge pull request #2209 from dhiltgen/harden_mgmt · 23a7ea59
  Daniel Hiltgen authored Jan 26, 2024
```
Fix crash on cuda ml init failure
```
- Add back ROCm container support · 75c44aa3
  Daniel Hiltgen authored Jan 25, 2024
```
This adds ROCm support back as a discrete image.
```
- Ignore AMD integrated GPUs · 9d7b5d6c
  Daniel Hiltgen authored Jan 25, 2024
```
Detect and ignore integrated GPUs reported by rocm.
```
- Fix crash on cuda ml init failure · 5d9c4a5f
  Daniel Hiltgen authored Jan 26, 2024
```
The new driver lookup code was triggering after init failure due to a missing return
```
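The class of bug fixed in commit 5d9c4a5f, where later steps run after a failed initialisation because an early return is missing, can be sketched as follows. This is a Python illustration under stated assumptions, not the project's code: `InitResult`, `init_gpu_library`, and the step names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class InitResult:
    ok: bool
    error: str = ""
    steps: list = field(default_factory=list)  # records which stages ran


def init_gpu_library(load_ok: bool) -> InitResult:
    """Initialise a management library, stopping at the first failure."""
    result = InitResult(ok=False)
    result.steps.append("load")
    if not load_ok:
        result.error = "management library init failed"
        # Without this return, the driver lookup below would still run
        # against a library that never initialised.
        return result
    result.steps.append("driver_lookup")
    result.ok = True
    return result
```

Recording the executed steps makes the control flow easy to check: a failed load should leave `steps` at `["load"]` only.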
- Merge pull request #2196 from dhiltgen/remove_rocm_image · 197e420a
  Daniel Hiltgen authored Jan 25, 2024
```
Switch back to ubuntu base
```
- Switch back to ubuntu base · a34e1ad3
  Daniel Hiltgen authored Jan 25, 2024
```
The size increase for rocm support in the standard image is problematic.
We'll revisit multiple tags for rocm support in a follow-up PR.
```
- Merge pull request #1679 from ollama/mxyng/build-gpus · 2ae05562
  Michael Yang authored Jan 25, 2024
```
build cuda and rocm
```
- Update modelfile.md · 5be9bdd4
  Jeffrey Morgan authored Jan 25, 2024
- Update modelfile.md to include `MESSAGE` · b7067949
  Jeffrey Morgan authored Jan 25, 2024
25 Jan, 2024 11 commits
- only generate gpu libs · a8c5413d
  Michael Yang authored Jan 19, 2024
- archive ollama binaries · 5580de45
  Michael Yang authored Dec 22, 2023
- build cuda and rocm · 946431d5
  Michael Yang authored Dec 22, 2023
- remove env setting · 06101260
  Michael Yang authored Jan 18, 2024
- update submodule to `cd4fddb29f81d6a1f6d51a0c016bc6b486d68def` · 3ebd6a83
  Jeffrey Morgan authored Jan 25, 2024
- Fix clearing kv cache between requests with the same prompt (#2186) · a64570dc
  Jeffrey Morgan authored Jan 25, 2024
```
* Fix clearing kv cache between requests with the same prompt

* fix powershell script
```
- Save and load sessions (#2063) · 7c40a678
  Patrick Devine authored Jan 25, 2024
- Merge pull request #2181 from ollama/mxyng/stub-lint · e64b5b07
  Michael Yang authored Jan 25, 2024
```
stub generate outputs for lint
```
- Merge pull request #2175 from ollama/mxyng/refactor-tensor-read · 9e1e295c
  Michael Yang authored Jan 25, 2024
```
refactor tensor read
```
- Update README.md · a643823f
  Jeffrey Morgan authored Jan 24, 2024
- stub generate outputs for lint · 8e5d359a
  Michael Yang authored Jan 24, 2024
24 Jan, 2024 5 commits
- Merge pull request #2174 from dhiltgen/rocm_real_gpus · a170888d
  Daniel Hiltgen authored Jan 24, 2024
```
More logging for gpu management
```
- refactor tensor read · cd22855e
  Michael Yang authored Jan 24, 2024
- More logging for gpu management · 013fd071
  Daniel Hiltgen authored Jan 24, 2024
```
Fix an ordering glitch of dlerr/dlclose and add more logging to help
root cause some crashes users are hitting. This also refines the
function pointer names to use the underlying function names instead
of simplified names for readability.
```
- Merge pull request #2162 from dhiltgen/rocm_real_gpus · f63dc2db
  Daniel Hiltgen authored Jan 23, 2024
```
Report more information about GPUs in verbose mode
```
- Update README.md · eaa5a396
  Jeffrey Morgan authored Jan 23, 2024
23 Jan, 2024 5 commits
- Update README.md · 8ed22f5d
  Jeffrey Morgan authored Jan 23, 2024
- Report more information about GPUs in verbose mode · 987c16b2
  Daniel Hiltgen authored Jan 22, 2024
```
This adds additional calls to both CUDA and ROCm management libraries to
discover additional attributes about the GPU(s) detected in the system, and
wires up runtime verbosity selection.  When users hit problems with GPUs we can
ask them to run with `OLLAMA_DEBUG=1 ollama serve` and share the results.
```
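The runtime verbosity selection mentioned in commit 987c16b2 can be sketched as an environment check. This Python sketch is illustrative only: the `OLLAMA_DEBUG` variable comes from the commit message, while the helper name `log_level_from_env` and the rule that any non-empty value enables debug output are assumptions of this example.

```python
import logging
import os


def log_level_from_env(env=None) -> int:
    """Return logging.DEBUG when OLLAMA_DEBUG is set to a non-empty value."""
    # Accept an explicit mapping so the function can be checked without
    # touching the real process environment.
    env = os.environ if env is None else env
    if env.get("OLLAMA_DEBUG", "") != "":
        return logging.DEBUG
    return logging.INFO


logging.basicConfig(level=log_level_from_env())
```

Run with `OLLAMA_DEBUG=1` in the environment, this selects debug-level logging; otherwise it stays at info level.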
- Update README.md · 950f636d
  Jeffrey Morgan authored Jan 23, 2024
- Load all layers on `arm64` macOS if model is small enough (#2149) · 4458efb7
  Jeffrey Morgan authored Jan 22, 2024
- Merge pull request #2150 from dhiltgen/default_version · ceea5994
  Daniel Hiltgen authored Jan 22, 2024
```
Set a default version using git describe
```