- 05 Jan, 2024 6 commits
-
-
Jeffrey Morgan authored
* gpu: read memory info from all cuda devices
* add `LOOKUP_SIZE` constant
* better constant name
* address comments
-
Bruce MacDonald authored
-
Matt Williams authored
-
Michael Yang authored
update Dockerfile.build
-
Matt Williams authored
Signed-off-by: Matt Williams <m@technovangelist.com>
-
Michael Yang authored
-
- 04 Jan, 2024 19 commits
-
-
Daniel Hiltgen authored
Clean up stale submodule
-
Daniel Hiltgen authored
If the tree has a stale submodule, make sure we clean it up first
-
Daniel Hiltgen authored
Revamp code layout for the llm directory and llama.cpp submodule
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Brian Murray authored
-
Daniel Hiltgen authored
Load dynamic cpu lib on windows
-
Daniel Hiltgen authored
On Linux, we link the CPU library into the Go app and fall back to it when no GPU match is found. On Windows we do not link in the CPU library so that we can better control our dependencies for the CLI. This fixes the logic so we correctly fall back to the dynamic CPU library on Windows.
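A minimal sketch of that fallback selection in Go. The function name `pickLLMLibrary` and the library filenames (`ext_server_cuda.dll`, `ext_server_cpu.dll`) are illustrative assumptions, not Ollama's actual API or file layout.

```go
// Sketch only: prefer a GPU-specific dynamic library, otherwise fall back
// to the dynamic CPU library. Names and paths here are hypothetical.
package main

import (
	"errors"
	"fmt"
	"os"
	"path/filepath"
)

// pickLLMLibrary returns the first GPU library that exists on disk,
// or the dynamic CPU library if no GPU library is found.
func pickLLMLibrary(libDir string, gpuLibs []string, cpuLib string) (string, error) {
	for _, lib := range gpuLibs {
		p := filepath.Join(libDir, lib)
		if _, err := os.Stat(p); err == nil {
			return p, nil
		}
	}
	p := filepath.Join(libDir, cpuLib)
	if _, err := os.Stat(p); err == nil {
		return p, nil
	}
	return "", errors.New("no usable llm library found")
}

func main() {
	lib, err := pickLLMLibrary(`C:\ollama\libs`,
		[]string{"ext_server_cuda.dll"}, "ext_server_cpu.dll")
	if err != nil {
		fmt.Println("fallback failed:", err)
		return
	}
	fmt.Println("loading", lib)
}
```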
-
Bruce MacDonald authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* update cmake flags for intel macOS
* remove `LLAMA_K_QUANTS`
* put back `CMAKE_OSX_DEPLOYMENT_TARGET` and disable `LLAMA_F16C`
-
Daniel Hiltgen authored
Improve maintainability of Radeon card list
-
Daniel Hiltgen authored
Fail fast on WSL1 while allowing on WSL2
-
Daniel Hiltgen authored
Fix CPU only builds
-
Daniel Hiltgen authored
Go embed doesn't like it when there are no matching files, so put a dummy placeholder in to allow building without any GPU support. If no "server" library is found, it's safely ignored at runtime.
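For context, a small sketch of the `go:embed` placeholder trick: the directive fails to compile when a pattern matches no files, so a dummy file is committed alongside any real payloads. The `payloads/` directory, the placeholder filename, and the `serverLibraries` helper are assumptions for illustration, not the repo's real layout.

```go
// Sketch, assuming a payloads/ directory that always contains at least a
// committed placeholder file so CPU-only builds still compile.
package llm

import (
	"embed"
	"strings"
)

// The pattern matches both real server libraries and the placeholder.
//
//go:embed payloads/*
var libEmbed embed.FS

// serverLibraries lists the embedded files, skipping the placeholder,
// so callers can treat an empty result as "no GPU server library".
func serverLibraries() ([]string, error) {
	entries, err := libEmbed.ReadDir("payloads")
	if err != nil {
		return nil, err
	}
	var libs []string
	for _, e := range entries {
		if strings.HasPrefix(e.Name(), "placeholder") {
			continue
		}
		libs = append(libs, e.Name())
	}
	return libs, nil
}
```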
-
Daniel Hiltgen authored
This prevents users from accidentally installing on WSL1, with instructions guiding them to upgrade their WSL instance to version 2. Once running WSL2, if you have an NVIDIA card, you can follow their instructions to set up GPU passthrough and run models on the GPU. This is not possible on WSL1.
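A rough Go sketch of one common heuristic for telling WSL1 from WSL2 by inspecting the kernel release string; this is an assumed approach for illustration, not necessarily the exact check the installer performs.

```go
// Heuristic sketch: WSL2 kernels typically report something like
// "...-microsoft-standard-WSL2", while WSL1 reports "...-Microsoft".
package main

import (
	"fmt"
	"os"
	"strings"
)

func wslVersion() int {
	data, err := os.ReadFile("/proc/sys/kernel/osrelease")
	if err != nil {
		return 0 // cannot tell
	}
	release := strings.ToLower(strings.TrimSpace(string(data)))
	switch {
	case strings.Contains(release, "wsl2"),
		strings.Contains(release, "microsoft-standard"):
		return 2
	case strings.Contains(release, "microsoft"):
		return 1
	default:
		return 0 // plain Linux
	}
}

func main() {
	if wslVersion() == 1 {
		fmt.Fprintln(os.Stderr,
			"WSL1 is not supported; upgrade with: wsl --set-version <distro> 2")
		os.Exit(1)
	}
	fmt.Println("proceeding with install")
}
```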
-
- 03 Jan, 2024 13 commits
-
-
Daniel Hiltgen authored
This moves the list of AMD GPUs into an easier-to-maintain form, which should make it simpler to update over time.
-
Daniel Hiltgen authored
Add ollama user to render group for Radeon support
-
Daniel Hiltgen authored
For the ROCm libraries to access the driver, we need to add the ollama user to the render group.
-
Jeffrey Morgan authored
-
Bruce MacDonald authored
-
Daniel Hiltgen authored
Fix windows system memory lookup
-
Daniel Hiltgen authored
This refines the gpu package error handling and fixes a bug with the system memory lookup on Windows.
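As an illustration of the kind of lookup involved, here is a hedged Go sketch that reads total and available physical memory on Windows via GlobalMemoryStatusEx from kernel32.dll; it is not the actual gpu package code.

```go
//go:build windows

// Sketch of a Windows system memory lookup using GlobalMemoryStatusEx.
package main

import (
	"fmt"
	"syscall"
	"unsafe"
)

// memoryStatusEx mirrors the Win32 MEMORYSTATUSEX structure.
type memoryStatusEx struct {
	Length               uint32
	MemoryLoad           uint32
	TotalPhys            uint64
	AvailPhys            uint64
	TotalPageFile        uint64
	AvailPageFile        uint64
	TotalVirtual         uint64
	AvailVirtual         uint64
	AvailExtendedVirtual uint64
}

func systemMemory() (total, free uint64, err error) {
	kernel32 := syscall.NewLazyDLL("kernel32.dll")
	proc := kernel32.NewProc("GlobalMemoryStatusEx")
	var m memoryStatusEx
	m.Length = uint32(unsafe.Sizeof(m))
	// The call returns nonzero on success, so check the return value
	// rather than the always-populated error from Call.
	r, _, callErr := proc.Call(uintptr(unsafe.Pointer(&m)))
	if r == 0 {
		return 0, 0, callErr
	}
	return m.TotalPhys, m.AvailPhys, nil
}

func main() {
	total, free, err := systemMemory()
	if err != nil {
		fmt.Println("memory lookup failed:", err)
		return
	}
	fmt.Printf("total=%d bytes free=%d bytes\n", total, free)
}
```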
-
Daniel Hiltgen authored
Refactor how we augment llama.cpp and refine windows native build
-
Bruce MacDonald authored
-
Cole Gillespie authored
-
Jeffrey Morgan authored
-
Patrick Devine authored
-
Jeffrey Morgan authored
-
- 02 Jan, 2024 2 commits
-
-
Daniel Hiltgen authored
This one log line was triggering a single-line llama.log to be generated in the pwd of the server.
-
Daniel Hiltgen authored
-