Commits · e1c9a2a00fd555f33dae7f97b7900a9d636166b3 · OpenDAS / ollama

08 Apr, 2024 2 commits
- no blob create if already exists · e1c9a2a0
  Michael Yang authored Apr 05, 2024
  
  e1c9a2a0
- Update README.md (#3539) · 1341ee1b
  writinwaters authored Apr 08, 2024
```
RAGFlow now supports integration with Ollama.
```
  1341ee1b
07 Apr, 2024 1 commit
- update generate scripts with new `LLAMA_CUDA` variable, set `HIP_PLATFORM` to... · 63efa075
  Jeffrey Morgan authored Apr 07, 2024
```
update generate scripts with new `LLAMA_CUDA` variable, set `HIP_PLATFORM` to avoid compiler errors (#3528)
```
  63efa075
06 Apr, 2024 3 commits
- Docs: Remove wrong parameter for Chat Completion (#3515) · cb03fc95
  Thomas Vitale authored Apr 06, 2024
```
Fixes gh-3514
Signed-off-by: Thomas Vitale <ThomasVitale@users.noreply.github.com>
```
  cb03fc95
- Merge pull request #3508 from ollama/mxyng/rope · a5ec9cfc
  Michael Yang authored Apr 05, 2024
  
  a5ec9cfc
- no rope parameters · be517e49
  Michael Yang authored Apr 05, 2024
  
  be517e49
05 Apr, 2024 1 commit
- Merge pull request #3496 from ollama/mxyng/cmd-r-graph · fc8e1086
  Michael Yang authored Apr 05, 2024
```
add command-r graph estimate
```
  fc8e1086
04 Apr, 2024 13 commits
- Merge pull request #3491 from dhiltgen/context_bust_test · c5d5c4a9
  Daniel Hiltgen authored Apr 04, 2024
```
Add test case for context exhaustion
```
  c5d5c4a9
- Merge pull request #3488 from mofanke/fix-windows-dll-compress · dfe330fa
  Daniel Hiltgen authored Apr 04, 2024
```
fix dll compress in windows building
```
  dfe330fa
- add command-r graph estimate · 01f77ae2
  Michael Yang authored Apr 04, 2024
  
  01f77ae2
- Merge pull request #3494 from dhiltgen/ci_release · 483b81a8
  Daniel Hiltgen authored Apr 04, 2024
```
Fail fast if mingw missing on windows
```
  483b81a8
- Fail fast if mingw missing on windows · 36bd9677
  Daniel Hiltgen authored Apr 04, 2024
  
  36bd9677
- use an older version of the mac os sdk in release (#3484) · b0e7d35d
  Jeffrey Morgan authored Apr 04, 2024
  
  b0e7d35d
- Add test case for context exhaustion · aeb1fb51
  Daniel Hiltgen authored Apr 04, 2024
```
Confirmed this fails on 0.1.30 with known regression
but passes on main
```
  aeb1fb51
- Merge pull request #3490 from dhiltgen/ci_fixes · a2e60ebc
  Daniel Hiltgen authored Apr 04, 2024
```
CI missing archive
```
  a2e60ebc
- CI missing archive · 883ec4d1
  Daniel Hiltgen authored Apr 04, 2024
  
  883ec4d1
- fix dll compress in windows building · 4de01267
  mofanke authored Apr 04, 2024
  
  4de01267
- Merge pull request #3481 from dhiltgen/ci_fixes · 9768e2dc
  Daniel Hiltgen authored Apr 03, 2024
```
CI subprocess path fix
```
  9768e2dc
- CI subprocess path fix · 08600d5b
  Daniel Hiltgen authored Apr 03, 2024
  
  08600d5b
- Merge pull request #3479 from dhiltgen/ci_fixes · a624e672
  Daniel Hiltgen authored Apr 03, 2024
```
Fix CI release glitches
```
  a624e672
03 Apr, 2024 8 commits
- Fix CI release glitches · e4a7e5b2
  Daniel Hiltgen authored Apr 03, 2024
```
The subprocess change moved the build directory
arm64 builds weren't setting cross-compilation flags when building on x86
```
  e4a7e5b2
- Merge pull request #3463 from ollama/mxyng/graph-estimate · a0a15cfd
  Michael Yang authored Apr 03, 2024
```
update graph size estimate
```
  a0a15cfd
- update graph size estimate · 12e923e1
  Michael Yang authored Apr 02, 2024
  
  12e923e1
- Fix macOS builds on older SDKs (#3467) · cd135317
  Jeffrey Morgan authored Apr 03, 2024
  
  cd135317
- Merge pull request #3466 from ollama/mxyng/head-kv · 4f895d63
  Michael Yang authored Apr 03, 2024
```
default head_kv to 1
```
  4f895d63
- cmd: provide feedback if OLLAMA_MODELS is set on non-serve command (#3470) · 7d05a6ee
  Blake Mizerany authored Apr 02, 2024
```
This also moves the checkServerHeartbeat call out of the "RunE" Cobra
stuff (that's the only word I have for that) to on-site where it's after
the check for OLLAMA_MODELS, which allows the helpful error message to
be printed before the server heartbeat check. This also arguably makes
the code more readable without the magic/superfluous "pre" function
caller.
```
  7d05a6ee
- Merge pull request #3464 from dhiltgen/subprocess · 464d8178
  Daniel Hiltgen authored Apr 02, 2024
```
Fix numgpu opt miscomparison
```
  464d8178
- feat: add OLLAMA_DEBUG in ollama server help message (#3461) · 531324a9
  Pier Francesco Contino authored Apr 03, 2024
```
Co-authored-by: Pier Francesco Contino <pfcontino@gmail.com>
```
  531324a9
02 Apr, 2024 8 commits
- Revert options as a ref in the server · 6589eb8a
  Daniel Hiltgen authored Apr 02, 2024
  
  6589eb8a
- default head_kv to 1 · 90f071c6
  Michael Yang authored Apr 02, 2024
  
  90f071c6
- Merge pull request #3465 from ollama/mxyng/fix-metal · a039e383
  Michael Yang authored Apr 02, 2024
```
fix metal gpu
```
  a039e383
- fix metal gpu · 80163ebc
  Michael Yang authored Apr 02, 2024
  
  80163ebc
- Merge pull request #3343 from dhiltgen/bump_more2 · a57818d9
  Daniel Hiltgen authored Apr 02, 2024
```
Bump llama.cpp to b2581
```
  a57818d9
- Fix windows lint CI flakiness · 841adda1
  Daniel Hiltgen authored Apr 02, 2024
  
  841adda1
- Bump to b2581 · 0035e31a
  Daniel Hiltgen authored Mar 25, 2024
  
  0035e31a
- Merge pull request #3218 from dhiltgen/subprocess · c863c6a9
  Daniel Hiltgen authored Apr 02, 2024
```
Switch back to subprocessing for llama.cpp
```
  c863c6a9
01 Apr, 2024 4 commits
- Refined min memory from testing · 1f11b525
  Daniel Hiltgen authored Apr 01, 2024
  
  1f11b525
- Release gpu discovery library after use · 526d4eb2
  Daniel Hiltgen authored Mar 30, 2024
```
Leaving the cudart library loaded kept ~30m of memory
pinned in the GPU in the main process.  This change ensures
we don't hold GPU resources when idle.
```
  526d4eb2
- Safeguard for noexec · 0a74cb31
  Daniel Hiltgen authored Mar 28, 2024
```
We may have users that run into problems with our current
payload model, so this gives us an escape valve.
```
  0a74cb31
- Detect too-old cuda driver · 10ed1b62
  Daniel Hiltgen authored Mar 28, 2024
```
"cudart init failure: 35" isn't particularly helpful in the logs.
```
  10ed1b62