Commits · c5ff443b9f7264d0973dcc2ce671d0ff174cc34f · OpenDAS / ollama

09 Apr, 2024 8 commits
- Handle very slow model loads · c5ff443b
  Daniel Hiltgen authored Apr 09, 2024
```
During testing, we're seeing some models take over 3 minutes.
```
- Revert "build.go: introduce a friendlier way to build Ollama (#3548)" (#3564) · 1524f323
  Blake Mizerany authored Apr 09, 2024
  
- build.go: introduce a friendlier way to build Ollama (#3548) · fccf3eec
  Blake Mizerany authored Apr 09, 2024
```
This commit introduces a more friendly way to build Ollama dependencies
and the binary without abusing `go generate` and removing the
unnecessary extra steps it brings with it.

This script also provides nicer feedback to the user about what is
happening during the build process.

At the end, it prints a helpful message to the user about what to do
next (e.g. run the new local Ollama).
```
- Merge pull request #3506 from ollama/mxyng/quantize-redux · c77d45d8
  Michael Yang authored Apr 09, 2024
```
cgo quantize
```
- update llama.cpp submodule to `1b67731` (#3561) · 5ec12cec
  Jeffrey Morgan authored Apr 09, 2024
  
- Merge pull request #3559 from ollama/mxyng/ci · d9578d2b
  Michael Yang authored Apr 09, 2024
```
ci: use go-version-file
```
- ci: use go-version-file · cb8352d6
  Michael Yang authored Apr 09, 2024
  
- Correct directory reference in macapp/README (#3555) · fc6558f4
  Alex Mavrogiannis authored Apr 09, 2024
  
08 Apr, 2024 3 commits
- cgo quantize · 9502e566
  Michael Yang authored Apr 05, 2024
  
- no blob create if already exists · e1c9a2a0
  Michael Yang authored Apr 05, 2024
  
- Update README.md (#3539) · 1341ee1b
  writinwaters authored Apr 08, 2024
```
RAGFlow now supports integration with Ollama.
```
07 Apr, 2024 1 commit
- update generate scripts with new `LLAMA_CUDA` variable, set `HIP_PLATFORM` to... · 63efa075
  Jeffrey Morgan authored Apr 07, 2024
```
update generate scripts with new `LLAMA_CUDA` variable, set `HIP_PLATFORM` to avoid compiler errors (#3528)
```
06 Apr, 2024 3 commits
- Docs: Remove wrong parameter for Chat Completion (#3515) · cb03fc95
  Thomas Vitale authored Apr 06, 2024
```
Fixes gh-3514
Signed-off-by: Thomas Vitale <ThomasVitale@users.noreply.github.com>
```
- Merge pull request #3508 from ollama/mxyng/rope · a5ec9cfc
  Michael Yang authored Apr 05, 2024
  
- no rope parameters · be517e49
  Michael Yang authored Apr 05, 2024
  
05 Apr, 2024 1 commit
- Merge pull request #3496 from ollama/mxyng/cmd-r-graph · fc8e1086
  Michael Yang authored Apr 05, 2024
```
add command-r graph estimate
```
04 Apr, 2024 13 commits
- Merge pull request #3491 from dhiltgen/context_bust_test · c5d5c4a9
  Daniel Hiltgen authored Apr 04, 2024
```
Add test case for context exhaustion
```
- Merge pull request #3488 from mofanke/fix-windows-dll-compress · dfe330fa
  Daniel Hiltgen authored Apr 04, 2024
```
fix dll compress in windows building
```
- add command-r graph estimate · 01f77ae2
  Michael Yang authored Apr 04, 2024
  
- Merge pull request #3494 from dhiltgen/ci_release · 483b81a8
  Daniel Hiltgen authored Apr 04, 2024
```
Fail fast if mingw missing on windows
```
- Fail fast if mingw missing on windows · 36bd9677
  Daniel Hiltgen authored Apr 04, 2024
  
- use an older version of the mac os sdk in release (#3484) · b0e7d35d
  Jeffrey Morgan authored Apr 04, 2024
  
- Add test case for context exhaustion · aeb1fb51
  Daniel Hiltgen authored Apr 04, 2024
```
Confirmed this fails on 0.1.30 with known regression
but passes on main
```
- Merge pull request #3490 from dhiltgen/ci_fixes · a2e60ebc
  Daniel Hiltgen authored Apr 04, 2024
```
CI missing archive
```
- CI missing archive · 883ec4d1
  Daniel Hiltgen authored Apr 04, 2024
  
- fix dll compress in windows building · 4de01267
  mofanke authored Apr 04, 2024
  
- Merge pull request #3481 from dhiltgen/ci_fixes · 9768e2dc
  Daniel Hiltgen authored Apr 03, 2024
```
CI subprocess path fix
```
- CI subprocess path fix · 08600d5b
  Daniel Hiltgen authored Apr 03, 2024
  
- Merge pull request #3479 from dhiltgen/ci_fixes · a624e672
  Daniel Hiltgen authored Apr 03, 2024
```
Fix CI release glitches
```
03 Apr, 2024 8 commits
- Fix CI release glitches · e4a7e5b2
  Daniel Hiltgen authored Apr 03, 2024
```
The subprocess change moved the build directory
arm64 builds weren't setting cross-compilation flags when building on x86
```
- Merge pull request #3463 from ollama/mxyng/graph-estimate · a0a15cfd
  Michael Yang authored Apr 03, 2024
```
update graph size estimate
```
- update graph size estimate · 12e923e1
  Michael Yang authored Apr 02, 2024
  
- Fix macOS builds on older SDKs (#3467) · cd135317
  Jeffrey Morgan authored Apr 03, 2024
  
- Merge pull request #3466 from ollama/mxyng/head-kv · 4f895d63
  Michael Yang authored Apr 03, 2024
```
default head_kv to 1
```
- cmd: provide feedback if OLLAMA_MODELS is set on non-serve command (#3470) · 7d05a6ee
  Blake Mizerany authored Apr 02, 2024
```
This also moves the checkServerHeartbeat call out of the "RunE" Cobra
stuff (that's the only word I have for that) to on-site where it's after
the check for OLLAMA_MODELS, which allows the helpful error message to
be printed before the server heartbeat check. This also arguably makes
the code more readable without the magic/superfluous "pre" function
caller.
```
- Merge pull request #3464 from dhiltgen/subprocess · 464d8178
  Daniel Hiltgen authored Apr 02, 2024
```
Fix numgpu opt miscomparison
```
- feat: add OLLAMA_DEBUG in ollama server help message (#3461) · 531324a9
  Pier Francesco Contino authored Apr 03, 2024
```
Co-authored-by: Pier Francesco Contino <pfcontino@gmail.com>
```
02 Apr, 2024 3 commits
- Revert options as a ref in the server · 6589eb8a
  Daniel Hiltgen authored Apr 02, 2024
  
- default head_kv to 1 · 90f071c6
  Michael Yang authored Apr 02, 2024
  
- Merge pull request #3465 from ollama/mxyng/fix-metal · a039e383
  Michael Yang authored Apr 02, 2024
```
fix metal gpu
```