1. 02 Jan, 2024 1 commit
    • Daniel Hiltgen's avatar
      Refactor how we augment llama.cpp · 9a70aecc
      Daniel Hiltgen authored
      This changes the model for llama.cpp inclusion so we're not applying a patch,
      but instead have the C++ code directly in the ollama tree, which should make it
      easier to refine and update over time.
      9a70aecc
  2. 27 Dec, 2023 1 commit
  3. 22 Dec, 2023 4 commits
  4. 21 Dec, 2023 1 commit
    • Daniel Hiltgen's avatar
      Revive windows build · d9cd3d96
      Daniel Hiltgen authored
      The windows native setup still needs some more work, but this gets it building
      again and if you set the PATH properly, you can run the resulting exe on a cuda system.
      d9cd3d96
  5. 20 Dec, 2023 1 commit
    • Daniel Hiltgen's avatar
      Revamp the dynamic library shim · 7555ea44
      Daniel Hiltgen authored
      This switches the default llama.cpp to be CPU based, and builds the GPU variants
      as dynamically loaded libraries which we can select at runtime.
      
      This also bumps the ROCm library to version 6 given 5.7 builds don't work
      on the latest ROCm library that just shipped.
      7555ea44
  6. 19 Dec, 2023 10 commits
  7. 18 Dec, 2023 3 commits
  8. 14 Dec, 2023 1 commit
  9. 13 Dec, 2023 1 commit
  10. 12 Dec, 2023 2 commits
  11. 11 Dec, 2023 2 commits
  12. 10 Dec, 2023 4 commits
  13. 09 Dec, 2023 1 commit
  14. 05 Dec, 2023 7 commits
  15. 04 Dec, 2023 1 commit
    • Bruce MacDonald's avatar
      chat api (#991) · 7a0899d6
      Bruce MacDonald authored
      - update chat docs
      - add messages chat endpoint
      - remove deprecated context and template generate parameters from docs
      - context and template are still supported for the time being and will continue to work as expected
      - add partial response to chat history
      7a0899d6