1. 27 Jun, 2024 1 commit
  2. 25 Jun, 2024 1 commit
    • Blake Mizerany's avatar
      cmd: defer stating model info until necessary (#5248) · 2aa91a93
      Blake Mizerany authored
      This commit changes the 'ollama run' command to defer fetching model
      information until it really needs it. That is, when in interactive mode.
      
      It also removes one such case where the model information is fetch in
      duplicate, just before calling generateInteractive and then again, first
      thing, in generateInteractive.
      
      This positively impacts the performance of the command:
      
          ; time ./before run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.168 total
          ; time ./before run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.220 total
          ; time ./before run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./before run llama3 'hi'  0.02s user 0.01s system 2% cpu 1.217 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./after run llama3 'hi'  0.02s user 0.01s system 4% cpu 0.652 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./after run llama3 'hi'  0.01s user 0.01s system 5% cpu 0.498 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with or would you like to chat?
      
          ./after run llama3 'hi'  0.01s user 0.01s system 3% cpu 0.479 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./after run llama3 'hi'  0.02s user 0.01s system 5% cpu 0.507 total
          ; time ./after run llama3 'hi'
          Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
      
          ./after run llama3 'hi'  0.02s user 0.01s system 5% cpu 0.507 total
      2aa91a93
  3. 19 Jun, 2024 1 commit
    • royjhan's avatar
      Extend api/show and ollama show to return more model info (#4881) · fedf7163
      royjhan authored
      
      
      * API Show Extended
      
      * Initial Draft of Information
      Co-Authored-By: default avatarPatrick Devine <pdevine@sonic.net>
      
      * Clean Up
      
      * Descriptive arg error messages and other fixes
      
      * Second Draft of Show with Projectors Included
      
      * Remove Chat Template
      
      * Touches
      
      * Prevent wrapping from files
      
      * Verbose functionality
      
      * Docs
      
      * Address Feedback
      
      * Lint
      
      * Resolve Conflicts
      
      * Function Name
      
      * Tests for api/show model info
      
      * Show Test File
      
      * Add Projector Test
      
      * Clean routes
      
      * Projector Check
      
      * Move Show Test
      
      * Touches
      
      * Doc update
      
      ---------
      Co-authored-by: default avatarPatrick Devine <pdevine@sonic.net>
      fedf7163
  4. 12 Jun, 2024 1 commit
  5. 04 Jun, 2024 4 commits
  6. 30 May, 2024 3 commits
  7. 24 May, 2024 1 commit
  8. 20 May, 2024 2 commits
  9. 18 May, 2024 1 commit
  10. 16 May, 2024 3 commits
  11. 15 May, 2024 2 commits
  12. 14 May, 2024 2 commits
  13. 13 May, 2024 2 commits
  14. 11 May, 2024 1 commit
  15. 10 May, 2024 1 commit
  16. 06 May, 2024 1 commit
  17. 01 May, 2024 4 commits
  18. 30 Apr, 2024 1 commit
  19. 29 Apr, 2024 1 commit
  20. 26 Apr, 2024 1 commit
  21. 24 Apr, 2024 2 commits
  22. 15 Apr, 2024 2 commits
  23. 08 Apr, 2024 1 commit
  24. 03 Apr, 2024 1 commit
    • Blake Mizerany's avatar
      cmd: provide feedback if OLLAMA_MODELS is set on non-serve command (#3470) · 7d05a6ee
      Blake Mizerany authored
      This also moves the checkServerHeartbeat call out of the "RunE" Cobra
      stuff (that's the only word I have for that) to on-site where it's after
      the check for OLLAMA_MODELS, which allows the helpful error message to
      be printed before the server heartbeat check. This also arguably makes
      the code more readable without the magic/superfluous "pre" function
      caller.
      7d05a6ee