  1. 14 Nov, 2024 5 commits
  2. 12 Nov, 2024 8 commits
  3. 11 Nov, 2024 4 commits
  4. 10 Nov, 2024 1 commit
  5. 08 Nov, 2024 3 commits
  6. 07 Nov, 2024 5 commits
  7. 06 Nov, 2024 3 commits
  8. 05 Nov, 2024 4 commits
    • Update README.md (#7516) · 9d71bcc3
      RAPID ARCHITECT authored
      Added Reddit Rate below Hexabot: an Ollama-powered Reddit search and analysis tool with Streamlit for the interface.
    • One corrupt manifest should not wedge model operations (#7515) · a4c70fe1
      Daniel Hiltgen authored
      One potential failure mode is an empty file which bubbles up as an EOF error,
      leading to all pulls and listing operations failing.  Instead, continue and
      warn about the corrupt manifest.  This also allows re-pulling the corrupt
      manifest to repair the system.
    • prompt: Use a single token when estimating mllama context size · 34a75102
      Jesse Gross authored
      Currently we assume that images take 768 tokens of context size for
      the purposes of clipping old messages that exceed the context window.
      However, our mllama implementation stores the full image embedding
      in a single token. As a result, there is significant waste of context
      space.
      
      Ideally, we would handle this more generically and have the
      implementation report the number of tokens. However, at the moment
      this would just result in a similar set of 'if' conditions in the
      runner plus APIs to report it back. So for now, we just keep this
      simple.
  9. 04 Nov, 2024 6 commits
  10. 02 Nov, 2024 1 commit