1. 30 Aug, 2023 1 commit
    • Quinn Slack's avatar
      treat stop as stop sequences, not exact tokens (#442) · f4432e1d
      Quinn Slack authored
      The `stop` option to the generate API is a list of sequences that should cause generation to stop. Although these are commonly called "stop tokens", they do not necessarily correspond to LLM tokens (per the LLM's tokenizer). For example, if the caller sends a generate request with `"stop":["\n"]`, then generation should stop on any token containing `\n` (and trim `\n` from the output), not just if the token exactly matches `\n`. If `stop` were interpreted strictly as LLM tokens, then it would require callers of the generate API to know the LLM's tokenizer and enumerate many tokens in the `stop` list.
      
      Fixes https://github.com/jmorganca/ollama/issues/295.
      f4432e1d
  2. 29 Aug, 2023 1 commit
  3. 28 Aug, 2023 2 commits
  4. 26 Aug, 2023 1 commit
  5. 22 Aug, 2023 6 commits
  6. 18 Aug, 2023 2 commits
    • Michael Yang's avatar
      retry on unauthorized chunk push · 3b49315f
      Michael Yang authored
      The token printed for authorized requests has a lifetime of 1h. If an
      upload exceeds 1h, a chunk push will fail since the token is created on
      a "start upload" request.
      
      This replaces the Pipe with SectionReader which is simpler and
      implements Seek, a requirement for makeRequestWithRetry. This is
      slightly worse than using a Pipe since the progress update is directly
      tied to the chunk size instead of controlled separately.
      3b49315f
    • Michael Yang's avatar
      copy metadata from source · 7eda70f2
      Michael Yang authored
      7eda70f2
  7. 17 Aug, 2023 3 commits
  8. 16 Aug, 2023 3 commits
  9. 15 Aug, 2023 3 commits
  10. 14 Aug, 2023 5 commits
  11. 11 Aug, 2023 1 commit
  12. 10 Aug, 2023 3 commits
  13. 09 Aug, 2023 3 commits
  14. 08 Aug, 2023 5 commits
  15. 03 Aug, 2023 1 commit