1. 30 Aug, 2023 1 commit
    • Quinn Slack's avatar
      treat stop as stop sequences, not exact tokens (#442) · f4432e1d
      Quinn Slack authored
      The `stop` option to the generate API is a list of sequences that should cause generation to stop. Although these are commonly called "stop tokens", they do not necessarily correspond to LLM tokens (per the LLM's tokenizer). For example, if the caller sends a generate request with `"stop":["\n"]`, then generation should stop on any token containing `\n` (and trim `\n` from the output), not just if the token exactly matches `\n`. If `stop` were interpreted strictly as LLM tokens, then it would require callers of the generate API to know the LLM's tokenizer and enumerate many tokens in the `stop` list.
      
      Fixes https://github.com/jmorganca/ollama/issues/295.
      f4432e1d
  2. 18 Aug, 2023 1 commit
  3. 17 Aug, 2023 1 commit
  4. 14 Aug, 2023 2 commits
  5. 13 Aug, 2023 1 commit
  6. 10 Aug, 2023 2 commits
  7. 09 Aug, 2023 2 commits
  8. 08 Aug, 2023 2 commits
  9. 07 Aug, 2023 1 commit
    • Michael Yang's avatar
      automatically set num_keep if num_keep < 0 · 4dc5b117
      Michael Yang authored
      num_keep defines how many tokens to keep in the context when truncating
      inputs. if left to its default value of -1, the server will calculate
      num_keep to be the left of the system instructions
      4dc5b117
  10. 04 Aug, 2023 1 commit
  11. 03 Aug, 2023 1 commit
  12. 02 Aug, 2023 1 commit
  13. 01 Aug, 2023 2 commits
  14. 28 Jul, 2023 3 commits
  15. 27 Jul, 2023 6 commits
  16. 25 Jul, 2023 2 commits
  17. 22 Jul, 2023 1 commit
  18. 15 Jul, 2023 2 commits
  19. 14 Jul, 2023 1 commit
  20. 13 Jul, 2023 1 commit
  21. 12 Jul, 2023 3 commits
  22. 11 Jul, 2023 2 commits
  23. 09 Jul, 2023 1 commit