"git@developer.sourcefind.cn:zhaoyu6/sglang.git" did not exist on "1b7adbb5a0cfb6826e8b7d45807fc3900b4a5f25"
  1. 21 Nov, 2024 4 commits
  2. 20 Nov, 2024 14 commits
  3. 19 Nov, 2024 5 commits
  4. 18 Nov, 2024 5 commits
  5. 17 Nov, 2024 5 commits
  6. 16 Nov, 2024 1 commit
  7. 15 Nov, 2024 3 commits
    • Jesse Gross's avatar
      runner.go: Propagate panics back to the user. · d875e99e
      Jesse Gross authored
      This is a partial revert of 8a35bb92
      "runner.go: Increase survivability of main processing loop", removing
      the panic handler.
      
      Although we want to avoid errors taking down the runner, we also
      should make the user aware of problems when they happen. In the
      future, we can restructure things so both parts are true.
      d875e99e
    • Jesse Gross's avatar
      runner.go: Increase survivability of main processing loop · 8a35bb92
      Jesse Gross authored
      Currently, if an error occurs during the prep stages (such as
      tokenizing) of a single request, it will only affect that request.
      However, if an error happens during decoding, it can take down the
      entire runner.
      
      Instead, it's better to drop the tokens that triggered the error and try to
      keep going. However, we also need to stop when we run out of tokens,
      otherwise, this just causes an infinite loop. This is likely the cause
      of at least some of the hanging issues that have been reported.
      
      Bug #7573
      8a35bb92
    • Daniel Hiltgen's avatar
      build: fix arm container image (#7674) · a0ea067b
      Daniel Hiltgen authored
      Fix a rebase glitch from the old C++ runner build model
      a0ea067b
  8. 14 Nov, 2024 3 commits
    • Patrick Devine's avatar
      4efb98cb
    • Bruce MacDonald's avatar
      chore(deps): bump golang.org/x dependencies (#7655) · 0679d491
      Bruce MacDonald authored
      - golang.org/x/sync v0.3.0 -> v0.9.0
      - golang.org/x/image v0.14.0 -> v0.22.0
      - golang.org/x/text v0.15.0 -> v0.20.0
      0679d491
    • Jesse Gross's avatar
      runner.go: Don't trim whitespace from inputs · c25ffde9
      Jesse Gross authored
      It's possible to get prompts that consist entirely of whitespace -
      this is most likely to happen when generating embeddings. Currently,
      we will trim this away, leaving an empty prompt, which will then
      generate an error.
      
      Generating embeddings from whitespace should not trigger an error,
      as this may break pipelines. It's better to just leave the whitespace
      in place and process what we are given. This is consistent with
      past versions of Ollama.
      
      Bug #7578
      c25ffde9