1. 18 Jul, 2023 5 commits
  2. 17 Jul, 2023 2 commits
  3. 15 Jul, 2023 1 commit
  4. 14 Jul, 2023 1 commit
  5. 13 Jul, 2023 4 commits
  6. 12 Jul, 2023 9 commits
  7. 10 Jul, 2023 1 commit
  8. 07 Jul, 2023 1 commit
  9. 06 Jul, 2023 2 commits
  10. 05 Jul, 2023 1 commit
  11. 04 Jul, 2023 5 commits
  12. 03 Jul, 2023 1 commit
  13. 01 Jul, 2023 1 commit
  14. 30 Jun, 2023 4 commits
  15. 28 Jun, 2023 1 commit
    • Robert Kimball's avatar
      feat(router): add header option to disable buffering for the generate_stream response (#498) · 70f485bf
      Robert Kimball authored
      # This PR adds an http header option to disable buffering for the
      generate_stream endpoint response stream.
      
      Problem: If a model is run behind a proxy server such as nginx that has
      buffering enabled then the response stream from generate_stream gets
      aggregated into a single response which basically disables streaming.
      Instead of getting a chunked response where each token is presented over
      time the response presents everything all at once.
      
      Solution: This change adds the `X-Accel-Buffering` http header which
      disables buffering for the generate_stream response, allowing the
      response to stream properly.
      70f485bf
  16. 26 Jun, 2023 1 commit