"docs/pages/kubernetes/autoscaling.md" did not exist on "f9050aae852b2f4985f8194cd775be432d8312e7"
  1. 24 Sep, 2025 1 commit
  2. 22 Aug, 2025 1 commit
  3. 21 Aug, 2025 1 commit
  4. 18 Aug, 2025 1 commit
  5. 15 Aug, 2025 1 commit
  6. 07 Aug, 2025 1 commit
  7. 05 Aug, 2025 1 commit
  8. 14 Jul, 2025 1 commit
  9. 01 Jul, 2025 1 commit
  10. 25 Jun, 2025 1 commit
  11. 12 Jun, 2025 1 commit
  12. 04 Jun, 2025 1 commit
  13. 27 May, 2025 1 commit
  14. 21 May, 2025 1 commit
  15. 19 May, 2025 1 commit
  16. 14 May, 2025 1 commit
    • Graham King's avatar
      feat(dynamo-run): Print HTTP routes on startup (#1010) · ed290f0a
      Graham King authored
      For #1006
      
      Prints this on startup:
      ```
      2025-05-09T13:06:34.529Z DEBUG dynamo_run::input::http: Supported routes: ["GET /metrics", "GET /dynamo/alpha/list-models", "GET /v1/models", "POST /v1/chat/completions", "POST /v1/completions"]
      ```
      ed290f0a
  17. 29 Apr, 2025 1 commit
    • Abrar Shivani's avatar
      feat: Add request template support for default inference parameters (#841) · adad2ecd
      Abrar Shivani authored
      Adds support for specifying default request parameters through a json template file that can be applied across all inference requests. This enables consistent parameter settings while still allowing per-request overrides.
      
      Changes:
      - Add --request-template CLI flag to specify template file path
      - Integrate template support in HTTP, batch and text input modes
      - Template values can be overridden by individual request parameters
      - Example template.json:
      ```
      {
          "model": "Qwen2.5-3B-Instruct",
          "temperature": 0.7,
          "max_completion_tokens": 4096
      }
      ```
      adad2ecd
  18. 25 Feb, 2025 2 commits
  19. 20 Feb, 2025 1 commit
  20. 12 Feb, 2025 1 commit
  21. 10 Feb, 2025 1 commit