1. 10 Mar, 2026 1 commit
  2. 08 Mar, 2026 1 commit
  3. 07 Mar, 2026 1 commit
  4. 06 Mar, 2026 1 commit
  5. 05 Mar, 2026 2 commits
  6. 03 Mar, 2026 1 commit
  7. 02 Mar, 2026 3 commits
  8. 27 Feb, 2026 1 commit
  9. 26 Feb, 2026 2 commits
  10. 25 Feb, 2026 3 commits
  11. 24 Feb, 2026 3 commits
  12. 20 Feb, 2026 1 commit
  13. 19 Feb, 2026 2 commits
  14. 15 Feb, 2026 1 commit
  15. 13 Feb, 2026 1 commit
  16. 12 Feb, 2026 2 commits
  17. 10 Feb, 2026 2 commits
  18. 09 Feb, 2026 1 commit
    • MatejKosec's avatar
      fix: profiler deployment timeout handling for MoE models (#6086) · 67329d10
      MatejKosec authored
      Wrap wait_for_deployment_ready() in try/except TimeoutError for both prefill and decode profiling sweeps
      On timeout: log error, record via add_profiling_error(), clean up the timed-out deployment, and continue to the next parallelization mapping
      Previously, a single deployment timeout would crash the entire profiler job
      67329d10
  19. 06 Feb, 2026 3 commits
  20. 05 Feb, 2026 3 commits
  21. 04 Feb, 2026 2 commits
  22. 03 Feb, 2026 2 commits
  23. 29 Jan, 2026 1 commit