1. 03 Mar, 2026 1 commit
  2. 02 Mar, 2026 3 commits
  3. 27 Feb, 2026 1 commit
  4. 26 Feb, 2026 2 commits
  5. 25 Feb, 2026 3 commits
  6. 24 Feb, 2026 3 commits
  7. 20 Feb, 2026 1 commit
  8. 19 Feb, 2026 2 commits
  9. 15 Feb, 2026 1 commit
  10. 13 Feb, 2026 1 commit
  11. 12 Feb, 2026 2 commits
  12. 10 Feb, 2026 2 commits
  13. 09 Feb, 2026 1 commit
    • MatejKosec's avatar
      fix: profiler deployment timeout handling for MoE models (#6086) · 67329d10
      MatejKosec authored
      Wrap wait_for_deployment_ready() in try/except TimeoutError for both prefill and decode profiling sweeps
      On timeout: log error, record via add_profiling_error(), clean up the timed-out deployment, and continue to the next parallelization mapping
      Previously, a single deployment timeout would crash the entire profiler job
      67329d10
  14. 06 Feb, 2026 3 commits
  15. 05 Feb, 2026 3 commits
  16. 04 Feb, 2026 2 commits
  17. 03 Feb, 2026 2 commits
  18. 29 Jan, 2026 1 commit
  19. 28 Jan, 2026 1 commit
  20. 27 Jan, 2026 2 commits
  21. 26 Jan, 2026 1 commit
  22. 23 Jan, 2026 1 commit
  23. 21 Jan, 2026 1 commit