1. 12 Feb, 2026 2 commits
  2. 10 Feb, 2026 2 commits
  3. 09 Feb, 2026 1 commit
    • MatejKosec's avatar
      fix: profiler deployment timeout handling for MoE models (#6086) · 67329d10
      MatejKosec authored
      Wrap wait_for_deployment_ready() in try/except TimeoutError for both prefill and decode profiling sweeps
      On timeout: log error, record via add_profiling_error(), clean up the timed-out deployment, and continue to the next parallelization mapping
      Previously, a single deployment timeout would crash the entire profiler job
      67329d10
  4. 06 Feb, 2026 3 commits
  5. 05 Feb, 2026 3 commits
  6. 04 Feb, 2026 2 commits
  7. 03 Feb, 2026 2 commits
  8. 29 Jan, 2026 1 commit
  9. 28 Jan, 2026 1 commit
  10. 27 Jan, 2026 2 commits
  11. 26 Jan, 2026 1 commit
  12. 23 Jan, 2026 1 commit
  13. 21 Jan, 2026 1 commit
  14. 18 Jan, 2026 1 commit
  15. 16 Jan, 2026 3 commits
  16. 15 Jan, 2026 1 commit
  17. 14 Jan, 2026 1 commit
  18. 13 Jan, 2026 1 commit
  19. 12 Jan, 2026 2 commits
  20. 09 Jan, 2026 1 commit
  21. 08 Jan, 2026 1 commit
  22. 07 Jan, 2026 1 commit
  23. 06 Jan, 2026 2 commits
  24. 05 Jan, 2026 1 commit
  25. 02 Jan, 2026 2 commits
  26. 01 Jan, 2026 1 commit