"vllm/vscode:/vscode.git/clone" did not exist on "1150b65c326dac66a30a96b3944c8bacc4b7e72d"
  • MatejKosec's avatar
    fix: profiler deployment timeout handling for MoE models (#6086) · 67329d10
    MatejKosec authored
    Wrap wait_for_deployment_ready() in try/except TimeoutError for both prefill and decode profiling sweeps
    On timeout: log error, record via add_profiling_error(), clean up the timed-out deployment, and continue to the next parallelization mapping
    Previously, a single deployment timeout would crash the entire profiler job
    67329d10
profile_sla.py 36.6 KB