- 12 Feb, 2026 1 commit
-
-
Jonathan Tong authored
Signed-off-by: Jont828 <jt572@cornell.edu>
-
- 10 Feb, 2026 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 09 Feb, 2026 1 commit
-
-
MatejKosec authored
Wrap wait_for_deployment_ready() in try/except TimeoutError for both prefill and decode profiling sweeps.
On timeout: log the error, record it via add_profiling_error(), clean up the timed-out deployment, and continue to the next parallelization mapping.
Previously, a single deployment timeout would crash the entire profiler job.
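The timeout handling described in this commit message can be sketched roughly as below. This is a minimal illustration, not the profiler's actual code: only the names `wait_for_deployment_ready` and `add_profiling_error` come from the message, and their signatures here are assumptions. `sweep`, `deploy`, `benchmark`, and `cleanup` are hypothetical helpers passed in as callables so the sketch stays self-contained.

```python
import logging

logger = logging.getLogger("profiler_sketch")


def sweep(mappings, deploy, wait_for_deployment_ready, benchmark,
          cleanup, add_profiling_error):
    """Profile each parallelization mapping; skip mappings whose deployment times out.

    All callables are hypothetical stand-ins supplied by the caller.
    """
    results = {}
    for mapping in mappings:
        deployment = deploy(mapping)
        try:
            wait_for_deployment_ready(deployment)
        except TimeoutError as exc:
            # Log, record the failure, clean up, and move on to the next mapping
            # instead of letting one timeout abort the whole sweep.
            logger.error("Deployment for %s timed out: %s", mapping, exc)
            add_profiling_error(f"{mapping}: deployment timed out")
            cleanup(deployment)
            continue
        try:
            results[mapping] = benchmark(deployment)
        finally:
            # Always release the deployment once it has been benchmarked.
            cleanup(deployment)
    return results
```

The same loop shape would apply to both the prefill and the decode sweep, with a timed-out mapping simply missing from the results and present in the recorded errors.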
-
- 06 Feb, 2026 1 commit
-
-
dagil-nvidia authored
Signed-off-by: Dan Gil <dagil@nvidia.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
-
- 05 Feb, 2026 1 commit
-
-
hhzhang16 authored
feat: remove default model name in Profiler; validate for one of served model name and model path in Profiler (#5950)
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
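One plausible reading of the validation in this commit title is "at least one of the two values must be supplied, and nothing is defaulted". The sketch below illustrates that reading only. The function name, the parameter names, and the choice to prefer the served model name over the path are all assumptions, not the Profiler's actual behavior.

```python
from typing import Optional


def resolve_model_identifier(served_model_name: Optional[str],
                             model_path: Optional[str]) -> str:
    """Return the identifier to profile, with no built-in default.

    Hypothetical helper: raises if neither value is supplied.
    """
    if not served_model_name and not model_path:
        raise ValueError(
            "Provide at least one of served_model_name or model_path"
        )
    # Assumption: the served name takes precedence when both are given.
    return served_model_name or model_path
```

Failing early like this surfaces a missing model configuration before any deployment is created.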
-
- 04 Feb, 2026 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 03 Feb, 2026 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 16 Jan, 2026 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 14 Jan, 2026 1 commit
-
-
MatejKosec authored
Extends the MoE planner profiler to support TEP (tensor expert parallel) and DEP (data expert parallel) configs with the vLLM backend.
-
- 12 Jan, 2026 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 09 Jan, 2026 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 06 Jan, 2026 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 05 Jan, 2026 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 02 Jan, 2026 1 commit
-
-
Tushar Sharma authored
Signed-off-by: Tushar Sharma <tusharma@nvidia.com>
-
- 01 Jan, 2026 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 31 Dec, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 18 Dec, 2025 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 16 Dec, 2025 1 commit
-
-
hhzhang16 authored
feat: Profiler WebUI improvements -- error handling, GPU hours, style fixes, preview configs (#4968)
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 12 Dec, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 10 Dec, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
Signed-off-by: Hongkuan Zhou <tedzhouhk@gmail.com>
Co-authored-by: hhzhang16 <54051230+hhzhang16@users.noreply.github.com>
-
- 03 Dec, 2025 2 commits
-
-
Jason Zhou authored
Signed-off-by: Jason Zhou <jasonzho@nvidia.com>
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 02 Dec, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 26 Nov, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 17 Nov, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 14 Nov, 2025 1 commit
-
-
Harrison Saturley-Hall authored
Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com>
Signed-off-by: Harrison Saturley-Hall <harrison.saturley.hall@gmail.com>
Co-authored-by: hhzhang16 <54051230+hhzhang16@users.noreply.github.com>
-
- 13 Nov, 2025 1 commit
-
-
Nate Mailhot authored
feat: check for broken symlinks. add back lychee external link checker with retries to fix failure (#4125)
Signed-off-by: Nate Mailhot <nmailhot@nvidia.com>
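A broken-symlink check of the kind this commit title mentions could look roughly like the sketch below. It is an illustration using only the Python standard library. The function name is hypothetical, and the actual check in the repository may be implemented differently (for example as a shell step in CI).

```python
import os
from pathlib import Path


def find_broken_symlinks(root):
    """Return a sorted list of symlinks under `root` whose targets do not exist."""
    broken = []
    # os.walk does not follow symlinked directories by default, so links to
    # directories show up in `dirnames` and dangling links in `filenames`.
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = Path(dirpath) / name
            # is_symlink() inspects the link itself; exists() follows it.
            if path.is_symlink() and not path.exists():
                broken.append(path)
    return sorted(broken)
```

A CI job could fail the build whenever the returned list is non-empty and print the offending paths.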
-
- 12 Nov, 2025 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 10 Nov, 2025 3 commits
-
-
Jason Zhou authored
Signed-off-by: Jason Zhou <jasonzho@jasonzho-mlt.client.nvidia.com>
Signed-off-by: Jason Zhou <jasonzho@nvidia.com>
Co-authored-by: Jason Zhou <jasonzho@jasonzho-mlt.client.nvidia.com>
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
Co-authored-by: hongkuanz <hongkuanz@nvidia.com>
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
Signed-off-by: Hongkuan Zhou <tedzhouhk@gmail.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
-
- 07 Nov, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
Signed-off-by: Hongkuan Zhou <tedzhouhk@gmail.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
-
- 05 Nov, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
- 04 Nov, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
Co-authored-by: Hannah Zhang <hannahz@nvidia.com>
-
- 03 Nov, 2025 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
Co-authored-by: hongkuanz <hongkuanz@nvidia.com>
Co-authored-by: Hongkuan Zhou <tedzhouhk@gmail.com>
-
- 31 Oct, 2025 2 commits
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
Anant Sharma authored
Signed-off-by: Anant Sharma <anants@nvidia.com>
-
- 30 Oct, 2025 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 29 Oct, 2025 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
-
- 27 Oct, 2025 1 commit
-
-
hhzhang16 authored
Signed-off-by: Hannah Zhang <hannahz@nvidia.com>
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
Signed-off-by: Hongkuan Zhou <tedzhouhk@gmail.com>
Co-authored-by: hongkuanz <hongkuanz@nvidia.com>
Co-authored-by: Hongkuan Zhou <tedzhouhk@gmail.com>
-