index.rst 2.5 KB
Newer Older
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
..
    SPDX-FileCopyrightText: Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
    SPDX-License-Identifier: Apache-2.0

    Licensed under the Apache License, Version 2.0 (the "License");
    you may not use this file except in compliance with the License.
    You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

    Unless required by applicable law or agreed to in writing, software
    distributed under the License is distributed on an "AS IS" BASIS,
    WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
    See the License for the specific language governing permissions and
    limitations under the License.

17
18
19
20
..
   Main Page
..

21
22
23
Welcome to NVIDIA Dynamo
========================

24
The NVIDIA Dynamo Platform is a high-performance, low-latency inference framework designed to serve all AI models—across any framework, architecture, or deployment scale.
25

26
27
28
.. admonition:: 💎 Discover the latest developments!
   :class: seealso

29
   This guide is a snapshot of a specific point in time. For the latest information, examples, and Release Assets, see the `Dynamo GitHub repository <https://github.com/ai-dynamo/dynamo/releases/latest>`_.
30

31
32
33
Quickstart
==========
.. include:: _includes/quick_start_local.rst
34

35
36
37
..
   Sidebar
..
38
39
40

.. toctree::
   :hidden:
41
   :caption: Getting Started
42

43
44
   Quickstart <self>
   Installation <_sections/installation>
45
   Support Matrix <support_matrix.md>
46
47
   Architecture <_sections/architecture>
   Examples <_sections/examples>
48
49
50

.. toctree::
   :hidden:
51
   :caption: Kubernetes Deployment
52

53
54
55
56
57
58
59
   Quickstart (K8s) <../kubernetes/README.md>
   Detailed Installation Guide <../kubernetes/installation_guide.md>
   Dynamo Operator <../kubernetes/dynamo_operator.md>
   Metrics <../kubernetes/metrics.md>
   Logging <../kubernetes/logging.md>
   Multinode <../kubernetes/multinode-deployment.md>
   Minikube Setup <../kubernetes/minikube.md>
60
61
62

.. toctree::
   :hidden:
63
   :caption: Components
64

65
66
67
68
   Backends <_sections/backends>
   Router <components/router/README>
   Planner <architecture/planner_intro>
   KVBM <architecture/kvbm_intro>
69
70
71

.. toctree::
   :hidden:
72
   :caption: Developer Guide
atchernych's avatar
atchernych committed
73

74
75
76
77
   Benchmarking Guide <benchmarks/benchmarking.md>
   Planner Benchmark Example <guides/planner_benchmark/README.md>
   Logging <guides/logging.md>
   Health Checks <guides/health_check.md>
78
79
   Tuning Disaggregated Serving Performance <guides/disagg_perf_tuning.md>
   Writing Python Workers in Dynamo <guides/backend.md>
80
   Glossary <dynamo_glossary.md>