hidden_toctree.rst 2.73 KB
Newer Older
1
2
:orphan:

3
..
4
    SPDX-FileCopyrightText: Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
5
6
7
8
9
10
11
12
13
    SPDX-License-Identifier: Apache-2.0

.. This hidden toctree includes readmes etc that aren't meant to be in the main table of contents but should be accounted for in the sphinx project structure


.. toctree::
   :maxdepth: 2
   :hidden:

14
15
16
17
18
19
20
21
22
23
24
25
   development/runtime-guide.md
   api/nixl_connect/connector.md
   api/nixl_connect/descriptor.md
   api/nixl_connect/device.md
   api/nixl_connect/device_kind.md
   api/nixl_connect/operation_status.md
   api/nixl_connect/rdma_metadata.md
   api/nixl_connect/readable_operation.md
   api/nixl_connect/writable_operation.md
   api/nixl_connect/read_operation.md
   api/nixl_connect/write_operation.md
   api/nixl_connect/README.md
26

27
   kubernetes/api_reference.md
28
   kubernetes/deployment/create_deployment.md
29
   kubernetes/deployment/dynamomodel-guide.md
30
31
32
33
34

   kubernetes/fluxcd.md
   kubernetes/grove.md
   kubernetes/model_caching_with_fluid.md
   kubernetes/README.md
35
36
37
38
   reference/cli.md
   observability/metrics.md
   kvbm/vllm-setup.md
   kvbm/trtllm-setup.md
39
   agents/tool-calling.md
40
   development/jail_stream.md
41

42
   router/kv_cache_routing.md
Yan Ru Pei's avatar
Yan Ru Pei committed
43
   router/kv_events.md
44
   planner/load_planner.md
45
   fault_tolerance/README.md
46
47
   fault_tolerance/request_migration.md
   fault_tolerance/request_cancellation.md
48
49
50
51
52
   fault_tolerance/graceful_shutdown.md
   fault_tolerance/request_rejection.md
   fault_tolerance/testing.md
   design_docs/request_plane.md
   design_docs/event_plane.md
53

54
55
56
57
58
   backends/trtllm/multinode/multinode-examples.md
   backends/trtllm/llama4_plus_eagle.md
   backends/trtllm/kv-cache-transfer.md
   backends/trtllm/gemma3_sliding_window_attention.md
   backends/trtllm/gpt-oss.md
59
   backends/trtllm/prometheus.md
60
61
62

   backends/sglang/expert-distribution-eplb.md
   backends/sglang/gpt-oss.md
63
   backends/sglang/diffusion-lm.md
64
   backends/sglang/profiling.md
65
   backends/sglang/sgl-hicache-example.md
66
   backends/sglang/sglang-disaggregation.md
67
   backends/sglang/prometheus.md
68
69

   examples/README.md
70
   examples/runtime/hello_world/README.md
71

72
73
   design_docs/distributed_runtime.md
   design_docs/dynamo_flow.md
74

75
76
   backends/vllm/deepseek-r1.md
   backends/vllm/gpt-oss.md
77
   backends/vllm/LMCache_Integration.md
78
   backends/vllm/multi-node.md
79
   backends/vllm/prometheus.md
80
   backends/vllm/prompt-embeddings.md
81
   backends/vllm/speculative_decoding.md
82

83
84
   benchmarks/kv-router-ab-testing.md

Yan Ru Pei's avatar
Yan Ru Pei committed
85
86
   mocker/mocker.md

87
88
   frontends/kserve.md
   _sections/frontends.rst
89

90
91
..   TODO: architecture/distributed_runtime.md and architecture/dynamo_flow.md
     have some outdated names/references and need a refresh.
92
93
..   TODO: Add an OpenAI frontend doc to complement the KServe GRPC doc
     in the Frontends section.