hidden_toctree.rst 2.6 KB
Newer Older
1
2
:orphan:

3
4
5
6
7
8
9
10
11
12
13
..
    SPDX-FileCopyrightText: Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
    SPDX-License-Identifier: Apache-2.0

.. This hidden toctree includes readmes etc that aren't meant to be in the main table of contents but should be accounted for in the sphinx project structure


.. toctree::
   :maxdepth: 2
   :hidden:

14
15
16
17
18
19
20
21
22
23
24
25
   development/runtime-guide.md
   api/nixl_connect/connector.md
   api/nixl_connect/descriptor.md
   api/nixl_connect/device.md
   api/nixl_connect/device_kind.md
   api/nixl_connect/operation_status.md
   api/nixl_connect/rdma_metadata.md
   api/nixl_connect/readable_operation.md
   api/nixl_connect/writable_operation.md
   api/nixl_connect/read_operation.md
   api/nixl_connect/write_operation.md
   api/nixl_connect/README.md
26

27
   kubernetes/api_reference.md
28
   kubernetes/deployment/create_deployment.md
29
   kubernetes/deployment/dynamomodel-guide.md
30
31
32
33
34

   kubernetes/fluxcd.md
   kubernetes/grove.md
   kubernetes/model_caching_with_fluid.md
   kubernetes/README.md
35
36
37
38
   reference/cli.md
   observability/metrics.md
   kvbm/vllm-setup.md
   kvbm/trtllm-setup.md
39
   agents/tool-calling.md
40
   guides/jail_stream_readme.md
41
   guides/request_plane.md
42

43
   router/kv_cache_routing.md
44
   planner/load_planner.md
45
46
   fault_tolerance/request_migration.md
   fault_tolerance/request_cancellation.md
47

48
49
50
51
52
   backends/trtllm/multinode/multinode-examples.md
   backends/trtllm/llama4_plus_eagle.md
   backends/trtllm/kv-cache-transfer.md
   backends/trtllm/gemma3_sliding_window_attention.md
   backends/trtllm/gpt-oss.md
53
   backends/trtllm/prometheus.md
54
55
56
57
58
59

   backends/sglang/multinode-examples.md
   backends/sglang/dsr1-wideep-gb200.md
   backends/sglang/dsr1-wideep-h100.md
   backends/sglang/expert-distribution-eplb.md
   backends/sglang/gpt-oss.md
60
   backends/sglang/profiling.md
61
   backends/sglang/sgl-hicache-example.md
62
   backends/sglang/sglang-disaggregation.md
63
   backends/sglang/prometheus.md
64
65

   examples/README.md
66
   examples/runtime/hello_world/README.md
67

68
69
   design_docs/distributed_runtime.md
   design_docs/dynamo_flow.md
70

71
72
   backends/vllm/deepseek-r1.md
   backends/vllm/gpt-oss.md
73
   backends/vllm/LMCache_Integration.md
74
   backends/vllm/multi-node.md
75
   backends/vllm/prometheus.md
76
   backends/vllm/speculative_decoding.md
77

78
79
   benchmarks/kv-router-ab-testing.md

80
81
   frontends/kserve.md
   _sections/frontends.rst
82

83
84
..   TODO: architecture/distributed_runtime.md and architecture/dynamo_flow.md
     have some outdated names/references and need a refresh.
85
86
..   TODO: Add an OpenAI frontend doc and then add top-level Frontends section
     to index.rst pointing to both OpenAI HTTP and KServe GRPC docs.