"vscode:/vscode.git/clone" did not exist on "598cbbb73b3e260b0c5439957822f4cbf1ab1079"
:orphan:

..
    SPDX-FileCopyrightText: Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
    SPDX-License-Identifier: Apache-2.0

.. This hidden toctree includes READMEs and other pages that are not meant to appear in the main table of contents but must still be accounted for in the Sphinx project structure.


.. toctree::
   :maxdepth: 2
   :hidden:

   development/runtime-guide.md
   api/nixl_connect/connector.md
   api/nixl_connect/descriptor.md
   api/nixl_connect/device.md
   api/nixl_connect/device_kind.md
   api/nixl_connect/operation_status.md
   api/nixl_connect/rdma_metadata.md
   api/nixl_connect/readable_operation.md
   api/nixl_connect/writable_operation.md
   api/nixl_connect/read_operation.md
   api/nixl_connect/write_operation.md
   api/nixl_connect/README.md

   kubernetes/api_reference.md
   kubernetes/deployment/create_deployment.md
   kubernetes/deployment/dynamomodel-guide.md

   kubernetes/fluxcd.md
   kubernetes/grove.md
   kubernetes/model_caching_with_fluid.md
   kubernetes/README.md
   reference/cli.md
   observability/metrics.md
   kvbm/vllm-setup.md
   kvbm/trtllm-setup.md
   agents/tool-calling.md
   guides/jail_stream_readme.md
   guides/request_plane.md

   router/kv_cache_routing.md
   planner/load_planner.md
   fault_tolerance/request_migration.md
   fault_tolerance/request_cancellation.md

   backends/trtllm/multinode/multinode-examples.md
   backends/trtllm/llama4_plus_eagle.md
   backends/trtllm/kv-cache-transfer.md
   backends/trtllm/gemma3_sliding_window_attention.md
   backends/trtllm/gpt-oss.md
   backends/trtllm/prometheus.md

   backends/sglang/expert-distribution-eplb.md
   backends/sglang/gpt-oss.md
   backends/sglang/profiling.md
   backends/sglang/sgl-hicache-example.md
   backends/sglang/sglang-disaggregation.md
   backends/sglang/prometheus.md

   examples/README.md
   examples/runtime/hello_world/README.md

   design_docs/distributed_runtime.md
   design_docs/dynamo_flow.md

   backends/vllm/deepseek-r1.md
   backends/vllm/gpt-oss.md
   backends/vllm/LMCache_Integration.md
   backends/vllm/multi-node.md
   backends/vllm/prometheus.md
   backends/vllm/prompt-embeddings.md
   backends/vllm/speculative_decoding.md

   benchmarks/kv-router-ab-testing.md

   frontends/kserve.md
   _sections/frontends.rst

..   TODO: architecture/distributed_runtime.md and architecture/dynamo_flow.md
     have some outdated names/references and need a refresh.
..   TODO: Add an OpenAI frontend doc and then add top-level Frontends section
     to index.rst pointing to both OpenAI HTTP and KServe gRPC docs.