"components/backends/trtllm/vscode:/vscode.git/clone" did not exist on "5f179186fadf82c87f0eab0485350dfdc2e14796"
:orphan:

..
    SPDX-FileCopyrightText: Copyright (c) 2024-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
    SPDX-License-Identifier: Apache-2.0

.. This hidden toctree lists READMEs and other pages that are not meant to appear in the main table of contents but must still be accounted for in the Sphinx project structure.


.. toctree::
   :maxdepth: 2
   :hidden:

   development/runtime-guide.md
   api/nixl_connect/connector.md
   api/nixl_connect/descriptor.md
   api/nixl_connect/device.md
   api/nixl_connect/device_kind.md
   api/nixl_connect/operation_status.md
   api/nixl_connect/rdma_metadata.md
   api/nixl_connect/readable_operation.md
   api/nixl_connect/writable_operation.md
   api/nixl_connect/read_operation.md
   api/nixl_connect/write_operation.md
   api/nixl_connect/README.md

   kubernetes/api_reference.md
   kubernetes/deployment/create_deployment.md
   kubernetes/deployment/dynamomodel-guide.md
   kubernetes/chrek/README.md
   kubernetes/chrek/dynamo.md
   kubernetes/chrek/standalone.md

   kubernetes/fluxcd.md
   kubernetes/grove.md
   kubernetes/model_caching_with_fluid.md
   kubernetes/README.md
   reference/cli.md
   observability/metrics.md
   kvbm/vllm-setup.md
   kvbm/trtllm-setup.md
   agents/tool-calling.md
   development/jail_stream.md

   router/kv_cache_routing.md
   router/kv_events.md
   planner/load_planner.md
   fault_tolerance/README.md
   fault_tolerance/request_migration.md
   fault_tolerance/request_cancellation.md
   fault_tolerance/graceful_shutdown.md
   fault_tolerance/request_rejection.md
   fault_tolerance/testing.md
   design_docs/request_plane.md
   design_docs/event_plane.md

   backends/trtllm/multinode/multinode-examples.md
   backends/trtllm/llama4_plus_eagle.md
   backends/trtllm/kv-cache-transfer.md
   backends/trtllm/gemma3_sliding_window_attention.md
   backends/trtllm/gpt-oss.md
   backends/trtllm/prometheus.md

   backends/sglang/expert-distribution-eplb.md
   backends/sglang/gpt-oss.md
   backends/sglang/diffusion-lm.md
   backends/sglang/profiling.md
   backends/sglang/sgl-hicache-example.md
   backends/sglang/sglang-disaggregation.md
   backends/sglang/prometheus.md

   examples/README.md
   examples/runtime/hello_world/README.md

   design_docs/distributed_runtime.md
   design_docs/dynamo_flow.md

   backends/vllm/deepseek-r1.md
   backends/vllm/gpt-oss.md
   backends/vllm/LMCache_Integration.md
   backends/vllm/multi-node.md
   backends/vllm/prometheus.md
   backends/vllm/prompt-embeddings.md
   backends/vllm/speculative_decoding.md

   features/speculative_decoding/README.md
   features/speculative_decoding/speculative_decoding_vllm.md

   benchmarks/kv-router-ab-testing.md

   mocker/mocker.md

   frontends/kserve.md
   _sections/frontends.rst

..   TODO: architecture/distributed_runtime.md and architecture/dynamo_flow.md
     have some outdated names/references and need a refresh.
..   TODO: Add an OpenAI frontend doc to complement the KServe GRPC doc
     in the Frontends section.