Welcome to LMDeploy's tutorials!
====================================

.. _get_started:
.. toctree::
   :maxdepth: 2
   :caption: Get Started

   get_started.md

.. _build:
.. toctree::
   :maxdepth: 1
   :caption: Build

   build.md

.. _benchmark:
.. toctree::
   :maxdepth: 1
   :caption: Benchmark

   benchmark/profile_generation.md
   benchmark/profile_throughput.md
   benchmark/profile_api_server.md
   benchmark/profile_triton_server.md
   benchmark/evaluate_with_opencompass.md

.. _supported_models:
.. toctree::
   :maxdepth: 1
   :caption: Supported Models

   supported_models/supported_models.md

.. _inference:
.. toctree::
   :maxdepth: 1
   :caption: Inference

   inference/pipeline.md
   inference/vl_pipeline.md


.. _serving:
.. toctree::
   :maxdepth: 1
   :caption: Serving

   serving/api_server.md
   serving/api_server_vl.md
   serving/gradio.md
   serving/proxy_server.md

.. _quantization:
.. toctree::
   :maxdepth: 1
   :caption: Quantization

   quantization/w4a16.md
   quantization/kv_int8.md
   quantization/w8a8.md

.. toctree::
   :maxdepth: 1
   :caption: Advanced Guide

   inference/turbomind.md
   inference/pytorch.md
   advance/pytorch_new_model.md
   advance/long_context.md
   advance/chat_template.md
   advance/debug_turbomind.md
   serving/qos.md

.. toctree::
   :maxdepth: 1
   :caption: API Reference

   api/pipeline.rst

Indices and tables
==================

* :ref:`genindex`
* :ref:`search`