index.rst 1.42 KB
Newer Older
zhouxiang's avatar
zhouxiang committed
1
欢迎来到 LMDeploy 的中文教程!
RunningLeon's avatar
RunningLeon committed
2
3
4
====================================


zhouxiang's avatar
zhouxiang committed
5
.. _快速上手:
RunningLeon's avatar
RunningLeon committed
6
7
.. toctree::
   :maxdepth: 2
zhouxiang's avatar
zhouxiang committed
8
9
10
11
12
13
14
15
   :caption: 快速上手

   get_started.md

.. _编译和安装:
.. toctree::
   :maxdepth: 1
   :caption: 编译和安装
RunningLeon's avatar
RunningLeon committed
16
17
18

   build.md

zhouxiang's avatar
zhouxiang committed
19
.. _测试基准:
RunningLeon's avatar
RunningLeon committed
20
.. toctree::
zhouxiang's avatar
zhouxiang committed
21
22
   :maxdepth: 1
   :caption: 测试基准
RunningLeon's avatar
RunningLeon committed
23

zhouxiang's avatar
zhouxiang committed
24
25
26
27
28
   benchmark/profile_generation.md
   benchmark/profile_throughput.md
   benchmark/profile_api_server.md
   benchmark/profile_triton_server.md
   benchmark/evaluate_with_opencompass.md
RunningLeon's avatar
RunningLeon committed
29

zhouxiang's avatar
zhouxiang committed
30
.. _支持的模型:
RunningLeon's avatar
RunningLeon committed
31
.. toctree::
zhouxiang's avatar
zhouxiang committed
32
33
   :maxdepth: 1
   :caption: 模型列表
RunningLeon's avatar
RunningLeon committed
34

zhouxiang's avatar
zhouxiang committed
35
   supported_models/supported_models.md
RunningLeon's avatar
RunningLeon committed
36

zhouxiang's avatar
zhouxiang committed
37
.. _推理:
RunningLeon's avatar
RunningLeon committed
38
.. toctree::
zhouxiang's avatar
zhouxiang committed
39
40
41
42
43
44
45
46
47
48
   :maxdepth: 1
   :caption: 推理

   inference/pipeline.md
   inference/vl_pipeline.md


.. _服务:
.. toctree::
   :maxdepth: 1
RunningLeon's avatar
RunningLeon committed
49
50
   :caption: 服务

zhouxiang's avatar
zhouxiang committed
51
52
53
54
55
   serving/api_server.md
   serving/api_server_vl.md
   serving/gradio.md
   serving/proxy_server.md

RunningLeon's avatar
RunningLeon committed
56

zhouxiang's avatar
zhouxiang committed
57
.. _量化:
RunningLeon's avatar
RunningLeon committed
58
.. toctree::
zhouxiang's avatar
zhouxiang committed
59
60
   :maxdepth: 1
   :caption: 量化
RunningLeon's avatar
RunningLeon committed
61

zhouxiang's avatar
zhouxiang committed
62
63
64
   quantization/w4a16.md
   quantization/kv_int8.md
   quantization/w8a8.md
RunningLeon's avatar
RunningLeon committed
65
66

.. toctree::
zhouxiang's avatar
zhouxiang committed
67
68
   :maxdepth: 1
   :caption: 进阶指南
RunningLeon's avatar
RunningLeon committed
69

zhouxiang's avatar
zhouxiang committed
70
71
72
73
74
75
76
77
78
79
80
   inference/turbomind.md
   inference/pytorch.md
   advance/pytorch_new_model.md
   advance/long_context.md
   advance/chat_template.md
   advance/debug_turbomind.md
   serving/qos.md

.. toctree::
   :maxdepth: 1
   :caption: API 文档
RunningLeon's avatar
RunningLeon committed
81

zhouxiang's avatar
zhouxiang committed
82
   api/pipeline.rst
RunningLeon's avatar
RunningLeon committed
83

zhouxiang's avatar
zhouxiang committed
84
索引与表格
RunningLeon's avatar
RunningLeon committed
85
86
87
88
==================

* :ref:`genindex`
* :ref:`search`