[Doc] Add API reference for offline inference (#4710)

4bfa7e7f · Cyrus Leung · GitHub · ac1fbf7f · 4bfa7e7f · 4bfa7e7f
Unverified Commit 4bfa7e7f authored May 14, 2024 by Cyrus Leung Committed by GitHub May 13, 2024
4 changed files
--- a/docs/source/index.rst
+++ b/docs/source/index.rst
@@ -67,6 +67,13 @@ Documentation
   getting_started/quickstart
   getting_started/examples/examples_index
+.. toctree::
+   :maxdepth: 1
+   :caption: Offline Inference
+   offline_inference/llm
+   offline_inference/sampling_params
 .. toctree::
   :maxdepth: 1
   :caption: Serving
@@ -101,7 +108,6 @@ Documentation
   :maxdepth: 2
   :caption: Developer Documentation
-   dev/sampling_params
   dev/engine/engine_index
   dev/kernel/paged_attention
   dev/dockerfile/dockerfile

--- a/docs/source/offline_inference/llm.rst
+++ b/docs/source/offline_inference/llm.rst
+LLM Class
+==========
+.. autoclass:: vllm.LLM
+    :members:
+    :show-inheritance:
--- a/docs/source/dev/sampling_params.rst
+++ b/docs/source/dev/sampling_params.rst
-Sampling Params
+Sampling Parameters
-===============
+===================
 .. autoclass:: vllm.SamplingParams
    :members:
--- a/docs/source/serving/openai_compatible_server.md
+++ b/docs/source/serving/openai_compatible_server.md
@@ -48,7 +48,7 @@ completion = client.chat.completions.create(
 ```
 ### Extra Parameters for Chat API
-The following [sampling parameters (click through to see documentation)](../dev/sampling_params.rst) are supported.
+The following [sampling parameters (click through to see documentation)](../offline_inference/sampling_params.rst) are supported.
 ```{literalinclude} ../../../vllm/entrypoints/openai/protocol.py
 :language: python
@@ -65,7 +65,7 @@ The following extra parameters are supported:
 ```
 ### Extra Parameters for Completions API
-The following [sampling parameters (click through to see documentation)](../dev/sampling_params.rst) are supported.
+The following [sampling parameters (click through to see documentation)](../offline_inference/sampling_params.rst) are supported.
 ```{literalinclude} ../../../vllm/entrypoints/openai/protocol.py
 :language: python