"packaging/vscode:/vscode.git/clone" did not exist on "4a735b8e98c2814e89b8868b3b64bfc409dae5cf"
Commit 55f4b512 authored by helloyongyang's avatar helloyongyang
Browse files

update docs

parent abf36593
# Service Deployment
lightx2v provides asynchronous service functionality. The code entry point is [here](https://github.com/ModelTC/lightx2v/blob/main/lightx2v/api_server.py).
### Start the Service
```shell
# Modify the paths in the script
bash scripts/start_server.sh
```
The `--port 8000` option means the service will bind to port `8000` on the local machine. You can change this as needed.
### Client Sends Request
```shell
python scripts/post.py
```
The service endpoint is: `/v1/tasks/`
The `message` parameter in `scripts/post.py` is as follows:
```python
message = {
"prompt": "Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage.",
"negative_prompt": "镜头晃动,色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走",
"image_path": "",
}
```
1. `prompt`, `negative_prompt`, and `image_path` are basic inputs for video generation. `image_path` can be an empty string, indicating no image input is needed.
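A minimal client sketch of this request: only the endpoint path and the `message` fields come from this document; the helper names and the use of `urllib` are ours, and the shape of the server's response is not assumed.

```python
# Hypothetical client for the /v1/tasks/ endpoint; only the endpoint path
# and the message fields come from the docs above.
import json
from urllib import request


def build_message(prompt, negative_prompt="", image_path=""):
    """Assemble the task payload; an empty image_path means no image input."""
    return {
        "prompt": prompt,
        "negative_prompt": negative_prompt,
        "image_path": image_path,
    }


def post_task(base_url, message):
    """POST the task to the server and return the raw response body."""
    req = request.Request(
        f"{base_url}/v1/tasks/",
        data=json.dumps(message).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req) as resp:
        return resp.read()
```

Pointing `post_task("http://localhost:8000", build_message(...))` at a running server should submit a task; what comes back is whatever `api_server` returns.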
### Client Checks Server Status
```shell
python scripts/check_status.py
```
The service endpoints include:
1. `/v1/service/status` is used to check the status of the service. It returns whether the service is `busy` or `idle`. The service only accepts new requests when it is `idle`.
2. `/v1/tasks/` is used to get all tasks received and completed by the server.
3. `/v1/tasks/{task_id}/status` is used to get the status of a specified `task_id`. It returns whether the task is `processing` or `completed`.
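The polling loop implied by the status endpoint can be sketched as follows. The fetch function is injected so the sketch stays self-contained; the status strings `processing` and `completed` are the ones documented above.

```python
# Poll a task's status until it completes. fetch_status(task_id) is expected
# to return "processing" or "completed", mirroring /v1/tasks/{task_id}/status.
import time


def poll_until_complete(task_id, fetch_status, interval=1.0, max_polls=600):
    """Return "completed" once the task finishes, or raise on timeout."""
    for _ in range(max_polls):
        status = fetch_status(task_id)
        if status == "completed":
            return status
        time.sleep(interval)
    raise TimeoutError(f"task {task_id} still not completed")
```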
### Client Stops the Current Task on the Server at Any Time
```shell
python scripts/stop_running_task.py
```
The service endpoint is: `/v1/tasks/running`
After terminating the task, the server will not exit but will return to waiting for new requests.
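Assuming the default port from the start script, an equivalent raw call (a sketch; the actual script may differ) would be:

```shell
# Stop the currently running task; the server keeps waiting for new requests.
curl -X DELETE http://localhost:8000/v1/tasks/running
```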
### Starting Multiple Services on a Single Node
On a single node, you can start multiple services with `scripts/start_server.sh` (each service on the same IP must use a different port), or start several services at once with `scripts/start_multi_servers.sh`:
```shell
num_gpus=8 bash scripts/start_multi_servers.sh
```
Where `num_gpus` indicates the number of services to start; the services will run on consecutive ports starting from `--start_port`.
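Conceptually, starting one service per GPU on consecutive ports amounts to something like the sketch below; the variable names and the commented launch command are illustrative, not the actual script.

```shell
# Illustrative sketch: one server per GPU on consecutive ports.
start_port=8000
num_gpus=${num_gpus:-8}
for i in $(seq 0 $((num_gpus - 1))); do
  port=$((start_port + i))
  echo "GPU ${i} -> port ${port}"
  # Hypothetical launch line, commented out:
  # CUDA_VISIBLE_DEVICES=${i} python -m lightx2v.api_server --port ${port} &
done
```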
### Scheduling Between Multiple Services
```shell
python scripts/post_multi_servers.py
```
`post_multi_servers.py` will schedule multiple client requests based on the idle status of the services.
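A minimal sketch of that idle-based dispatch, assuming a mapping from server URL to its `/v1/service/status` result; the function name is ours, not from the script.

```python
# Dispatch sketch: pick the first server reporting "idle".
# Status values "idle"/"busy" mirror /v1/service/status as described above.
def pick_idle_server(statuses):
    """statuses: dict of server URL -> "idle" or "busy"; return an idle URL or None."""
    for url, status in statuses.items():
        if status == "idle":
            return url
    return None
```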
### API Endpoints Summary
| Endpoint | Method | Description |
|----------|--------|-------------|
| `/v1/tasks/` | POST | Create video generation task |
| `/v1/tasks/form` | POST | Create video generation task via form |
| `/v1/tasks/` | GET | List all tasks |
| `/v1/tasks/{task_id}/status` | GET | Get status of specified task |
| `/v1/tasks/{task_id}/result` | GET | Get result video file of specified task |
| `/v1/tasks/running` | DELETE | Stop currently running task |
| `/v1/files/download/{file_path}` | GET | Download file |
| `/v1/service/status` | GET | Get service status |
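For instance, a finished video could be retrieved through the result endpoint roughly as follows; only the path comes from the table above, the helper names are ours.

```python
# Fetch the result video of a completed task and save it locally.
from urllib import request


def result_url(base_url, task_id):
    """Build the /v1/tasks/{task_id}/result URL from the table above."""
    return f"{base_url.rstrip('/')}/v1/tasks/{task_id}/result"


def download_result(base_url, task_id, out_path):
    with request.urlopen(result_url(base_url, task_id)) as resp, open(out_path, "wb") as f:
        f.write(resp.read())
```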
# Quick Start
## Prepare Environment
We recommend using a Docker environment. Here is the [dockerhub](https://hub.docker.com/r/lightx2v/lightx2v/tags) for lightx2v. Please select the tag with the latest date, for example, 25061301.
```shell
docker pull lightx2v/lightx2v:25061301
docker run --gpus all -itd --ipc=host --name [container_name] -v [mount_settings] --entrypoint /bin/bash [image_id]
```
For mainland China, if the network is unstable when pulling the image, you can pull it from [渡渡鸟](https://docker.aityp.com/r/docker.io/lightx2v/lightx2v) instead:
```shell
docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/lightx2v/lightx2v:25061301
```
If you want to set up the environment yourself using conda, you can refer to the following steps:
```shell
# clone the repo
git clone https://github.com/ModelTC/lightx2v.git lightx2v && cd lightx2v
conda create -n lightx2v python=3.11 && conda activate lightx2v
pip install -r requirements.txt
# Reinstall transformers separately to bypass pip's version conflict check.
# The Hunyuan model must run with transformers 4.45.2; skip this step if you do not need Hunyuan.
pip install transformers==4.45.2
# install flash-attention 2
git clone https://github.com/Dao-AILab/flash-attention.git --recursive
cd flash-attention && python setup.py install
# install flash-attention 3 (Hopper GPUs only)
cd flash-attention/hopper && python setup.py install
```
## Inference
```shell
# Modify the paths in the script
bash scripts/run_wan_t2v.sh
```
In addition to the arguments already present in the script, the `${lightx2v_path}/configs/wan_t2v.json` file specified by `--config_json` also contains some necessary parameters. You can modify them as needed.
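If you prefer to adjust those config parameters programmatically rather than hand-editing the JSON, a small sketch (the helper is ours; no key names from the real `wan_t2v.json` are assumed):

```python
# Load a config JSON, override selected keys, and write it back.
import json
from pathlib import Path


def override_config(path, **overrides):
    """Return the updated config dict after persisting it to disk."""
    cfg = json.loads(Path(path).read_text())
    cfg.update(overrides)
    Path(path).write_text(json.dumps(cfg, indent=2))
    return cfg
```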
Welcome to Lightx2v!
====================

.. figure:: ../../../assets/img_lightx2v.png

.. raw:: html

   <p style="text-align:center">
   <strong>A Light Video Generation Inference Framework</strong>
   </p>

Documentation
-------------

.. toctree::
   :maxdepth: 1
   :caption: Quick Start

   Quick Start <getting_started/quickstart.md>

.. toctree::
   :maxdepth: 1
   :caption: Method Tutorials

   Model Quantization <method_tutorials/quantization.md>
   Feature Caching <method_tutorials/cache.md>
   Attention Module <method_tutorials/attention.md>
   Offloading <method_tutorials/offload.md>
   Parallel Inference <method_tutorials/parallel.md>

.. toctree::
   :maxdepth: 1
   :caption: Deployment Guides

   Low Latency Deployment <deploy_guides/for_low_latency.md>
   Low Resource Deployment <deploy_guides/for_low_resource.md>
   Server Deployment <deploy_guides/deploy_service.md>
   Gradio Deployment <deploy_guides/deploy_gradio.md>
   ComfyUI Deployment <deploy_guides/deploy_comfyui.md>
   Local Windows Deployment <deploy_guides/deploy_local_windows.md>

.. Indices and tables
# Model Quantization
lightx2v supports quantized inference for the linear layers in `Dit`, enabling `w8a8-int8` and `w8a8-fp8` matrix multiplication.

## Generating Quantized Models

### Automatic Quantization
lightx2v supports automatic weight quantization during inference; see the [configuration file](https://github.com/ModelTC/lightx2v/tree/main/configs/quantization/wan_i2v_quant_auto.json).

**Key configuration**:
Set `"mm_config": {"mm_type": "W-int8-channel-sym-A-int8-channel-sym-dynamic-Vllm", "weight_auto_quant": true}`.
- `mm_type`: specifies the quantized operator
- `weight_auto_quant: true`: enables automatic model quantization

### Offline Quantization
lightx2v also supports loading pre-quantized weights directly. For offline model quantization, refer to the [documentation](https://github.com/ModelTC/lightx2v/tree/main/tools/convert/readme.md).

Configure the [quantization file](https://github.com/ModelTC/lightx2v/tree/main/configs/quantization/wan_i2v_quant_offline.json):
1. Set `dit_quantized_ckpt` to the converted weight path
2. Set `weight_auto_quant` in `mm_config` to `false`
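Combining the two steps, the relevant fragment of the offline configuration might look like the following; the checkpoint path is a placeholder, and the other keys of the real file are omitted:

```json
{
  "dit_quantized_ckpt": "/path/to/converted/quant/weights",
  "mm_config": {
    "mm_type": "W-int8-channel-sym-A-int8-channel-sym-dynamic-Vllm",
    "weight_auto_quant": false
  }
}
```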
## Quantized Inference

### Automatic Quantization
```shell
bash scripts/run_wan_i2v_quant_auto.sh
```

### Offline Quantization
```shell
bash scripts/run_wan_i2v_quant_offline.sh
```

## Launching a Quantized Service
After converting the quantized weights offline, point `--config_json` to the offline quantization JSON file.
For example, modify `scripts/start_server.sh` as follows:
```shell
export RUNNING_FLAG=infer

python -m lightx2v.api_server \
    ...
    --port 8000
```

## Advanced Quantization Features
For details, refer to the quantization tool's [LLMC documentation](https://github.com/ModelTC/llmc/blob/main/docs/en/source/backend/lightx2v.md).