@@ -38,4 +38,8 @@ RUN git clone https://github.com/ModelTC/SageAttention.git
...
@@ -38,4 +38,8 @@ RUN git clone https://github.com/ModelTC/SageAttention.git
RUN cd SageAttention &&CUDA_ARCHITECTURES="8.0,8.6,8.9,9.0,12.0"EXT_PARALLEL=4 NVCC_APPEND_FLAGS="--threads 8"MAX_JOBS=32 pip install--no-cache-dir-v-e .
RUN cd SageAttention &&CUDA_ARCHITECTURES="8.0,8.6,8.9,9.0,12.0"EXT_PARALLEL=4 NVCC_APPEND_FLAGS="--threads 8"MAX_JOBS=32 pip install--no-cache-dir-v-e .
RUN git clone https://github.com/KONAKONA666/q8_kernels.git
RUN cd q8_kernels && git submodule init && git submodule update && python setup.py install&&rm-rf build
@@ -27,16 +27,16 @@ We strongly recommend using the Docker environment, which is the simplest and fa
...
@@ -27,16 +27,16 @@ We strongly recommend using the Docker environment, which is the simplest and fa
#### 1. Pull Image
#### 1. Pull Image
Visit LightX2V's [Docker Hub](https://hub.docker.com/r/lightx2v/lightx2v/tags), select a tag with the latest date, such as `25082901-cu128`:
Visit LightX2V's [Docker Hub](https://hub.docker.com/r/lightx2v/lightx2v/tags), select a tag with the latest date, such as `25090503-cu128`:
```bash
```bash
docker pull lightx2v/lightx2v:25082901-cu128
docker pull lightx2v/lightx2v:25090503-cu128
```
```
We recommend using the `cuda128` environment for faster inference speed. If you need to use the `cuda124` environment, you can use image versions with the `-cu124` suffix:
We recommend using the `cuda128` environment for faster inference speed. If you need to use the `cuda124` environment, you can use image versions with the `-cu124` suffix:
```bash
```bash
docker pull lightx2v/lightx2v:25082901-cu124
docker pull lightx2v/lightx2v:25090503-cu124
```
```
#### 2. Run Container
#### 2. Run Container
...
@@ -51,10 +51,10 @@ For mainland China, if the network is unstable when pulling images, you can pull
...
@@ -51,10 +51,10 @@ For mainland China, if the network is unstable when pulling images, you can pull