### 复现指南🔥🔥🔥 ```shell # 1. 环境准备 docker pull image.sourcefind.cn:5000/dcu/admin/base/vllm:0.9.2-ubuntu22.04-dtk25.04.1-rc5-rocblas101839-0811-das1.6-py3.10-20250908-rc1 # 2. 创建容器 docker run -it \ --network=host \ --hostname=localhost \ --name=HUNYUAN \ -v /opt/hyhal:/opt/hyhal:ro \ -v $PWD:/workspace \ --ipc=host \ --device=/dev/kfd \ --device=/dev/mkfd \ --device=/dev/dri \ --shm-size=512G \ --privileged \ --group-add video \ --cap-add=SYS_PTRACE \ --security-opt seccomp=unconfined \ image.sourcefind.cn:5000/dcu/admin/base/vllm:0.9.2-ubuntu22.04-dtk25.04.1-rc5-rocblas101839-0811-das1.6-py3.10-20250908-rc1 \ /bin/bash # 3. 拉取代码 git clone http://developer.sourcefind.cn/codes/bw_bestperf/hunyuan-dit.git # 4. 获取&安装依赖 Apex: curl -f -C - -o apex-1.5.0+das.opt1.dtk25041-cp310-cp310-linux_x86_64.whl https://ksefile.hpccube.com:65241/efile/s/d/amVycnJycnk=/e759f4e7fbb64b10 Lightop curl -f -C - -o lightop-0.5.0+das.dtk25041.unknown-cp310-cp310-linux_x86_64.whl https://ksefile.hpccube.com:65241/efile/s/d/amVycnJycnk=/3ca9654a8fc1b0b5 Deepspeed wget https://download.sourcefind.cn:65024/directlink/4/deepspeed/DAS1.6/deepspeed-0.14.2+das.opt1.dtk25041-cp310-cp310-manylinux_2_28_x86_64.whl pip install apex-1.5.0+das.opt1.dtk25041-cp310-cp310-linux_x86_64.whl pip install lightop-0.5.0+das.dtk25041.unknown-cp310-cp310-linux_x86_64.whl pip install deepspeed-0.14.2+das.opt1.dtk25041-cp310-cp310-manylinux_2_28_x86_64.whl pip install -r requirements.txt -i https://mirrors.tuna.tsinghua.edu.cn/pypi/web/simple # 5. 下载优化包 hipblaslt curl -f -C - -o hipblaslt-install0925.tar.gz https://ksefile.hpccube.com:65241/efile/s/d/amVycnJycnk=/5857030947151012 miopen curl -f -C - -o package_0915_ubuntu.tar.gz https://ksefile.hpccube.com:65241/efile/s/d/amVycnJycnk=/0c80d0e60b9af80d # 6. 下载模型 模型详见:https://modelscope.cn/models/dengcao/HunyuanDiT-v1.2 pip install modelscope modelscope download --model dengcao/HunyuanDiT-v1.2 --local_dir ./HunyuanDiT-v1.2 还需要下载vae,tokenizer和tex_encoder cd HunyuanDiT-v1.2 wget https://dit.hunyuan.tencent.com/download/HunyuanDiT/tokenizer.zip wget https://dit.hunyuan.tencent.com/download/HunyuanDiT/sdxl-vae-fp16-fix.zip wget https://dit.hunyuan.tencent.com/download/HunyuanDiT/clip_text_encoder.zip 下载完模型结构如下 ```

## 测试指令 ``` export LD_LIBRARY_PATH=/workspace/OEM_ADVTG_TEST/hunyuan/hipblaslt-install/lib/:$LD_LIBRARY_PATH export LD_LIBRARY_PATH=/workspace/OEM_ADVTG_TEST/hunyuan/package/miopen/lib/:$LD_LIBRARY_PATH python sample_t2i_dcu.py --model-root /workspace/OEM_ADVTG_TEST/hunyuan/HunyuanDiT-v1.2/ --batch-size 4 --infer-mode fa --prompt "青花瓷风格,一只可爱的哈士奇" --no-enhance --load-key module --image-size 1024 1024 --infer-steps 20 ```