Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
xuwx1
LightX2V
Commits
6de0a3b4
Commit
6de0a3b4
authored
Sep 02, 2025
by
Yang Yong(雍洋)
Committed by
GitHub
Sep 02, 2025
Browse files
Add audio input files and update pre-commit config for larger files (#283)
parent
8de61521
Changes
24
Hide whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
15 additions
and
15 deletions
+15
-15
scripts/seko_talk/run_seko_talk_10_fp8_dist_fixed_min_area.sh
...pts/seko_talk/run_seko_talk_10_fp8_dist_fixed_min_area.sh
+3
-3
scripts/seko_talk/run_seko_talk_11_fp8_dist_fixed_shape.sh
scripts/seko_talk/run_seko_talk_11_fp8_dist_fixed_shape.sh
+3
-3
scripts/seko_talk/run_seko_talk_12_fp8_dist_fixed_shape_8gpus_1s.sh
...ko_talk/run_seko_talk_12_fp8_dist_fixed_shape_8gpus_1s.sh
+3
-3
test_cases/run_seko_talk_01_base.sh
test_cases/run_seko_talk_01_base.sh
+6
-6
No files found.
scripts/seko_talk/run_seko_talk_10_fp8_dist_fixed_min_area.sh
View file @
6de0a3b4
...
...
@@ -18,8 +18,8 @@ torchrun --nproc-per-node 4 -m lightx2v.infer \
--task
i2v
\
--model_path
$model_path
\
--config_json
${
lightx2v_path
}
/configs/seko_talk/seko_talk_10_fp8_dist_fixed_min_area.json
\
--prompt
"The video features a
old lady is saying something and knitting a sweater
."
\
--prompt
"The video features a
male speaking to the camera with arms spread out, a slightly furrowed brow, and a focused gaze
."
\
--negative_prompt
色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走
\
--image_path
${
lightx2v_path
}
/assets/inputs/audio/
15
.png
\
--audio_path
${
lightx2v_path
}
/assets/inputs/audio/
15
.wav
\
--image_path
${
lightx2v_path
}
/assets/inputs/audio/
seko_input
.png
\
--audio_path
${
lightx2v_path
}
/assets/inputs/audio/
seko_input
.wav
\
--save_video_path
${
lightx2v_path
}
/save_results/output_lightx2v_seko_talk.mp4
scripts/seko_talk/run_seko_talk_11_fp8_dist_fixed_shape.sh
View file @
6de0a3b4
...
...
@@ -18,8 +18,8 @@ torchrun --nproc-per-node 4 -m lightx2v.infer \
--task
i2v
\
--model_path
$model_path
\
--config_json
${
lightx2v_path
}
/configs/seko_talk/seko_talk_11_fp8_dist_fixed_shape.json
\
--prompt
"The video features a
old lady is saying something and knitting a sweater
."
\
--prompt
"The video features a
male speaking to the camera with arms spread out, a slightly furrowed brow, and a focused gaze
."
\
--negative_prompt
色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走
\
--image_path
${
lightx2v_path
}
/assets/inputs/audio/
15
.png
\
--audio_path
${
lightx2v_path
}
/assets/inputs/audio/
15
.wav
\
--image_path
${
lightx2v_path
}
/assets/inputs/audio/
seko_input
.png
\
--audio_path
${
lightx2v_path
}
/assets/inputs/audio/
seko_input
.wav
\
--save_video_path
${
lightx2v_path
}
/save_results/output_lightx2v_seko_talk.mp4
scripts/seko_talk/run_seko_talk_12_fp8_dist_fixed_shape_8gpus_1s.sh
View file @
6de0a3b4
...
...
@@ -18,8 +18,8 @@ torchrun --nproc-per-node 8 -m lightx2v.infer \
--task
i2v
\
--model_path
$model_path
\
--config_json
${
lightx2v_path
}
/configs/seko_talk/seko_talk_12_fp8_dist_fixed_shape_8gpus_1s.json
\
--prompt
"The video features a
old lady is saying something and knitting a sweater
."
\
--prompt
"The video features a
male speaking to the camera with arms spread out, a slightly furrowed brow, and a focused gaze
."
\
--negative_prompt
色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走
\
--image_path
${
lightx2v_path
}
/assets/inputs/audio/
15
.png
\
--audio_path
${
lightx2v_path
}
/assets/inputs/audio/
15
.wav
\
--image_path
${
lightx2v_path
}
/assets/inputs/audio/
seko_input
.png
\
--audio_path
${
lightx2v_path
}
/assets/inputs/audio/
seko_input
.wav
\
--save_video_path
${
lightx2v_path
}
/save_results/output_lightx2v_seko_talk.mp4
test_cases/run_seko_talk_01_base.sh
View file @
6de0a3b4
#!/bin/bash
lightx2v_path
=
model_path
=
lightx2v_path
=
/path/to/Lightx2v
model_path
=
/path/to/SekoTalk-Distill
export
CUDA_VISIBLE_DEVICES
=
0
...
...
@@ -17,9 +17,9 @@ python -m lightx2v.infer \
--model_cls
seko_talk
\
--task
i2v
\
--model_path
$model_path
\
--config_json
${
lightx2v_path
}
/configs/seko_talk/
wan_i2v_audio
.json
\
--prompt
"The video features a
old lady is saying something and knitting a sweater
."
\
--config_json
${
lightx2v_path
}
/configs/seko_talk/
seko_talk_01_base
.json
\
--prompt
"The video features a
male speaking to the camera with arms spread out, a slightly furrowed brow, and a focused gaze
."
\
--negative_prompt
色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走
\
--image_path
${
lightx2v_path
}
/assets/inputs/audio/
15
.png
\
--audio_path
${
lightx2v_path
}
/assets/inputs/audio/
15
.wav
\
--image_path
${
lightx2v_path
}
/assets/inputs/audio/
seko_input
.png
\
--audio_path
${
lightx2v_path
}
/assets/inputs/audio/
seko_input
.wav
\
--save_video_path
${
lightx2v_path
}
/save_results/output_lightx2v_seko_talk.mp4
Prev
1
2
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment