api_server_v2.log

2026-01-07 09:55:53.359 | INFO     | indextts.infer_vllm_v2:__init__:105 - >> GPT weights restored from: checkpoints/IndexTTS-2-vLLM/gpt.pth
2026-01-07 09:55:53.372 | INFO     | indextts.infer_vllm_v2:__init__:116 - >> Failed to load custom CUDA kernel for BigVGAN. Falling back to torch.
2026-01-07 09:55:56.908 | INFO     | indextts.infer_vllm_v2:__init__:137 - >> semantic_codec weights restored from: checkpoints/IndexTTS-2-vLLM/semantic_codec/model.safetensors
2026-01-07 10:08:12.331 | INFO     | indextts.infer_vllm_v2:__init__:105 - >> GPT weights restored from: checkpoints/IndexTTS-2-vLLM/gpt.pth
2026-01-07 10:08:12.340 | INFO     | indextts.infer_vllm_v2:__init__:116 - >> Failed to load custom CUDA kernel for BigVGAN. Falling back to torch.
2026-01-07 10:08:15.863 | INFO     | indextts.infer_vllm_v2:__init__:137 - >> semantic_codec weights restored from: checkpoints/IndexTTS-2-vLLM/semantic_codec/model.safetensors
2026-01-07 10:08:18.419 | INFO     | indextts.infer_vllm_v2:__init__:152 - >> s2mel weights restored from: checkpoints/IndexTTS-2-vLLM/s2mel.pth
2026-01-07 10:08:18.781 | INFO     | indextts.infer_vllm_v2:__init__:163 - >> campplus_model weights restored from: checkpoints/IndexTTS-2-vLLM/campplus/campplus_cn_common.bin
2026-01-07 10:08:22.673 | INFO     | indextts.infer_vllm_v2:__init__:171 - >> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x
2026-01-07 10:13:42.833 | INFO     | indextts.infer_vllm_v2:__init__:105 - >> GPT weights restored from: checkpoints/IndexTTS-2-vLLM/gpt.pth
2026-01-07 10:13:42.842 | INFO     | indextts.infer_vllm_v2:__init__:116 - >> Failed to load custom CUDA kernel for BigVGAN. Falling back to torch.
2026-01-07 10:13:46.408 | INFO     | indextts.infer_vllm_v2:__init__:137 - >> semantic_codec weights restored from: checkpoints/IndexTTS-2-vLLM/semantic_codec/model.safetensors
2026-01-07 10:13:48.547 | INFO     | indextts.infer_vllm_v2:__init__:152 - >> s2mel weights restored from: checkpoints/IndexTTS-2-vLLM/s2mel.pth
2026-01-07 10:13:48.892 | INFO     | indextts.infer_vllm_v2:__init__:163 - >> campplus_model weights restored from: checkpoints/IndexTTS-2-vLLM/campplus/campplus_cn_common.bin
2026-01-07 10:13:52.163 | INFO     | indextts.infer_vllm_v2:__init__:171 - >> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x
2026-01-07 11:35:01.871 | INFO     | indextts.infer_vllm_v2:__init__:105 - >> GPT weights restored from: checkpoints/IndexTTS-2-vLLM/gpt.pth
2026-01-07 11:35:01.882 | INFO     | indextts.infer_vllm_v2:__init__:116 - >> Failed to load custom CUDA kernel for BigVGAN. Falling back to torch.
2026-01-07 11:35:05.240 | INFO     | indextts.infer_vllm_v2:__init__:137 - >> semantic_codec weights restored from: checkpoints/IndexTTS-2-vLLM/semantic_codec/model.safetensors
2026-01-07 11:35:07.533 | INFO     | indextts.infer_vllm_v2:__init__:152 - >> s2mel weights restored from: checkpoints/IndexTTS-2-vLLM/s2mel.pth
2026-01-07 11:35:07.892 | INFO     | indextts.infer_vllm_v2:__init__:163 - >> campplus_model weights restored from: checkpoints/IndexTTS-2-vLLM/campplus/campplus_cn_common.bin
2026-01-07 11:35:11.961 | INFO     | indextts.infer_vllm_v2:__init__:171 - >> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x
2026-01-07 11:53:45.877 | INFO     | indextts.infer_vllm_v2:__init__:105 - >> GPT weights restored from: checkpoints/IndexTTS-2-vLLM/gpt.pth
2026-01-07 11:53:45.889 | INFO     | indextts.infer_vllm_v2:__init__:116 - >> Failed to load custom CUDA kernel for BigVGAN. Falling back to torch.
2026-01-07 11:53:49.144 | INFO     | indextts.infer_vllm_v2:__init__:137 - >> semantic_codec weights restored from: checkpoints/IndexTTS-2-vLLM/semantic_codec/model.safetensors
2026-01-07 11:53:51.373 | INFO     | indextts.infer_vllm_v2:__init__:152 - >> s2mel weights restored from: checkpoints/IndexTTS-2-vLLM/s2mel.pth
2026-01-07 11:53:51.762 | INFO     | indextts.infer_vllm_v2:__init__:163 - >> campplus_model weights restored from: checkpoints/IndexTTS-2-vLLM/campplus/campplus_cn_common.bin
2026-01-07 11:53:55.438 | INFO     | indextts.infer_vllm_v2:__init__:171 - >> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x
2026-01-07 11:54:32.456 | INFO     | indextts.infer_vllm_v2:__init__:176 - >> TextNormalizer loaded
2026-01-07 11:54:32.484 | INFO     | indextts.infer_vllm_v2:__init__:178 - >> bpe model loaded from: checkpoints/IndexTTS-2-vLLM/bpe.model
2026-01-07 12:26:15.303 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 12:26:33.498 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 12:26:34.437 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [9e538135d39f4ee1b990f7940806b044] [prefill time: 0.9366]
2026-01-07 12:26:46.639 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [9e538135d39f4ee1b990f7940806b044] [decode time: 12.2017] [decode len: 1761]
2026-01-07 12:26:47.435 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 12:26:47.855 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 12:26:47.874 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [1c403009d06c4c3c9811620525f339aa] [prefill time: 0.0177]
2026-01-07 12:26:59.764 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [1c403009d06c4c3c9811620525f339aa] [decode time: 11.8893] [decode len: 1761]
2026-01-07 12:27:00.146 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 12:27:00.777 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 12:27:00.801 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [d8300dd0b6b749b08352ea5468732605] [prefill time: 0.0174]
2026-01-07 12:27:12.761 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [d8300dd0b6b749b08352ea5468732605] [decode time: 11.9596] [decode len: 1761]
2026-01-07 12:27:13.116 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 12:27:14.408 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 12:27:14.432 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [f0d11d0427cd4ac1a83f89d987d137e3] [prefill time: 0.0176]
2026-01-07 12:27:15.221 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [f0d11d0427cd4ac1a83f89d987d137e3] [decode time: 0.7887] [decode len: 135]
2026-01-07 15:39:20.324 | INFO     | indextts.infer_vllm_v2:__init__:105 - >> GPT weights restored from: checkpoints/IndexTTS-2-vLLM/gpt.pth
2026-01-07 15:39:20.332 | INFO     | indextts.infer_vllm_v2:__init__:116 - >> Failed to load custom CUDA kernel for BigVGAN. Falling back to torch.
2026-01-07 15:39:23.893 | INFO     | indextts.infer_vllm_v2:__init__:137 - >> semantic_codec weights restored from: checkpoints/IndexTTS-2-vLLM/semantic_codec/model.safetensors
2026-01-07 15:39:26.238 | INFO     | indextts.infer_vllm_v2:__init__:152 - >> s2mel weights restored from: checkpoints/IndexTTS-2-vLLM/s2mel.pth
2026-01-07 15:39:26.591 | INFO     | indextts.infer_vllm_v2:__init__:163 - >> campplus_model weights restored from: checkpoints/IndexTTS-2-vLLM/campplus/campplus_cn_common.bin
2026-01-07 15:39:30.100 | INFO     | indextts.infer_vllm_v2:__init__:171 - >> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x
2026-01-07 15:39:32.076 | INFO     | indextts.infer_vllm_v2:__init__:176 - >> TextNormalizer loaded
2026-01-07 15:39:32.108 | INFO     | indextts.infer_vllm_v2:__init__:178 - >> bpe model loaded from: checkpoints/IndexTTS-2-vLLM/bpe.model
2026-01-07 15:40:39.322 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 15:40:45.193 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 15:40:46.224 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [2c7b38cf2739452c9ebdab36aa5c4db0] [prefill time: 1.0290]
2026-01-07 15:40:48.352 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [2c7b38cf2739452c9ebdab36aa5c4db0] [decode time: 2.1280] [decode len: 336]
2026-01-07 15:40:48.576 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 15:40:48.999 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 15:40:49.018 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [00454b8a816048a998374d8b6dd2df81] [prefill time: 0.0177]
2026-01-07 15:40:53.017 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [00454b8a816048a998374d8b6dd2df81] [decode time: 3.9981] [decode len: 616]
2026-01-07 15:40:53.215 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 15:40:53.674 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 15:40:53.696 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [eced97d4f81d4023a9406bf25f2af326] [prefill time: 0.0200]
2026-01-07 15:40:56.550 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [eced97d4f81d4023a9406bf25f2af326] [decode time: 2.8534] [decode len: 439]
2026-01-07 15:40:56.712 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 15:40:57.999 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 15:40:58.018 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [36cdd0e5c0fa4b31b80e1733fc2aa77b] [prefill time: 0.0171]
2026-01-07 15:40:58.186 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [36cdd0e5c0fa4b31b80e1733fc2aa77b] [decode time: 0.1675] [decode len: 29]
2026-01-07 17:16:04.466 | INFO     | indextts.infer_vllm_v2:__init__:105 - >> GPT weights restored from: checkpoints/IndexTTS-2-vLLM/gpt.pth
2026-01-07 17:16:04.473 | INFO     | indextts.infer_vllm_v2:__init__:116 - >> Failed to load custom CUDA kernel for BigVGAN. Falling back to torch.
2026-01-07 17:16:07.821 | INFO     | indextts.infer_vllm_v2:__init__:137 - >> semantic_codec weights restored from: checkpoints/IndexTTS-2-vLLM/semantic_codec/model.safetensors
2026-01-07 17:16:09.645 | INFO     | indextts.infer_vllm_v2:__init__:152 - >> s2mel weights restored from: checkpoints/IndexTTS-2-vLLM/s2mel.pth
2026-01-07 17:16:09.974 | INFO     | indextts.infer_vllm_v2:__init__:163 - >> campplus_model weights restored from: checkpoints/IndexTTS-2-vLLM/campplus/campplus_cn_common.bin
2026-01-07 17:16:13.027 | INFO     | indextts.infer_vllm_v2:__init__:171 - >> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x
2026-01-07 17:16:15.973 | INFO     | indextts.infer_vllm_v2:__init__:176 - >> TextNormalizer loaded
2026-01-07 17:16:16.002 | INFO     | indextts.infer_vllm_v2:__init__:178 - >> bpe model loaded from: checkpoints/IndexTTS-2-vLLM/bpe.model
2026-01-07 17:17:34.294 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 17:17:40.179 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 17:17:41.249 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [22f7281eaa15416488635de965f49159] [prefill time: 1.0658]
2026-01-07 17:17:52.958 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [22f7281eaa15416488635de965f49159] [decode time: 11.7088] [decode len: 1761]
2026-01-07 17:18:06.446 | INFO     | indextts.infer_vllm_v2:infer:458 - >> gpt_gen_time: 12.86 seconds
2026-01-07 17:18:06.446 | INFO     | indextts.infer_vllm_v2:infer:459 - >> gpt_forward_time: 0.18 seconds
2026-01-07 17:18:06.447 | INFO     | indextts.infer_vllm_v2:infer:460 - >> s2mel_time: 11.15 seconds
2026-01-07 17:18:06.448 | INFO     | indextts.infer_vllm_v2:infer:461 - >> bigvgan_time: 2.13 seconds
2026-01-07 17:18:06.448 | INFO     | indextts.infer_vllm_v2:infer:462 - >> Total inference time: 32.13 seconds
2026-01-07 17:18:06.449 | INFO     | indextts.infer_vllm_v2:infer:463 - >> Generated audio length: 35.12 seconds
2026-01-07 17:18:06.449 | INFO     | indextts.infer_vllm_v2:infer:464 - >> RTF: 0.9149
2026-01-07 17:18:06.465 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 17:18:06.856 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 17:18:06.875 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [4bee66e70af64239a75fa180dbe00801] [prefill time: 0.0163]
2026-01-07 17:18:18.407 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [4bee66e70af64239a75fa180dbe00801] [decode time: 11.5326] [decode len: 1761]
2026-01-07 17:18:29.940 | INFO     | indextts.infer_vllm_v2:infer:458 - >> gpt_gen_time: 11.62 seconds
2026-01-07 17:18:29.941 | INFO     | indextts.infer_vllm_v2:infer:459 - >> gpt_forward_time: 0.18 seconds
2026-01-07 17:18:29.942 | INFO     | indextts.infer_vllm_v2:infer:460 - >> s2mel_time: 9.59 seconds
2026-01-07 17:18:29.942 | INFO     | indextts.infer_vllm_v2:infer:461 - >> bigvgan_time: 0.08 seconds
2026-01-07 17:18:29.942 | INFO     | indextts.infer_vllm_v2:infer:462 - >> Total inference time: 23.45 seconds
2026-01-07 17:18:29.942 | INFO     | indextts.infer_vllm_v2:infer:463 - >> Generated audio length: 35.12 seconds
2026-01-07 17:18:29.942 | INFO     | indextts.infer_vllm_v2:infer:464 - >> RTF: 0.6677
2026-01-07 17:18:29.960 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 17:18:30.355 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 17:18:30.377 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [991824af0b334ef797bfd3a59d5cced6] [prefill time: 0.0148]
2026-01-07 17:18:35.654 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [991824af0b334ef797bfd3a59d5cced6] [decode time: 5.2770] [decode len: 876]
2026-01-07 17:18:42.032 | INFO     | indextts.infer_vllm_v2:infer:458 - >> gpt_gen_time: 5.34 seconds
2026-01-07 17:18:42.032 | INFO     | indextts.infer_vllm_v2:infer:459 - >> gpt_forward_time: 0.03 seconds
2026-01-07 17:18:42.034 | INFO     | indextts.infer_vllm_v2:infer:460 - >> s2mel_time: 5.48 seconds
2026-01-07 17:18:42.034 | INFO     | indextts.infer_vllm_v2:infer:461 - >> bigvgan_time: 0.84 seconds
2026-01-07 17:18:42.034 | INFO     | indextts.infer_vllm_v2:infer:462 - >> Total inference time: 12.04 seconds
2026-01-07 17:18:42.034 | INFO     | indextts.infer_vllm_v2:infer:463 - >> Generated audio length: 17.45 seconds
2026-01-07 17:18:42.035 | INFO     | indextts.infer_vllm_v2:infer:464 - >> RTF: 0.6902
2026-01-07 17:18:42.049 | INFO     | indextts.infer_vllm_v2:infer:243 - >> start inference...
2026-01-07 17:18:43.081 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:222 - Use the specified emotion vector
2026-01-07 17:18:43.099 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:248 - [8382f6c343604dcab57888aa85574f6c] [prefill time: 0.0162]
2026-01-07 17:18:54.859 | INFO     | indextts.gpt.model_vllm_v2:inference_speech:251 - [8382f6c343604dcab57888aa85574f6c] [decode time: 11.7604] [decode len: 1761]
2026-01-07 17:19:06.392 | INFO     | indextts.infer_vllm_v2:infer:458 - >> gpt_gen_time: 11.84 seconds
2026-01-07 17:19:06.393 | INFO     | indextts.infer_vllm_v2:infer:459 - >> gpt_forward_time: 0.04 seconds
2026-01-07 17:19:06.393 | INFO     | indextts.infer_vllm_v2:infer:460 - >> s2mel_time: 9.74 seconds
2026-01-07 17:19:06.394 | INFO     | indextts.infer_vllm_v2:infer:461 - >> bigvgan_time: 0.08 seconds
2026-01-07 17:19:06.395 | INFO     | indextts.infer_vllm_v2:infer:462 - >> Total inference time: 24.32 seconds
2026-01-07 17:19:06.395 | INFO     | indextts.infer_vllm_v2:infer:463 - >> Generated audio length: 35.12 seconds
2026-01-07 17:19:06.395 | INFO     | indextts.infer_vllm_v2:infer:464 - >> RTF: 0.6925