Error in inference. Audio output with silence
modify F5-TTS/src/f5_tts/infer/utils_infer.py
1、
asr_pipe = pipeline(
"automatic-speech-recognition",
model="openai/whisper-large-v3-turbo",
# torch_dtype=dtype,
device=device,
)
2、
dtype = torch.float32 # if mel_spec_type == "bigvgan" else None
3、
audio, sr = torchaudio.load(ref_audio, backend="soundfile")