Update data augmentation tutorial (#2595)

Summary: In https://github.com/pytorch/audio/pull/2285, the SNR calculation was fixed, but there was still one that was not fixed. This commit fixes it. Also following the feedback https://github.com/pytorch/tutorials/issues/1930#issuecomment-1199741336, update the variable name. Pull Request resolved: https://github.com/pytorch/audio/pull/2595 Reviewed By: carolineechen Differential Revision: D38314672 Pulled By: mthrok fbshipit-source-id: b2015e2709729190d97264aa191651b3af4ba856

Update data augmentation tutorial (#2595)
Summary: In https://github.com/pytorch/audio/pull/2285, the SNR calculation was fixed, but there was still one that was not fixed. This commit fixes it. Also following the feedback https://github.com/pytorch/tutorials/issues/1930#issuecomment-1199741336, update the variable name. Pull Request resolved: https://github.com/pytorch/audio/pull/2595 Reviewed By: carolineechen Differential Revision: D38314672 Pulled By: mthrok fbshipit-source-id: b2015e2709729190d97264aa191651b3af4ba856
f1443b8f · moto · Facebook GitHub Bot · e502df01 · f1443b8f
Commit f1443b8f authored Aug 01, 2022 by moto Committed by Facebook GitHub Bot Aug 01, 2022
Hide whitespace changes
Inline Side-by-side

Showing with 4 additions and 4 deletions

examples/tutorials/audio_data_augmentation_tutorial.py examples/tutorials/audio_data_augmentation_tutorial.py +4 -4

No files found.
--- a/examples/tutorials/audio_data_augmentation_tutorial.py
+++ b/examples/tutorials/audio_data_augmentation_tutorial.py
@@ -239,14 +239,14 @@ speech, _ = torchaudio.load(SAMPLE_SPEECH)
 noise, _ = torchaudio.load(SAMPLE_NOISE)
 noise = noise[:, : speech.shape[1]]

-speech_power = speech.norm(p=2)
-noise_power = noise.norm(p=2)
+speech_rms = speech.norm(p=2)
+noise_rms = noise.norm(p=2)

 snr_dbs = [20, 10, 3]
 noisy_speeches = []
 for snr_db in snr_dbs:
    snr = 10 ** (snr_db / 20)
-    scale = snr * noise_power / speech_power
+    scale = snr * noise_rms / speech_rms
    noisy_speeches.append((scale * speech + noise) / 2)

 ######################################################################
@@ -376,7 +376,7 @@ noise, _ = torchaudio.load(SAMPLE_NOISE)
 noise = noise[:, : rir_applied.shape[1]]

 snr_db = 8
-scale = math.exp(snr_db / 10) * noise.norm(p=2) / rir_applied.norm(p=2)
+scale = (10 ** (snr_db / 20)) * noise.norm(p=2) / rir_applied.norm(p=2)
 bg_added = (scale * rir_applied + noise) / 2

 plot_specgram(bg_added, sample_rate, title="BG noise added")