Commits · 47ccabbf9cf49a52cf177230fb7e6646a2c278dc · OpenDAS / Torchaudio

29 Jul, 2021 1 commit
- Add an inference example for wavernn (#1637) · 47ccabbf
  yangarbiter authored Jul 29, 2021
  
  47ccabbf
15 Apr, 2021 1 commit
- Use torchaudio melscale 'slaney' instead of librosa in WaveRNN pipeline preprocessing (#1444) · e061b268
  discort authored Apr 16, 2021
```
* Use torchaudio melscale instead of librosa
```
  e061b268
07 Aug, 2020 1 commit

Add spectrogram normalization option (#863) · 6aafbb6d

jimchen90 authored Aug 07, 2020



* Add spectrogram normalization option
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

6aafbb6d

21 Jul, 2020 1 commit

Add wavernn example pipeline (#749) · fac1bba9

jimchen90 authored Jul 21, 2020

* Add WaveRNN example

This is the pipeline example based on [WaveRNN model](https://github.com/pytorch/audio/pull/735) in torchaudio. The design of this pipeline is inspired by [#632](https://github.com/pytorch/audio/pull/632). It offers a standardized implementation of WaveRNN vocoder in torchaudio.

* Add utils and readme

The metric logger is added based on the Wav2letter pipeline [#632](https://github.com/pytorch/audio/pull/632). It offers the way to parse the standard output as described in readme.

* Add channel dimension

The channel dimension of waveform in datasets is added to match the input dimensions of WaveRNN model because the channel dimensions of waveform and spectrogram are added in [this part] (https://github.com/pytorch/audio/blob/master/torchaudio/models/_wavernn.py#L281) of WaveRNN model.

* Update date split and transform

The design of dataset structure is discussed in [this comment](https://github.com/pytorch/audio/pull/749#discussion_r454627027

). Now the dataset file has a clearer workflow after using the random-split function instead of walking through all the files. All transform functions are put together inside the transforms block.
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

fac1bba9