1. 20 Jul, 2020 2 commits
    • Update default form in docstring (#802) · e82cc350
      jimchen90 authored
      
      
      * Update default form in docstring
      Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>
    • Add LibriTTS dataset (#790) · 4b8aad7a
      jimchen90 authored
      
      
      * Add libritts
      
      Add LibriTTS dataset draft
      
      * Add libritts
      
      Use two separate ids for utterance_id.
      
      * Update output form
      
      Use full_id as utterance_id.
      
      * Update format
      
      Add space and test black format
      
      * Update test method
      
      * Add audio and text test
      
      Generate audio and test files on-the-fly in test 
      
      * Update format
      
      * Fix test error and remove assets libritts
      
      The test error is fixed by sorting the files by the 4th element of each sample instead of the 2nd. Since the files are now generated on-the-fly, the libritts files in assets are removed.
      
      * Add seed in `get_whitenoise` function
      
      * Change utterance to text
      
      Change `_utterance` to `_text`.
      Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>
  2. 17 Jul, 2020 2 commits
  3. 16 Jul, 2020 4 commits
  4. 14 Jul, 2020 5 commits
  5. 13 Jul, 2020 1 commit
  6. 12 Jul, 2020 1 commit
  7. 08 Jul, 2020 3 commits
  8. 06 Jul, 2020 2 commits
  9. 01 Jul, 2020 6 commits
  10. 30 Jun, 2020 1 commit
  11. 29 Jun, 2020 1 commit
  12. 26 Jun, 2020 2 commits
  13. 25 Jun, 2020 2 commits
    • Add load function (#731) · 793eeab8
      moto authored
      This is one of the PRs that add the new "sox_io" backend (#726). It depends on #718 and #728.
      
      This PR adds a `load` function to the "sox_io" backend, which is tested on the following audio formats:
       - `wav`
       - `mp3`
       - `flac`
       - `ogg/vorbis` *
      
      By default, the "sox_io" backend returns a Tensor with `float32` dtype and shape `[channel, time]`. The samples are normalized to fit in the range `[-1.0, 1.0]`.
      
      Unlike the existing "sox" backend, the new `load` function can handle WAV files natively. When the input is a WAV file with an integer sample type (32-bit signed integer, 16-bit signed integer, or 8-bit unsigned integer), passing `normalize=False` makes the function return an integer Tensor whose samples are expressed within the whole range of the corresponding dtype: `int32` for `32-bit PCM`, `int16` for `16-bit PCM`, and `uint8` for `8-bit PCM`. This behavior follows [scipy.io.wavfile.read](https://docs.scipy.org/doc/scipy/reference/generated/scipy.io.wavfile.read.html). The `normalize` parameter has no effect for other formats; for those, the `load` function always returns normalized values as a `float32` Tensor.
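
The difference between the raw and normalized views of 16-bit PCM data can be illustrated without torchaudio. The following is a minimal sketch that uses only the Python standard library. The helper names `write_pcm16_wav` and `read_pcm16_wav` are hypothetical, the output is a flat list for a mono file rather than a `[channel, time]` Tensor, and the scale factor of `2**15` is an assumption about how 16-bit samples map into `[-1.0, 1.0]`, not a statement about the "sox_io" implementation.

```python
import io
import struct
import wave


def write_pcm16_wav(samples, sample_rate=16000):
    """Return the bytes of a mono 16-bit PCM WAV file holding `samples`."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as writer:
        writer.setnchannels(1)
        writer.setsampwidth(2)  # 2 bytes per sample, i.e. 16-bit PCM
        writer.setframerate(sample_rate)
        writer.writeframes(struct.pack(f"<{len(samples)}h", *samples))
    return buffer.getvalue()


def read_pcm16_wav(data, normalize=True):
    """Read mono 16-bit PCM WAV bytes (hypothetical helper, not torchaudio).

    With normalize=False the raw int16 values are returned unchanged.
    With normalize=True each sample is divided by 2**15 (an assumed
    scale factor) so that the result lies in [-1.0, 1.0).
    """
    with wave.open(io.BytesIO(data), "rb") as reader:
        if reader.getsampwidth() != 2 or reader.getnchannels() != 1:
            raise ValueError("this sketch only handles mono 16-bit PCM")
        frame_count = reader.getnframes()
        raw = struct.unpack(f"<{frame_count}h", reader.readframes(frame_count))
    if not normalize:
        return list(raw)
    return [sample / 32768.0 for sample in raw]


wav_bytes = write_pcm16_wav([-32768, 0, 16384, 32767])
raw_samples = read_pcm16_wav(wav_bytes, normalize=False)
float_samples = read_pcm16_wav(wav_bytes, normalize=True)
```

Here `raw_samples` keeps the full `int16` range, which corresponds to the `normalize=False` case described above, while `float_samples` holds the same data rescaled to floating point.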
      
      __* Note__ The current binary distribution of torchaudio does not contain the `ogg/vorbis` and `opus` codecs. To handle these files, one needs to build torchaudio from source with the proper codecs installed on the system.
      
      __Note 2__ As of this PR, `scipy` is a required module for running the tests.
    • moto's avatar
      0f0d0af3
  14. 24 Jun, 2020 2 commits
  15. 23 Jun, 2020 5 commits
  16. 22 Jun, 2020 1 commit