1. 23 Jul, 2020 1 commit
  2. 22 Jul, 2020 2 commits
    • moto's avatar
      [BC Breaking] Split `list_formats()` for read and write (#811) · f16f74af
      moto authored
      * Separate sox list format function for read and write
      
      * Guard MP3 smoke test
      f16f74af
    • moto's avatar
      Add smoke tests to sox_io and sox_effects (#806) · daa0007a
      moto authored
      Currently all the tests in `sox_io_backend` and `sox_effects` (for new SoX effects implementation) requires additional `sox`, and this prevents running test in environment where `sox` command is not available even though `torchaudio` extension is available (such as fb internal). This PR adds smoke tests for these modules, which just runs functions to see if they do not crash.
      daa0007a
  3. 08 Jul, 2020 1 commit
  4. 01 Jul, 2020 4 commits
  5. 25 Jun, 2020 1 commit
    • moto's avatar
      Add load function (#731) · 793eeab8
      moto authored
      This is a part of PRs to add new "sox_io" backend. #726 and depends on #718 and #728 .
      
      This PR adds `load` function to "sox_io" backend, which is  tested on the following audio formats;
       - `wav`
       - `mp3`
       - `flac`
       - `ogg/vorbis` *
      
      By default, "sox_io" backend returns Tensor with `float32` dtype and the shape of `[channel, time]`. The samples are normalized to fit in the range of `[-1.0, 1.0]`.
      
      Unlike existing "sox" backend, the new `load` function can handle WAV file natively, when the input format is WAV with integer type, (such as 32-bit signed integer, 16-bit signed integer and 8-bit unsigned integer) by providing `normalize=False`, this function can return integer Tensor, where the samples are expressed within the whole range of the corresponding dtype, that is, `int32` tensor for `32-bit PCM`, `int16` for `16-bit PCM` and `uint8` for `8-bit PCM`. This behavior follows [scipy.io.wavfile.read](https://docs.scipy.org/doc/scipy/reference/generated/scipy.io.wavfile.read.html). `normalize` parameter has no effect for other formats and the load function always return normalized value with `float32` Tensor.
      
      __* Note__ The current binary distribution of torchaudio does not contain `ogg/vorbis` and `opus` codecs. To handle these files, one needs to build torchaudio from the source with proper codecs in the system.
      
      __Note 2__ Since this PR, `scipy` becomes required module for running test. 
      793eeab8
  6. 23 Jun, 2020 1 commit
  7. 19 Jun, 2020 1 commit
    • moto's avatar
      Add TorchScript-able "info" func to sox_io backend (#728) · 88fccd14
      moto authored
      This is a part of PRs to add new "sox_io" backend #726, and depends on #718.
      
      This PR adds `info` function to "sox_io" backend, which allows users to fetch some metadata of an audio file. 
      At this moment, the information retrieved are;
      
       - Number of samples in the audio file
       - Sampling rate
       - Number of channels
      88fccd14