- 22 Dec, 2020 1 commit
-
-
moto authored
-
- 04 Dec, 2020 1 commit
-
-
moto authored
-
- 01 Dec, 2020 1 commit
-
-
moto authored
-
- 22 Jul, 2020 1 commit
-
-
moto authored
* Replace save/load function with sox effects chain
-
- 16 Jul, 2020 1 commit
-
-
moto authored
* Add sox_utils module
* Make init/shutdown thread safe
* Add sox effects implementation
* Add test for sox effects
* Update docstrings and add examples
-
- 14 Jul, 2020 1 commit
-
-
moto authored
#779 will remove the `frames_per_chunk` parameter from the `save` function, but it will take some time before #779 can land. This commit removes the parameter ahead of #779 to reduce the conflicts caused by the interface change.
-
- 08 Jul, 2020 1 commit
-
-
moto authored
-
- 01 Jul, 2020 1 commit
-
-
moto authored
This is part of the series of PRs that add the new "sox_io" backend (#726), and depends on #718, #728 and #731. This PR adds a `save` function to the "sox_io" backend, which can save a Tensor to a file in the following audio formats: `wav`, `mp3`, `flac` and `ogg/vorbis`.
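To illustrate what saving a tensor as `wav` involves, here is a minimal sketch using only Python's standard library. It is not the "sox_io" implementation: the function name `save_wav_int16` is hypothetical, and the sketch assumes float samples in `[-1.0, 1.0]` laid out as `[channel][time]` and a 16-bit PCM output.

```python
import io
import struct
import wave


def save_wav_int16(fileobj, channels, sample_rate):
    """Write float samples laid out as [channel][time] to a 16-bit PCM WAV.

    Hypothetical helper for illustration; not part of torchaudio.
    """
    num_frames = len(channels[0])
    frames = bytearray()
    for t in range(num_frames):
        for ch in channels:  # WAV stores samples interleaved, frame by frame
            clipped = max(-1.0, min(1.0, ch[t]))  # guard against overflow
            frames += struct.pack("<h", int(round(clipped * 32767)))
    with wave.open(fileobj, "wb") as w:
        w.setnchannels(len(channels))
        w.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        w.setframerate(sample_rate)
        w.writeframes(bytes(frames))


# Usage: write two channels of three samples each into an in-memory buffer.
buffer = io.BytesIO()
save_wav_int16(buffer, [[0.0, 1.0, -1.0], [0.0, 0.0, 0.0]], 16000)
```

Compressed formats such as `mp3`, `flac` and `ogg/vorbis` need an encoder, which is why the backend delegates to libsox rather than writing bytes by hand.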
-
- 25 Jun, 2020 1 commit
-
-
moto authored
This is part of the series of PRs that add the new "sox_io" backend (#726), and depends on #718 and #728. This PR adds a `load` function to the "sox_io" backend, which is tested on the following audio formats: `wav`, `mp3`, `flac` and `ogg/vorbis` (see Note).
By default, the "sox_io" backend returns a Tensor with `float32` dtype and shape `[channel, time]`. The samples are normalized to fit in the range `[-1.0, 1.0]`.
Unlike the existing "sox" backend, the new `load` function handles WAV files natively. When the input is a WAV file with an integer sample type (such as 32-bit signed integer, 16-bit signed integer or 8-bit unsigned integer), passing `normalize=False` makes the function return an integer Tensor whose samples span the whole range of the corresponding dtype: `int32` for `32-bit PCM`, `int16` for `16-bit PCM` and `uint8` for `8-bit PCM`. This behavior follows [scipy.io.wavfile.read](https://docs.scipy.org/doc/scipy/reference/generated/scipy.io.wavfile.read.html). The `normalize` parameter has no effect on other formats, for which `load` always returns normalized values in a `float32` Tensor.
__Note__ The current binary distribution of torchaudio does not contain the `ogg/vorbis` and `opus` codecs. To handle these files, one needs to build torchaudio from source with the proper codecs installed on the system.
__Note 2__ Since this PR, `scipy` is a required module for running the tests.
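The difference between normalized and raw integer samples can be sketched in plain Python. This is an illustration, not the backend's code: the helper `normalize_pcm` and the table `PCM_SPECS` are hypothetical, and the scaling convention (divide by the magnitude of the most negative value, after re-centring unsigned 8-bit data on 128) is an assumption about how integer PCM is commonly mapped to `[-1.0, 1.0]`.

```python
# Illustrative only: not torchaudio's implementation.
# Assumed convention: (sample - offset) / scale, where scale is the
# magnitude of the most negative representable value.
PCM_SPECS = {
    "uint8": (128, 128),        # 8-bit unsigned PCM is centred on 128
    "int16": (0, 2 ** 15),      # 16-bit signed PCM
    "int32": (0, 2 ** 31),      # 32-bit signed PCM
}


def normalize_pcm(samples, dtype):
    """Map raw integer PCM samples to floats in [-1.0, 1.0]."""
    offset, scale = PCM_SPECS[dtype]
    return [(s - offset) / scale for s in samples]


# Raw values are what `normalize=False` would preserve for WAV input;
# the scaled values correspond to the default, normalized output.
raw_int16 = [-32768, 0, 16384]
print(normalize_pcm(raw_int16, "int16"))
```

Keeping the raw integers is useful when a bit-exact round trip matters, whereas the normalized floats are convenient as model input.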
-
- 19 Jun, 2020 1 commit
-
-
moto authored
This is part of the series of PRs that add the new "sox_io" backend (#726), and depends on #718. This PR adds an `info` function to the "sox_io" backend, which lets users fetch metadata of an audio file. At the moment, the following information is retrieved: the number of samples in the audio file, the sampling rate, and the number of channels.
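As a rough analogy for the metadata listed above, the same three values can be read from a WAV file with Python's standard `wave` module. This sketch does not use the "sox_io" backend: `wav_info` and `make_silent_wav` are hypothetical helpers, and the sketch assumes that "number of samples" means frames per channel.

```python
import io
import wave


def wav_info(fileobj):
    """Return frame count, sampling rate and channel count of a WAV file."""
    with wave.open(fileobj, "rb") as w:
        return {
            "num_frames": w.getnframes(),      # samples per channel (assumed meaning)
            "sample_rate": w.getframerate(),
            "num_channels": w.getnchannels(),
        }


def make_silent_wav(sample_rate, num_channels, num_frames):
    """Build an in-memory 16-bit PCM WAV containing silence (test fixture)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(num_channels)
        w.setsampwidth(2)
        w.setframerate(sample_rate)
        w.writeframes(b"\x00\x00" * num_channels * num_frames)
    buf.seek(0)
    return buf


print(wav_info(make_silent_wav(8000, 2, 100)))
```

Reading only the header in this way avoids decoding the audio data, which is the main reason to offer metadata lookup separately from loading.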
-