Commits · 00cc000e35880b280bb681cba2c53b40ec1a52de · OpenDAS / Torchaudio

22 Jul, 2020 3 commits

Rename test_case_utils to case_utils (#808) · 00cc000e
moto authored Jul 22, 2020
```
buck gets confused with utility module name with `test_` prefix.
```
00cc000e

Refactor test_sox_effects (#805) · 05746042

moto authored Jul 22, 2020

1. Move misplaced sox compatibility test (T,Fade, T.Vol, T.Vad) to test/test_sox_compatibility.py
2. Move test_sox_effects to test/sox_effect/ where all the other functionalities from torchaudio.sox_effects are tested

05746042

Add smoke tests to sox_io and sox_effects (#806) · daa0007a

moto authored Jul 22, 2020

Currently all the tests in `sox_io_backend` and `sox_effects` (for new SoX effects implementation) requires additional `sox`, and this prevents running test in environment where `sox` command is not available even though `torchaudio` extension is available (such as fb internal). This PR adds smoke tests for these modules, which just runs functions to see if they do not crash.

daa0007a

21 Jul, 2020 1 commit
- Remove if __name__ == __main__ from test code (#804) · 3781cb23
  top0coder authored Jul 21, 2020
```
Co-authored-by: Jeff Zhang <jeffzhang@fb.com>
```
  3781cb23
20 Jul, 2020 1 commit

Add LibriTTS dataset (#790) · 4b8aad7a

jimchen90 authored Jul 20, 2020



* Add libritts

Add LibriTTS dataset draft

* Add libritts

Use two separate ids for utterance_id.

* Update output form

Use full_id as utterance_id.

* Update format

Add space and test black format

* Update test method

* Add audio and text test

Generate audio and test files on-the-fly in test 

* Update format

* Fix test error and remove assets libritts

The test error is fixed by sorting the file in 4th element instead of 2nd element in samples. Since the files are generated on-the-fly, so the the libritts files in assets are removed.

* Add seed in `get_whitenoise` function

* Change utterance to text

Change `_utterance` to `_text`.
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

4b8aad7a

17 Jul, 2020 2 commits

Update variable names in wavernn model (#797) · 209858ea

jimchen90 authored Jul 17, 2020



* Change the name of  n_output and n_hidden

* Replace the mode by n_classes and sample_rate

* Change the definition of n_output and n_hidden
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

209858ea

Changed GTZAN so that it only traverses filenames belonging to the dataset (#791) · 47eb1e6a

Emmanouil Theofanis Chourdakis authored Jul 17, 2020

* Addressed review issues in PR #668

* Changed GTZAN so that it only traverses filenames belonging to the dataset

Now, instead of walking the whole directory and subdirectories of the dataset
GTZAN only looks for files under a `genre`/`genre`.`5 digit number`.wav format, where `genre` is an allowed GTZAN genre label.
This allows moving or removing files from the dataset (e.g. for fixing duplication or mislabeling issues).

47eb1e6a

16 Jul, 2020 3 commits

Generate YESNO dataset on-the-fly for test (#792) · 102174e9
moto authored Jul 16, 2020

102174e9

Get rid of whitenoise and sinewave files from test (#783) · 02b898ff

engineerchuan authored Jul 16, 2020



* Get rid of sine wave files and whitenoise files
* Refactor integer encoding
* Relax rtol from 1e-8 to 1e-7 for compliance kaldi
* relax waveform multi channel resample atol to 1e-7 from 1e-8
* relax tolerance for length consistency for speed effect
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

02b898ff

Add Torchscript sox effects (#760) · 60a8e23d

moto authored Jul 15, 2020

* Add sox_utils module

* Make init/shutdown thread safe

* Add sox effects implementation

* Add test for sox effects

* Update docstrings and add examples

60a8e23d

14 Jul, 2020 3 commits

Do not use SoxEffectsChain in sox compatibility test (#781) · db8f2bf3

moto authored Jul 14, 2020

This PR replaces `torchaudio.sox_effects.SoxEffectsChain` in `test_sox_compatibility` with bare `sox` command.

The parity of `torchaudio.sox_effects.SoxEffectsChain` against `sox` command is not tested and it has known issues https://github.com/pytorch/audio/issues/771, therefore it is not appropriate to use this class for testing other functions.

db8f2bf3

Skip lowpass_speed on macOS (#782) · 131e48b6

moto authored Jul 14, 2020

`test/test_sox_effects.py::Test_SoxEffectsChain::test_lowpass_speed` has some issue on our macOS CI, even though there was no issue at #777 .

While we figure out the fix, we disable this test for macOS.

131e48b6

Stop using whitenoise.wav, mp3 and torchaudio.load in sox effect test · d11ad6bb

engineerchuan authored Jul 14, 2020

Part of #764

 - Replace `whitenoise.wav` with on-the-fly data generation
 - Replace `torchaudio.load` with `common_utils.load_wav`
 - Replace `steam-train-whistle-daniel_simon.mp3` with `.wav`

d11ad6bb

13 Jul, 2020 1 commit
- Use default backend for TestCommonVoice (#775) · c9142fd5
  engineerchuan authored Jul 13, 2020
```
* Change 'sox' to 'default'
```
  c9142fd5
12 Jul, 2020 1 commit
- Convert CommonVoice test asset to wav, and remove unused test asset (#772) · 26941fa3
  engineerchuan authored Jul 11, 2020
```
* converted CommonVoice tartar mp3 to wav using rate 8000 Hz

* Remove Unused dtmf_30s_stereo.mp3
```
  26941fa3
08 Jul, 2020 3 commits

Add Waveforms for Testing Purposes section to test/README.md (#759) · c375490f

Artyom Astafurov authored Jul 08, 2020



* add Waveforms for Testing Purposes section

* Update test/README.md

use wrapper function for scipy.io.wavfile.read
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

* remove un-used files from the doc

* Update test/README.md

Rename variable
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

* fix indent; remove mentions of unused files

* remove whitenoise* files from README.md
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

c375490f

Get rid of typedefs/SignalInfo and replace AudioMetaData (#761) · 180ede8e
moto authored Jul 08, 2020

180ede8e

Add WaveRNN Model (#735) · 68cc72da

jimchen90 authored Jul 07, 2020



* upsamplenetwork

* update variable names

* update variable name

* add wavernn model

* update test

* update format

* update format

* update format

* fix conflicts and add transpose

* import update

* update transpose

* update format

* update docstring

* add n_channel in input

* add comment

* update docstring

* update docstring
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

68cc72da

06 Jul, 2020 1 commit
- Replace torchaudio.load in test with scipy func (#762) · e43ee196
  moto authored Jul 06, 2020
  
  e43ee196
01 Jul, 2020 5 commits

Add sox_io_backend (#726) · 4b583eab
moto authored Jul 01, 2020

4b583eab
Add opus support (#755) · 894959a7
moto authored Jul 01, 2020

894959a7
Refactor test utilities (#756) · a20da5e3
moto authored Jul 01, 2020

a20da5e3

UpsampleNetwork (#724) · 6b159054

jimchen90 authored Jul 01, 2020



* upsamplenetwork

* update name

* update name and docstring

* update format

* rebase

* update docstring

* update docstring

* remove transpose and update docstring
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

6b159054

Add TorchScript-able "save" func to sox_io backend (#732) · 3324283c

moto authored Jul 01, 2020

This is a part of PRs to add new "sox_io" backend. #726 and depends on #718, #728 and #731.

This PR adds `save` function to "sox_io" backend, which can save Tensor to a file with the following audio formats;
 - `wav`
 - `mp3`
 - `flac`
 - `ogg/vorbis`

3324283c

29 Jun, 2020 1 commit

Update MelResNet (#751) · 878d3dac

jimchen90 authored Jun 29, 2020



* update varible names and docstring

* update format

* update docsting and output value
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

878d3dac

25 Jun, 2020 1 commit

Add load function (#731) · 793eeab8

moto authored Jun 25, 2020

This is a part of PRs to add new "sox_io" backend. #726 and depends on #718 and #728 .

This PR adds `load` function to "sox_io" backend, which is  tested on the following audio formats;
 - `wav`
 - `mp3`
 - `flac`
 - `ogg/vorbis` *

By default, "sox_io" backend returns Tensor with `float32` dtype and the shape of `[channel, time]`. The samples are normalized to fit in the range of `[-1.0, 1.0]`.

Unlike existing "sox" backend, the new `load` function can handle WAV file natively, when the input format is WAV with integer type, (such as 32-bit signed integer, 16-bit signed integer and 8-bit unsigned integer) by providing `normalize=False`, this function can return integer Tensor, where the samples are expressed within the whole range of the corresponding dtype, that is, `int32` tensor for `32-bit PCM`, `int16` for `16-bit PCM` and `uint8` for `8-bit PCM`. This behavior follows [scipy.io.wavfile.read](https://docs.scipy.org/doc/scipy/reference/generated/scipy.io.wavfile.read.html). `normalize` parameter has no effect for other formats and the load function always return normalized value with `float32` Tensor.

__* Note__ The current binary distribution of torchaudio does not contain `ogg/vorbis` and `opus` codecs. To handle these files, one needs to build torchaudio from the source with proper codecs in the system.

__Note 2__ Since this PR, `scipy` becomes required module for running test.

793eeab8

23 Jun, 2020 2 commits

Add subclass in model test classes (#727) · b8ddeb35

jimchen90 authored Jun 23, 2020



* add unittest in test_models

* update test method

* remove unittest main function
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

b8ddeb35

Fix SignalInfo member name to frame (#734) · e0f4c0ec

moto authored Jun 23, 2020

This PR fixes the wrong member name of SignalInfo introduced in #718. 

 - `num_samples` == `num_frames` * `num_channels`.

e0f4c0ec

19 Jun, 2020 1 commit

Add TorchScript-able "info" func to sox_io backend (#728) · 88fccd14

moto authored Jun 19, 2020

This is a part of PRs to add new "sox_io" backend #726, and depends on #718.

This PR adds `info` function to "sox_io" backend, which allows users to fetch some metadata of an audio file. 
At this moment, the information retrieved are;

 - Number of samples in the audio file
 - Sampling rate
 - Number of channels

88fccd14

18 Jun, 2020 1 commit

Make TestCases backend-aware (#719) · b17da7a4

moto authored Jun 18, 2020

* Make tests backend aware by introducing TorchaudioTestCase and reset backend for each TestCase.

* Set backends for the test cases that require specific backend.

b17da7a4

16 Jun, 2020 3 commits

Add MelResNet Block (#705) · 4318fc5c

jimchen90 authored Jun 16, 2020



* Add MelResNet Block

* add default value

* update model and test

* rebase and small changes

* add pad variable

* update format

* update reference in docstrings

* add underscore name
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

4318fc5c

update wav2letter test (#722) · ab733e7b
jimchen90 authored Jun 16, 2020
```
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>
```
ab733e7b

Refactor backend and not rely on global variables on switching (#698) · e9f19c35

moto authored Jun 15, 2020

* Refactor backend switching

1. Do not rely on global variables for backend switch
   So that load/save/info/load_wav functions will be torchscript-able
2. Add no_backend module to for the case there is no backend module available
   [bonus] This allows the whole codebase importable on systems that do not have torchaudio C++ extension nor soundfile.

e9f19c35

11 Jun, 2020 3 commits

Get rid of dynamic test suite generation (#716) · 08217121

moto authored Jun 11, 2020

`type` used in `common_utils` generates test class definition in `common_utils` and
this modifies the module state after it's imported. This is anti-pattern.
This PR get rid of the related utility functions and define test suite manually.

08217121

Fix integer division (#714) · 50939b75
moto authored Jun 11, 2020

50939b75

Change parameterized testing system to be compatible with unittest (#712) · d2724481

moto authored Jun 11, 2020



* Change parameterized testing system to be compatible with unittest

Summary: The previous implementation of parameterized testing worked by modifying test.common_utils inplace.  This doesn't work in general because unittest's contract with test modules is such that it must be able to load the module and run the test itself.  Because the previous implementation needed to load the module and modify it, it is incompatible.

Reviewed By: mthrok

Differential Revision: D21964676
Co-authored-by: Ben Mehne <bmehne@fb.com>

d2724481

10 Jun, 2020 1 commit

Add cmu_arctic dataset (#710) · 55b5c80c

jimchen90 authored Jun 10, 2020



* Add cmu_arctic dataset

* add dataset name

* update audio test file with whitenoise.wav file

* add test text file

* update text method and file name

* update comment

* change datasets order in doc

* add line length
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

55b5c80c

08 Jun, 2020 3 commits
- Add backend module and isolate backend-related functionalities (#700) · e3642896
  moto authored Jun 08, 2020
  
  e3642896
- List backends dynamically based on availability (#697) · 2fd32dd0
  moto authored Jun 08, 2020
  
  2fd32dd0
- Clean up migrated Kaldi compliance test files (#703) · 0bd91484
  Bhargav Kathivarapu authored Jun 08, 2020
```
* kaldi compliance files cleanup for spec, fbank, mfcc

* kaldi compliance tests removal for spec, fbank, mfcc
Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>
```
  0bd91484