- 22 Jul, 2020 3 commits
-
-
moto authored
buck gets confused with utility module name with `test_` prefix.
-
moto authored
1. Move misplaced sox compatibility test (T,Fade, T.Vol, T.Vad) to test/test_sox_compatibility.py 2. Move test_sox_effects to test/sox_effect/ where all the other functionalities from torchaudio.sox_effects are tested
-
moto authored
Currently all the tests in `sox_io_backend` and `sox_effects` (for new SoX effects implementation) requires additional `sox`, and this prevents running test in environment where `sox` command is not available even though `torchaudio` extension is available (such as fb internal). This PR adds smoke tests for these modules, which just runs functions to see if they do not crash.
-
- 21 Jul, 2020 1 commit
-
-
top0coder authored
Co-authored-by:Jeff Zhang <jeffzhang@fb.com>
-
- 20 Jul, 2020 1 commit
-
-
jimchen90 authored
* Add libritts Add LibriTTS dataset draft * Add libritts Use two separate ids for utterance_id. * Update output form Use full_id as utterance_id. * Update format Add space and test black format * Update test method * Add audio and text test Generate audio and test files on-the-fly in test * Update format * Fix test error and remove assets libritts The test error is fixed by sorting the file in 4th element instead of 2nd element in samples. Since the files are generated on-the-fly, so the the libritts files in assets are removed. * Add seed in `get_whitenoise` function * Change utterance to text Change `_utterance` to `_text`. Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 17 Jul, 2020 2 commits
-
-
jimchen90 authored
* Change the name of n_output and n_hidden * Replace the mode by n_classes and sample_rate * Change the definition of n_output and n_hidden Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
Emmanouil Theofanis Chourdakis authored
* Addressed review issues in PR #668 * Changed GTZAN so that it only traverses filenames belonging to the dataset Now, instead of walking the whole directory and subdirectories of the dataset GTZAN only looks for files under a `genre`/`genre`.`5 digit number`.wav format, where `genre` is an allowed GTZAN genre label. This allows moving or removing files from the dataset (e.g. for fixing duplication or mislabeling issues).
-
- 16 Jul, 2020 3 commits
-
-
moto authored
-
engineerchuan authored
* Get rid of sine wave files and whitenoise files * Refactor integer encoding * Relax rtol from 1e-8 to 1e-7 for compliance kaldi * relax waveform multi channel resample atol to 1e-7 from 1e-8 * relax tolerance for length consistency for speed effect Co-authored-by:moto <855818+mthrok@users.noreply.github.com>
-
moto authored
* Add sox_utils module * Make init/shutdown thread safe * Add sox effects implementation * Add test for sox effects * Update docstrings and add examples
-
- 14 Jul, 2020 3 commits
-
-
moto authored
This PR replaces `torchaudio.sox_effects.SoxEffectsChain` in `test_sox_compatibility` with bare `sox` command. The parity of `torchaudio.sox_effects.SoxEffectsChain` against `sox` command is not tested and it has known issues https://github.com/pytorch/audio/issues/771, therefore it is not appropriate to use this class for testing other functions.
-
moto authored
`test/test_sox_effects.py::Test_SoxEffectsChain::test_lowpass_speed` has some issue on our macOS CI, even though there was no issue at #777 . While we figure out the fix, we disable this test for macOS.
-
engineerchuan authored
Part of #764 - Replace `whitenoise.wav` with on-the-fly data generation - Replace `torchaudio.load` with `common_utils.load_wav` - Replace `steam-train-whistle-daniel_simon.mp3` with `.wav`
-
- 13 Jul, 2020 1 commit
-
-
engineerchuan authored
* Change 'sox' to 'default'
-
- 12 Jul, 2020 1 commit
-
-
engineerchuan authored
* converted CommonVoice tartar mp3 to wav using rate 8000 Hz * Remove Unused dtmf_30s_stereo.mp3
-
- 08 Jul, 2020 3 commits
-
-
Artyom Astafurov authored
* add Waveforms for Testing Purposes section * Update test/README.md use wrapper function for scipy.io.wavfile.read Co-authored-by:
moto <855818+mthrok@users.noreply.github.com> * remove un-used files from the doc * Update test/README.md Rename variable Co-authored-by:
moto <855818+mthrok@users.noreply.github.com> * fix indent; remove mentions of unused files * remove whitenoise* files from README.md Co-authored-by:
moto <855818+mthrok@users.noreply.github.com>
-
moto authored
-
jimchen90 authored
* upsamplenetwork * update variable names * update variable name * add wavernn model * update test * update format * update format * update format * fix conflicts and add transpose * import update * update transpose * update format * update docstring * add n_channel in input * add comment * update docstring * update docstring Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 06 Jul, 2020 1 commit
-
-
moto authored
-
- 01 Jul, 2020 5 commits
-
-
moto authored
-
moto authored
-
moto authored
-
jimchen90 authored
* upsamplenetwork * update name * update name and docstring * update format * rebase * update docstring * update docstring * remove transpose and update docstring Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
moto authored
This is a part of PRs to add new "sox_io" backend. #726 and depends on #718, #728 and #731. This PR adds `save` function to "sox_io" backend, which can save Tensor to a file with the following audio formats; - `wav` - `mp3` - `flac` - `ogg/vorbis`
-
- 29 Jun, 2020 1 commit
-
-
jimchen90 authored
* update varible names and docstring * update format * update docsting and output value Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 25 Jun, 2020 1 commit
-
-
moto authored
This is a part of PRs to add new "sox_io" backend. #726 and depends on #718 and #728 . This PR adds `load` function to "sox_io" backend, which is tested on the following audio formats; - `wav` - `mp3` - `flac` - `ogg/vorbis` * By default, "sox_io" backend returns Tensor with `float32` dtype and the shape of `[channel, time]`. The samples are normalized to fit in the range of `[-1.0, 1.0]`. Unlike existing "sox" backend, the new `load` function can handle WAV file natively, when the input format is WAV with integer type, (such as 32-bit signed integer, 16-bit signed integer and 8-bit unsigned integer) by providing `normalize=False`, this function can return integer Tensor, where the samples are expressed within the whole range of the corresponding dtype, that is, `int32` tensor for `32-bit PCM`, `int16` for `16-bit PCM` and `uint8` for `8-bit PCM`. This behavior follows [scipy.io.wavfile.read](https://docs.scipy.org/doc/scipy/reference/generated/scipy.io.wavfile.read.html). `normalize` parameter has no effect for other formats and the load function always return normalized value with `float32` Tensor. __* Note__ The current binary distribution of torchaudio does not contain `ogg/vorbis` and `opus` codecs. To handle these files, one needs to build torchaudio from the source with proper codecs in the system. __Note 2__ Since this PR, `scipy` becomes required module for running test.
-
- 23 Jun, 2020 2 commits
-
-
jimchen90 authored
* add unittest in test_models * update test method * remove unittest main function Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
moto authored
This PR fixes the wrong member name of SignalInfo introduced in #718. - `num_samples` == `num_frames` * `num_channels`.
-
- 19 Jun, 2020 1 commit
-
-
moto authored
This is a part of PRs to add new "sox_io" backend #726, and depends on #718. This PR adds `info` function to "sox_io" backend, which allows users to fetch some metadata of an audio file. At this moment, the information retrieved are; - Number of samples in the audio file - Sampling rate - Number of channels
-
- 18 Jun, 2020 1 commit
-
-
moto authored
* Make tests backend aware by introducing TorchaudioTestCase and reset backend for each TestCase. * Set backends for the test cases that require specific backend.
-
- 16 Jun, 2020 3 commits
-
-
jimchen90 authored
* Add MelResNet Block * add default value * update model and test * rebase and small changes * add pad variable * update format * update reference in docstrings * add underscore name Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
jimchen90 authored
Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
moto authored
* Refactor backend switching 1. Do not rely on global variables for backend switch So that load/save/info/load_wav functions will be torchscript-able 2. Add no_backend module to for the case there is no backend module available [bonus] This allows the whole codebase importable on systems that do not have torchaudio C++ extension nor soundfile.
-
- 11 Jun, 2020 3 commits
-
-
moto authored
`type` used in `common_utils` generates test class definition in `common_utils` and this modifies the module state after it's imported. This is anti-pattern. This PR get rid of the related utility functions and define test suite manually.
-
moto authored
-
moto authored
* Change parameterized testing system to be compatible with unittest Summary: The previous implementation of parameterized testing worked by modifying test.common_utils inplace. This doesn't work in general because unittest's contract with test modules is such that it must be able to load the module and run the test itself. Because the previous implementation needed to load the module and modify it, it is incompatible. Reviewed By: mthrok Differential Revision: D21964676 Co-authored-by:Ben Mehne <bmehne@fb.com>
-
- 10 Jun, 2020 1 commit
-
-
jimchen90 authored
* Add cmu_arctic dataset * add dataset name * update audio test file with whitenoise.wav file * add test text file * update text method and file name * update comment * change datasets order in doc * add line length Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 08 Jun, 2020 3 commits
-
-
moto authored
-
moto authored
-
Bhargav Kathivarapu authored
* kaldi compliance files cleanup for spec, fbank, mfcc * kaldi compliance tests removal for spec, fbank, mfcc Signed-off-by:Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>
-