- 20 Jul, 2020 3 commits
-
-
jimchen90 authored
Fix output type of upsampling Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
jimchen90 authored
* Update default form in docstring Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
jimchen90 authored
* Add libritts Add LibriTTS dataset draft * Add libritts Use two separate ids for utterance_id. * Update output form Use full_id as utterance_id. * Update format Add space and test black format * Update test method * Add audio and text test Generate audio and test files on-the-fly in test * Update format * Fix test error and remove assets libritts The test error is fixed by sorting the file in 4th element instead of 2nd element in samples. Since the files are generated on-the-fly, so the the libritts files in assets are removed. * Add seed in `get_whitenoise` function * Change utterance to text Change `_utterance` to `_text`. Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 17 Jul, 2020 2 commits
-
-
jimchen90 authored
* Change the name of n_output and n_hidden * Replace the mode by n_classes and sample_rate * Change the definition of n_output and n_hidden Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
Emmanouil Theofanis Chourdakis authored
* Addressed review issues in PR #668 * Changed GTZAN so that it only traverses filenames belonging to the dataset Now, instead of walking the whole directory and subdirectories of the dataset GTZAN only looks for files under a `genre`/`genre`.`5 digit number`.wav format, where `genre` is an allowed GTZAN genre label. This allows moving or removing files from the dataset (e.g. for fixing duplication or mislabeling issues).
-
- 16 Jul, 2020 4 commits
-
-
moto authored
-
engineerchuan authored
* Get rid of sine wave files and whitenoise files * Refactor integer encoding * Relax rtol from 1e-8 to 1e-7 for compliance kaldi * relax waveform multi channel resample atol to 1e-7 from 1e-8 * relax tolerance for length consistency for speed effect Co-authored-by:moto <855818+mthrok@users.noreply.github.com>
-
moto authored
-
moto authored
* Add sox_utils module * Make init/shutdown thread safe * Add sox effects implementation * Add test for sox effects * Update docstrings and add examples
-
- 14 Jul, 2020 5 commits
-
-
moto authored
This PR replaces `torchaudio.sox_effects.SoxEffectsChain` in `test_sox_compatibility` with bare `sox` command. The parity of `torchaudio.sox_effects.SoxEffectsChain` against `sox` command is not tested and it has known issues https://github.com/pytorch/audio/issues/771, therefore it is not appropriate to use this class for testing other functions.
-
moto authored
`test/test_sox_effects.py::Test_SoxEffectsChain::test_lowpass_speed` has some issue on our macOS CI, even though there was no issue at #777 . While we figure out the fix, we disable this test for macOS.
-
engineerchuan authored
Part of #764 - Replace `whitenoise.wav` with on-the-fly data generation - Replace `torchaudio.load` with `common_utils.load_wav` - Replace `steam-train-whistle-daniel_simon.mp3` with `.wav`
-
moto authored
In #779, we plan to remove `frames_per_chunk` parameter from `save` function, but it will take some time before we can land #779, so we go ahead and remove the parameter first to reduce the conflict caused by interface change.
-
moto authored
-
- 13 Jul, 2020 1 commit
-
-
engineerchuan authored
* Change 'sox' to 'default'
-
- 12 Jul, 2020 1 commit
-
-
engineerchuan authored
* converted CommonVoice tartar mp3 to wav using rate 8000 Hz * Remove Unused dtmf_30s_stereo.mp3
-
- 08 Jul, 2020 3 commits
-
-
Artyom Astafurov authored
* add Waveforms for Testing Purposes section * Update test/README.md use wrapper function for scipy.io.wavfile.read Co-authored-by:
moto <855818+mthrok@users.noreply.github.com> * remove un-used files from the doc * Update test/README.md Rename variable Co-authored-by:
moto <855818+mthrok@users.noreply.github.com> * fix indent; remove mentions of unused files * remove whitenoise* files from README.md Co-authored-by:
moto <855818+mthrok@users.noreply.github.com>
-
moto authored
-
jimchen90 authored
* upsamplenetwork * update variable names * update variable name * add wavernn model * update test * update format * update format * update format * fix conflicts and add transpose * import update * update transpose * update format * update docstring * add n_channel in input * add comment * update docstring * update docstring Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 06 Jul, 2020 2 commits
- 01 Jul, 2020 6 commits
-
-
moto authored
-
moto authored
-
moto authored
-
jimchen90 authored
* upsamplenetwork * update name * update name and docstring * update format * rebase * update docstring * update docstring * remove transpose and update docstring Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
moto authored
This is a part of PRs to add new "sox_io" backend. #726 and depends on #718, #728 and #731. This PR adds `save` function to "sox_io" backend, which can save Tensor to a file with the following audio formats; - `wav` - `mp3` - `flac` - `ogg/vorbis`
-
moto authored
* Use cmake for third party * Apply patch to libmad * Update gitignore * Update docker test image
-
- 30 Jun, 2020 1 commit
-
-
Artyom Astafurov authored
-
- 29 Jun, 2020 1 commit
-
-
jimchen90 authored
* update varible names and docstring * update format * update docsting and output value Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 26 Jun, 2020 2 commits
-
-
moto authored
-
lbjcom authored
* Update functional.py rollback torch.norm() in spectrogram() to v0.4.0. * Update functional.py comment out `spec_f = complex_norm(spec_f, power=power)`. * fixed complex_norm() instead of spectrogram() for torch.norm() issue. * lint Co-authored-by:
bongjin.lee <bongjin.lee@navercorp.com> Co-authored-by:
Vincent QB <vincentqb@users.noreply.github.com>
-
- 25 Jun, 2020 2 commits
-
-
moto authored
This is a part of PRs to add new "sox_io" backend. #726 and depends on #718 and #728 . This PR adds `load` function to "sox_io" backend, which is tested on the following audio formats; - `wav` - `mp3` - `flac` - `ogg/vorbis` * By default, "sox_io" backend returns Tensor with `float32` dtype and the shape of `[channel, time]`. The samples are normalized to fit in the range of `[-1.0, 1.0]`. Unlike existing "sox" backend, the new `load` function can handle WAV file natively, when the input format is WAV with integer type, (such as 32-bit signed integer, 16-bit signed integer and 8-bit unsigned integer) by providing `normalize=False`, this function can return integer Tensor, where the samples are expressed within the whole range of the corresponding dtype, that is, `int32` tensor for `32-bit PCM`, `int16` for `16-bit PCM` and `uint8` for `8-bit PCM`. This behavior follows [scipy.io.wavfile.read](https://docs.scipy.org/doc/scipy/reference/generated/scipy.io.wavfile.read.html). `normalize` parameter has no effect for other formats and the load function always return normalized value with `float32` Tensor. __* Note__ The current binary distribution of torchaudio does not contain `ogg/vorbis` and `opus` codecs. To handle these files, one needs to build torchaudio from the source with proper codecs in the system. __Note 2__ Since this PR, `scipy` becomes required module for running test.
-
moto authored
-
- 24 Jun, 2020 2 commits
-
-
Eli Uriegas authored
* packaging: Add test channels to pytorch dependency resolution Signed-off-by:
Eli Uriegas <eliuriegas@fb.com> * .circleci: Add test channel to smoke tests Signed-off-by:
Eli Uriegas <eliuriegas@fb.com> * .circleci: Put pytorch-test into a higher priority pytorch-nightly was getting prioritized over pytorch-nightly which shouldn't be the case Signed-off-by:
Eli Uriegas <eliuriegas@fb.com>
-
Eli Uriegas authored
Signed-off-by:Eli Uriegas <eliuriegas@fb.com>
-
- 23 Jun, 2020 5 commits
-
-
moto authored
-
moto authored
In #728, linux unit test switches to libsox provided by apt. For CPU jobs this is fine because all the job steps share the same Docker container, but on CPU job, each job step runs a script in a new Docker container, so libsox installed in a step is not available to the subsequent steps. To fix this, this PR moves the installation of libsox and sox to Docker build.
-
moto authored
This PR refactors cache generation mechanism by introducing dedicated command and bust cache on daily basis. At this moment, Windows unittest job for 3.6 and 3.7 are broken because of broken scipy but the environment is cached this persists until the next week. As we have nightly build, we do not need to keep cache for one week.
-
jimchen90 authored
* add unittest in test_models * update test method * remove unittest main function Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
moto authored
This PR fixes the wrong member name of SignalInfo introduced in #718. - `num_samples` == `num_frames` * `num_channels`.
-