Commits · 937d52f87de157aec6fbd5bccde962b468201ebb · OpenDAS / Torchaudio

"vscode:/vscode.git/clone" did not exist on "b66a85ae6c344b838e90d65740e68051ea69ffc8"

20 Jul, 2020 3 commits

Fix output type of upsampling (#801) · 937d52f8

jimchen90 authored Jul 20, 2020



Fix output type of upsampling
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

937d52f8

Update default form in docstring (#802) · e82cc350

jimchen90 authored Jul 20, 2020



* Update default form in docstring
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

e82cc350

Add LibriTTS dataset (#790) · 4b8aad7a

jimchen90 authored Jul 20, 2020



* Add libritts

Add LibriTTS dataset draft

* Add libritts

Use two separate ids for utterance_id.

* Update output form

Use full_id as utterance_id.

* Update format

Add space and test black format

* Update test method

* Add audio and text test

Generate audio and test files on-the-fly in test 

* Update format

* Fix test error and remove assets libritts

The test error is fixed by sorting the file in 4th element instead of 2nd element in samples. Since the files are generated on-the-fly, so the the libritts files in assets are removed.

* Add seed in `get_whitenoise` function

* Change utterance to text

Change `_utterance` to `_text`.
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

4b8aad7a

17 Jul, 2020 2 commits

Update variable names in wavernn model (#797) · 209858ea

jimchen90 authored Jul 17, 2020



* Change the name of  n_output and n_hidden

* Replace the mode by n_classes and sample_rate

* Change the definition of n_output and n_hidden
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

209858ea

Changed GTZAN so that it only traverses filenames belonging to the dataset (#791) · 47eb1e6a

Emmanouil Theofanis Chourdakis authored Jul 17, 2020

* Addressed review issues in PR #668

* Changed GTZAN so that it only traverses filenames belonging to the dataset

Now, instead of walking the whole directory and subdirectories of the dataset
GTZAN only looks for files under a `genre`/`genre`.`5 digit number`.wav format, where `genre` is an allowed GTZAN genre label.
This allows moving or removing files from the dataset (e.g. for fixing duplication or mislabeling issues).

47eb1e6a

16 Jul, 2020 4 commits

Generate YESNO dataset on-the-fly for test (#792) · 102174e9
moto authored Jul 16, 2020

102174e9

Get rid of whitenoise and sinewave files from test (#783) · 02b898ff

engineerchuan authored Jul 16, 2020



* Get rid of sine wave files and whitenoise files
* Refactor integer encoding
* Relax rtol from 1e-8 to 1e-7 for compliance kaldi
* relax waveform multi channel resample atol to 1e-7 from 1e-8
* relax tolerance for length consistency for speed effect
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

02b898ff

Add deprication warnings to SoxEffect and SoxEffectsChain (#787) · 8181a83b
moto authored Jul 16, 2020

8181a83b

Add Torchscript sox effects (#760) · 60a8e23d

moto authored Jul 15, 2020

* Add sox_utils module

* Make init/shutdown thread safe

* Add sox effects implementation

* Add test for sox effects

* Update docstrings and add examples

60a8e23d

14 Jul, 2020 5 commits

Do not use SoxEffectsChain in sox compatibility test (#781) · db8f2bf3

moto authored Jul 14, 2020

This PR replaces `torchaudio.sox_effects.SoxEffectsChain` in `test_sox_compatibility` with bare `sox` command.

The parity of `torchaudio.sox_effects.SoxEffectsChain` against `sox` command is not tested and it has known issues https://github.com/pytorch/audio/issues/771, therefore it is not appropriate to use this class for testing other functions.

db8f2bf3

Skip lowpass_speed on macOS (#782) · 131e48b6

moto authored Jul 14, 2020

`test/test_sox_effects.py::Test_SoxEffectsChain::test_lowpass_speed` has some issue on our macOS CI, even though there was no issue at #777 .

While we figure out the fix, we disable this test for macOS.

131e48b6

Stop using whitenoise.wav, mp3 and torchaudio.load in sox effect test · d11ad6bb

engineerchuan authored Jul 14, 2020

Part of #764

 - Replace `whitenoise.wav` with on-the-fly data generation
 - Replace `torchaudio.load` with `common_utils.load_wav`
 - Replace `steam-train-whistle-daniel_simon.mp3` with `.wav`

d11ad6bb

Remove frames_per_chunk argument from save (#780) · 4b3e9052

moto authored Jul 14, 2020

In #779, we plan to remove `frames_per_chunk` parameter from `save` function, but it will take some time before we can land #779, so we go ahead and remove the parameter first to reduce the conflict caused by interface change.

4b3e9052

Add macOS CPU unittest (#777) · f6dc2f67
moto authored Jul 14, 2020

f6dc2f67

13 Jul, 2020 1 commit
- Use default backend for TestCommonVoice (#775) · c9142fd5
  engineerchuan authored Jul 13, 2020
```
* Change 'sox' to 'default'
```
  c9142fd5
12 Jul, 2020 1 commit
- Convert CommonVoice test asset to wav, and remove unused test asset (#772) · 26941fa3
  engineerchuan authored Jul 11, 2020
```
* converted CommonVoice tartar mp3 to wav using rate 8000 Hz

* Remove Unused dtmf_30s_stereo.mp3
```
  26941fa3
08 Jul, 2020 3 commits

Add Waveforms for Testing Purposes section to test/README.md (#759) · c375490f

Artyom Astafurov authored Jul 08, 2020



* add Waveforms for Testing Purposes section

* Update test/README.md

use wrapper function for scipy.io.wavfile.read
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

* remove un-used files from the doc

* Update test/README.md

Rename variable
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

* fix indent; remove mentions of unused files

* remove whitenoise* files from README.md
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

c375490f

Get rid of typedefs/SignalInfo and replace AudioMetaData (#761) · 180ede8e
moto authored Jul 08, 2020

180ede8e

Add WaveRNN Model (#735) · 68cc72da

jimchen90 authored Jul 07, 2020



* upsamplenetwork

* update variable names

* update variable name

* add wavernn model

* update test

* update format

* update format

* update format

* fix conflicts and add transpose

* import update

* update transpose

* update format

* update docstring

* add n_channel in input

* add comment

* update docstring

* update docstring
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

68cc72da

06 Jul, 2020 2 commits
- Fix CI: Cannot uninstall a distutils installed project (#766) · ad7f43fe
  moto authored Jul 06, 2020
```
* Pin llvmlite

* Add comments
```
  ad7f43fe
- Replace torchaudio.load in test with scipy func (#762) · e43ee196
  moto authored Jul 06, 2020
  
  e43ee196
01 Jul, 2020 6 commits

Add sox_io_backend (#726) · 4b583eab
moto authored Jul 01, 2020

4b583eab
Add opus support (#755) · 894959a7
moto authored Jul 01, 2020

894959a7
Refactor test utilities (#756) · a20da5e3
moto authored Jul 01, 2020

a20da5e3

UpsampleNetwork (#724) · 6b159054

jimchen90 authored Jul 01, 2020



* upsamplenetwork

* update name

* update name and docstring

* update format

* rebase

* update docstring

* update docstring

* remove transpose and update docstring
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

6b159054

Add TorchScript-able "save" func to sox_io backend (#732) · 3324283c

moto authored Jul 01, 2020

This is a part of PRs to add new "sox_io" backend. #726 and depends on #718, #728 and #731.

This PR adds `save` function to "sox_io" backend, which can save Tensor to a file with the following audio formats;
 - `wav`
 - `mp3`
 - `flac`
 - `ogg/vorbis`

3324283c

Use cmake for third party (#753) · ea42513f

moto authored Jul 01, 2020

* Use cmake for third party

* Apply patch to libmad

* Update gitignore

* Update docker test image

ea42513f

30 Jun, 2020 1 commit
- add probot (#737) · d71661aa
  Artyom Astafurov authored Jun 30, 2020
  
  d71661aa
29 Jun, 2020 1 commit

Update MelResNet (#751) · 878d3dac

jimchen90 authored Jun 29, 2020



* update varible names and docstring

* update format

* update docsting and output value
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

878d3dac

26 Jun, 2020 2 commits

Add vorbis to binary build (#750) · 4daf2fb7
moto authored Jun 26, 2020

4daf2fb7

rollback torch.norm() in spectrogram() (#747) · 66f4cdf9

lbjcom authored Jun 26, 2020



* Update functional.py

rollback torch.norm() in spectrogram() to v0.4.0.

* Update functional.py

comment out `spec_f = complex_norm(spec_f, power=power)`.

* fixed complex_norm() instead of spectrogram() for torch.norm() issue.

* lint
Co-authored-by: bongjin.lee <bongjin.lee@navercorp.com>
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

66f4cdf9

25 Jun, 2020 2 commits

Add load function (#731) · 793eeab8

moto authored Jun 25, 2020

This is a part of PRs to add new "sox_io" backend. #726 and depends on #718 and #728 .

This PR adds `load` function to "sox_io" backend, which is  tested on the following audio formats;
 - `wav`
 - `mp3`
 - `flac`
 - `ogg/vorbis` *

By default, "sox_io" backend returns Tensor with `float32` dtype and the shape of `[channel, time]`. The samples are normalized to fit in the range of `[-1.0, 1.0]`.

Unlike existing "sox" backend, the new `load` function can handle WAV file natively, when the input format is WAV with integer type, (such as 32-bit signed integer, 16-bit signed integer and 8-bit unsigned integer) by providing `normalize=False`, this function can return integer Tensor, where the samples are expressed within the whole range of the corresponding dtype, that is, `int32` tensor for `32-bit PCM`, `int16` for `16-bit PCM` and `uint8` for `8-bit PCM`. This behavior follows [scipy.io.wavfile.read](https://docs.scipy.org/doc/scipy/reference/generated/scipy.io.wavfile.read.html). `normalize` parameter has no effect for other formats and the load function always return normalized value with `float32` Tensor.

__* Note__ The current binary distribution of torchaudio does not contain `ogg/vorbis` and `opus` codecs. To handle these files, one needs to build torchaudio from the source with proper codecs in the system.

__Note 2__ Since this PR, `scipy` becomes required module for running test.

793eeab8

Replace sox_effects init/list/shutdown with TS binding (#748) · 0f0d0af3
moto authored Jun 25, 2020

0f0d0af3

24 Jun, 2020 2 commits

Cherry picking changes for release improvements (#746) · 0c847aaa

Eli Uriegas authored Jun 24, 2020



* packaging: Add test channels to pytorch dependency resolution
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

* .circleci: Add test channel to smoke tests
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

* .circleci: Put pytorch-test into a higher priority

pytorch-nightly was getting prioritized over pytorch-nightly which
shouldn't be the case
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

0c847aaa

bump nightly versions to 0.7.0 (#745) · fd6e3b4a
Eli Uriegas authored Jun 24, 2020
```
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>
```
fd6e3b4a

23 Jun, 2020 5 commits

Pin SciPy version to 1.4.1 in Windows unittest job (#743) · 37e194f4
moto authored Jun 23, 2020

37e194f4

Bake libsox in test base Docker image (#739) · 80bfb28b

moto authored Jun 23, 2020

In #728, linux unit test switches to libsox provided by apt.
For CPU jobs this is fine because all the job steps share the same Docker container,
but on CPU job, each job step runs a script in a new Docker container, so
libsox installed in a step is not available to the subsequent steps.

To fix this, this PR moves the installation of libsox and sox to Docker build.

80bfb28b

Refactor Cache bust mechanism and bust on daily basis (#742) · d9e6ce45

moto authored Jun 23, 2020

This PR refactors cache generation mechanism by introducing dedicated command
and bust cache on daily basis.

At this moment, Windows unittest job for 3.6 and 3.7 are broken because of
broken scipy but the environment is cached this persists until the next week.

As we have nightly build, we do not need to keep cache for one week.

d9e6ce45

Add subclass in model test classes (#727) · b8ddeb35

jimchen90 authored Jun 23, 2020



* add unittest in test_models

* update test method

* remove unittest main function
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

b8ddeb35

Fix SignalInfo member name to frame (#734) · e0f4c0ec

moto authored Jun 23, 2020

This PR fixes the wrong member name of SignalInfo introduced in #718. 

 - `num_samples` == `num_frames` * `num_channels`.

e0f4c0ec