Commits · 68f6a6a05a5a25fbb7d5d11fde5bf9248e6bc3de · OpenDAS / Torchaudio

23 Jul, 2020 2 commits
- Make GTZAN dataset sorted and use on-the-fly data in GTZAN test (#819) · 68f6a6a0
  moto authored Jul 23, 2020
  
  68f6a6a0
- Refactor datasets test (#817) · 3cdcd7ba
  moto authored Jul 23, 2020
  
  3cdcd7ba
22 Jul, 2020 7 commits

Replace sox_io save/load with sox effects chain in C++ (#779) · 0406d30d
moto authored Jul 22, 2020
```
* Replace save/load function with sox effects chain
```
0406d30d
Get rid of whitenoise_1min.mp3 (#813) · 0812f22a
moto authored Jul 22, 2020
```
Part of #764
```
0812f22a
[BC Breaking] Split `list_formats()` for read and write (#811) · f16f74af
moto authored Jul 22, 2020
```
* Separate sox list format function for read and write

* Guard MP3 smoke test
```
f16f74af
Remove unused files from CCI (#809) · d346cacb
moto authored Jul 22, 2020

d346cacb
Rename test_case_utils to case_utils (#808) · 00cc000e
moto authored Jul 22, 2020
```
Buck gets confused by utility modules whose names start with the `test_` prefix.
```
00cc000e

Refactor test_sox_effects (#805) · 05746042

moto authored Jul 22, 2020

1. Move misplaced sox compatibility tests (T.Fade, T.Vol, T.Vad) to test/test_sox_compatibility.py
2. Move test_sox_effects to test/sox_effect/, where the rest of the functionality from torchaudio.sox_effects is tested

05746042

Add smoke tests to sox_io and sox_effects (#806) · daa0007a

moto authored Jul 22, 2020

Currently, all the tests in `sox_io_backend` and `sox_effects` (for the new SoX effects implementation) require the additional `sox` command, which prevents running the tests in environments where the `sox` command is not available even though the `torchaudio` extension is (such as fb internal). This PR adds smoke tests for these modules, which simply run the functions to check that they do not crash.
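
As a rough illustration of what "smoke test" means here, the sketch below runs a function and passes as long as the call does not raise. It is not the test code added in this PR: `list_formats_stub` is a hypothetical stand-in for an extension function, and `run_smoke_tests` is a helper invented for this example.

```python
import unittest


def list_formats_stub():
    """Hypothetical stand-in for an extension function under test."""
    return ["wav", "flac"]


class SmokeTest(unittest.TestCase):
    """Runs a function and only checks that it does not raise."""

    def test_list_formats_runs(self):
        # No assertion on the returned value: a smoke test passes as long
        # as the call completes without an exception.
        list_formats_stub()


def run_smoke_tests():
    """Run the suite programmatically and report whether it passed."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(SmokeTest)
    result = unittest.TextTestRunner(verbosity=0).run(suite)
    return result.wasSuccessful()
```

Because nothing is compared against a reference produced by the `sox` command, a test of this shape can run wherever the module itself can be imported.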

daa0007a

21 Jul, 2020 2 commits

Remove if __name__ == __main__ from test code (#804) · 3781cb23
top0coder authored Jul 21, 2020
```
Co-authored-by: Jeff Zhang <jeffzhang@fb.com>
```
3781cb23

Add wavernn example pipeline (#749) · fac1bba9

jimchen90 authored Jul 21, 2020

* Add WaveRNN example

This is the pipeline example based on the [WaveRNN model](https://github.com/pytorch/audio/pull/735) in torchaudio. The design of this pipeline is inspired by [#632](https://github.com/pytorch/audio/pull/632). It offers a standardized implementation of the WaveRNN vocoder in torchaudio.

* Add utils and readme

The metric logger is added based on the Wav2letter pipeline [#632](https://github.com/pytorch/audio/pull/632). It provides a way to parse the standard output, as described in the readme.

* Add channel dimension

A channel dimension is added to the waveform in the datasets to match the input dimensions of the WaveRNN model, because channel dimensions were added to the waveform and spectrogram in [this part](https://github.com/pytorch/audio/blob/master/torchaudio/models/_wavernn.py#L281) of the WaveRNN model.

* Update data split and transform

The design of the dataset structure is discussed in [this comment](https://github.com/pytorch/audio/pull/749#discussion_r454627027). The dataset file now has a clearer workflow because it uses the random-split function instead of walking through all the files. All transform functions are grouped inside the transforms block.
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

fac1bba9

20 Jul, 2020 4 commits

Update documentation and fix docstrings (#788) · 2381dd89

moto authored Jul 20, 2020

- Addresses #549 #638 #786 
- Add `torchaudio` top level module doc
- Separate `torchaudio` top level module doc from `index.html`
- Add `backend` module doc.
- Remove `-> None` from function signature as it adds noise to documentation
- Changed the argument name of `torchaudio.backend.sox_io_backend.save` from `tensor` to `src`, so that it matches the rest of the backends.
- Tweak a bunch of docstrings

2381dd89

Fix output type of upsampling (#801) · 937d52f8

jimchen90 authored Jul 20, 2020



Fix output type of upsampling
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

937d52f8

Update default form in docstring (#802) · e82cc350

jimchen90 authored Jul 20, 2020



* Update default form in docstring
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

e82cc350

Add LibriTTS dataset (#790) · 4b8aad7a

jimchen90 authored Jul 20, 2020



* Add libritts

Add LibriTTS dataset draft

* Add libritts

Use two separate ids for utterance_id.

* Update output form

Use full_id as utterance_id.

* Update format

Add space and test black format

* Update test method

* Add audio and text test

Generate audio and test files on-the-fly in test 

* Update format

* Fix test error and remove assets libritts

The test error is fixed by sorting the files by the 4th element of each sample instead of the 2nd. Since the files are now generated on-the-fly, the libritts files in assets are removed.

* Add seed in `get_whitenoise` function

* Change utterance to text

Change `_utterance` to `_text`.
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>
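
The point of seeding the noise generator is that test data generated on-the-fly stays reproducible between runs. The following is a minimal sketch of that idea using only the standard library. `make_whitenoise` is a hypothetical helper written for this example; it is not the `get_whitenoise` function from the test utilities, whose signature is not shown in this log.

```python
import random


def make_whitenoise(num_samples, seed=0):
    """Return `num_samples` uniform noise values in [-1.0, 1.0].

    Hypothetical helper: a dedicated `random.Random(seed)` instance keeps
    the output independent of the global random state, so the same seed
    always yields the same samples.
    """
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(num_samples)]
```

With a fixed seed, two calls produce identical data, so expected values in a test do not drift from run to run.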

4b8aad7a

17 Jul, 2020 2 commits

Update variable names in wavernn model (#797) · 209858ea

jimchen90 authored Jul 17, 2020



* Change the names of n_output and n_hidden

* Replace the mode by n_classes and sample_rate

* Change the definition of n_output and n_hidden
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

209858ea

Changed GTZAN so that it only traverses filenames belonging to the dataset (#791) · 47eb1e6a

Emmanouil Theofanis Chourdakis authored Jul 17, 2020

* Addressed review issues in PR #668

* Changed GTZAN so that it only traverses filenames belonging to the dataset

Now, instead of walking the whole directory and subdirectories of the dataset, GTZAN only looks for files matching the `genre`/`genre`.`5 digit number`.wav format, where `genre` is an allowed GTZAN genre label.
This allows moving or removing files from the dataset (e.g. for fixing duplication or mislabeling issues).
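
The filename rule described above can be sketched with a regular expression. This is an illustrative sketch, not the code from this commit: `is_gtzan_file` and `filter_gtzan_files` are hypothetical names, and the paths are assumed to be POSIX-style and relative to the dataset root.

```python
import re

# The ten genre labels of the GTZAN dataset.
GTZAN_GENRES = (
    "blues", "classical", "country", "disco", "hiphop",
    "jazz", "metal", "pop", "reggae", "rock",
)

# Matches "<genre>/<genre>.<5 digits>.wav". The backreference forces the
# directory name and the filename prefix to be the same genre.
_GTZAN_PATTERN = re.compile(
    r"^(?P<genre>" + "|".join(GTZAN_GENRES) + r")/(?P=genre)\.\d{5}\.wav$"
)


def is_gtzan_file(relative_path):
    """Return True if a relative path follows the GTZAN layout."""
    return _GTZAN_PATTERN.match(relative_path) is not None


def filter_gtzan_files(relative_paths):
    """Keep only dataset files, sorted for a deterministic order."""
    return sorted(p for p in relative_paths if is_gtzan_file(p))
```

For example, `blues/blues.00000.wav` is accepted, while `blues/rock.00000.wav` or a stray `notes/readme.txt` is ignored, so extra or relocated files do not end up in the sample list.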

47eb1e6a

16 Jul, 2020 4 commits

Generate YESNO dataset on-the-fly for test (#792) · 102174e9
moto authored Jul 16, 2020

102174e9

Get rid of whitenoise and sinewave files from test (#783) · 02b898ff

engineerchuan authored Jul 16, 2020



* Get rid of sine wave files and whitenoise files
* Refactor integer encoding
* Relax rtol from 1e-8 to 1e-7 for compliance kaldi
* relax waveform multi channel resample atol to 1e-7 from 1e-8
* relax tolerance for length consistency for speed effect
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>
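
To make the tolerance changes listed above concrete, the sketch below shows a value pair that fails a relative tolerance of 1e-8 but passes 1e-7. It uses `math.isclose` from the standard library as a stand-in; the closeness check actually used by the test suite may be defined differently, and `within_rtol` is a name made up for this example.

```python
import math


def within_rtol(actual, expected, rtol):
    """Stand-in check: |actual - expected| <= rtol * max(|actual|, |expected|)."""
    return math.isclose(actual, expected, rel_tol=rtol, abs_tol=0.0)


expected = 1.0
actual = 1.00000005  # differs from `expected` by roughly 5e-8

strict = within_rtol(actual, expected, rtol=1e-8)   # tolerance before the change
relaxed = within_rtol(actual, expected, rtol=1e-7)  # tolerance after the change
```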

02b898ff

Add deprecation warnings to SoxEffect and SoxEffectsChain (#787) · 8181a83b
moto authored Jul 16, 2020

8181a83b

Add Torchscript sox effects (#760) · 60a8e23d

moto authored Jul 15, 2020

* Add sox_utils module

* Make init/shutdown thread safe

* Add sox effects implementation

* Add test for sox effects

* Update docstrings and add examples

60a8e23d

14 Jul, 2020 5 commits

Do not use SoxEffectsChain in sox compatibility test (#781) · db8f2bf3

moto authored Jul 14, 2020

This PR replaces `torchaudio.sox_effects.SoxEffectsChain` in `test_sox_compatibility` with the bare `sox` command.

The parity of `torchaudio.sox_effects.SoxEffectsChain` against the `sox` command is not tested, and the class has known issues (https://github.com/pytorch/audio/issues/771), so it is not appropriate to use it for testing other functions.

db8f2bf3

Skip lowpass_speed on macOS (#782) · 131e48b6

moto authored Jul 14, 2020

`test/test_sox_effects.py::Test_SoxEffectsChain::test_lowpass_speed` has an issue on our macOS CI, even though there was no issue in #777.

While we figure out the fix, we disable this test for macOS.

131e48b6

Stop using whitenoise.wav, mp3 and torchaudio.load in sox effect test · d11ad6bb

engineerchuan authored Jul 14, 2020

Part of #764

 - Replace `whitenoise.wav` with on-the-fly data generation
 - Replace `torchaudio.load` with `common_utils.load_wav`
 - Replace `steam-train-whistle-daniel_simon.mp3` with `.wav`
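
As a rough sketch of what "on-the-fly data generation" can look like, the example below writes a short sine wave to a temporary WAV file with the standard library and reads its header back. The helper names (`write_sine_wav`, `read_wav_info`, `demo`) are hypothetical and are not the `common_utils` functions referenced in this commit.

```python
import math
import os
import struct
import tempfile
import wave


def write_sine_wav(path, num_frames, sample_rate=8000, freq=440.0):
    """Write a mono 16-bit PCM sine wave with `num_frames` samples."""
    frames = bytearray()
    for n in range(num_frames):
        value = math.sin(2.0 * math.pi * freq * n / sample_rate)
        # Scale [-1.0, 1.0] to the signed 16-bit range, little-endian.
        frames += struct.pack("<h", int(value * 32767))
    with wave.open(path, "wb") as f:
        f.setnchannels(1)
        f.setsampwidth(2)
        f.setframerate(sample_rate)
        f.writeframes(bytes(frames))


def read_wav_info(path):
    """Return (sample_rate, num_frames, num_channels) of a WAV file."""
    with wave.open(path, "rb") as f:
        return f.getframerate(), f.getnframes(), f.getnchannels()


def demo():
    """Generate the test file in a temporary directory and inspect it."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "sine.wav")
        write_sine_wav(path, num_frames=800, sample_rate=8000)
        return read_wav_info(path)
```

Generating the file inside the test removes the need to keep a binary asset in the repository, and the temporary directory is cleaned up when the test finishes.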

d11ad6bb

Remove frames_per_chunk argument from save (#780) · 4b3e9052

moto authored Jul 14, 2020

In #779, we plan to remove the `frames_per_chunk` parameter from the `save` function. Since it will take some time before #779 can land, we remove the parameter first to reduce the conflicts caused by the interface change.

4b3e9052

Add macOS CPU unittest (#777) · f6dc2f67
moto authored Jul 14, 2020

f6dc2f67

13 Jul, 2020 1 commit
- Use default backend for TestCommonVoice (#775) · c9142fd5
  engineerchuan authored Jul 13, 2020
```
* Change 'sox' to 'default'
```
  c9142fd5
12 Jul, 2020 1 commit
- Convert CommonVoice test asset to wav, and remove unused test asset (#772) · 26941fa3
  engineerchuan authored Jul 11, 2020
```
* converted CommonVoice tartar mp3 to wav using rate 8000 Hz

* Remove Unused dtmf_30s_stereo.mp3
```
  26941fa3
08 Jul, 2020 3 commits

Add Waveforms for Testing Purposes section to test/README.md (#759) · c375490f

Artyom Astafurov authored Jul 08, 2020



* add Waveforms for Testing Purposes section

* Update test/README.md

use wrapper function for scipy.io.wavfile.read
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

* remove un-used files from the doc

* Update test/README.md

Rename variable
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

* fix indent; remove mentions of unused files

* remove whitenoise* files from README.md
Co-authored-by: moto <855818+mthrok@users.noreply.github.com>

c375490f

Get rid of typedefs/SignalInfo and replace AudioMetaData (#761) · 180ede8e
moto authored Jul 08, 2020

180ede8e

Add WaveRNN Model (#735) · 68cc72da

jimchen90 authored Jul 07, 2020



* upsamplenetwork

* update variable names

* update variable name

* add wavernn model

* update test

* update format

* update format

* update format

* fix conflicts and add transpose

* import update

* update transpose

* update format

* update docstring

* add n_channel in input

* add comment

* update docstring

* update docstring
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

68cc72da

06 Jul, 2020 2 commits
- Fix CI: Cannot uninstall a distutils installed project (#766) · ad7f43fe
  moto authored Jul 06, 2020
```
* Pin llvmlite

* Add comments
```
  ad7f43fe
- Replace torchaudio.load in test with scipy func (#762) · e43ee196
  moto authored Jul 06, 2020
  
  e43ee196
01 Jul, 2020 6 commits

Add sox_io_backend (#726) · 4b583eab
moto authored Jul 01, 2020

4b583eab
Add opus support (#755) · 894959a7
moto authored Jul 01, 2020

894959a7
Refactor test utilities (#756) · a20da5e3
moto authored Jul 01, 2020

a20da5e3

UpsampleNetwork (#724) · 6b159054

jimchen90 authored Jul 01, 2020



* upsamplenetwork

* update name

* update name and docstring

* update format

* rebase

* update docstring

* update docstring

* remove transpose and update docstring
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

6b159054

Add TorchScript-able "save" func to sox_io backend (#732) · 3324283c

moto authored Jul 01, 2020

This is part of the series of PRs adding the new "sox_io" backend (#726), and it depends on #718, #728 and #731.

This PR adds a `save` function to the "sox_io" backend, which can save a Tensor to a file in the following audio formats:
 - `wav`
 - `mp3`
 - `flac`
 - `ogg/vorbis`

3324283c

Use cmake for third party (#753) · ea42513f

moto authored Jul 01, 2020

* Use cmake for third party

* Apply patch to libmad

* Update gitignore

* Update docker test image

ea42513f

30 Jun, 2020 1 commit
- add probot (#737) · d71661aa
  Artyom Astafurov authored Jun 30, 2020
  
  d71661aa