Commits · 457148ea932d9a1c622af73654e8d1200a256179 · OpenDAS / Torchaudio

26 Feb, 2021 1 commit
- Fixes #1314 (#1316) · 457148ea
  Vincent QB authored Feb 26, 2021
  
  457148ea
24 Feb, 2021 1 commit
- Remove legacy backends (#1311) · 33dc817c
  Prabhat Roy authored Feb 24, 2021
  
  33dc817c
17 Feb, 2021 1 commit
- Make the version a link to /versions.html (#1273) · fa71c5e2
  Matti Picus authored Feb 17, 2021
  
  fa71c5e2
12 Feb, 2021 1 commit
- Add compute_kaldi_pitch to doc (#1260) · 4f9b5520
  moto authored Feb 12, 2021
  
  4f9b5520
11 Feb, 2021 1 commit
- DOC Fix sphinx warnings and turn warnings into errors (#1247) · a7e93c15
  Nicolas Hug authored Feb 11, 2021
  
  a7e93c15
11 Jan, 2021 1 commit
- add doc for rnnt loss (#1171) · b57f05c4
  Vincent QB authored Jan 11, 2021
  
  b57f05c4
04 Dec, 2020 1 commit

[Doc] Add missing modules and minor fixes (#1022) · 2a02d7f5

Krishna Kalyan authored Dec 04, 2020



* Add griffinlim and DB_to_amplitude
* Fix Dataset docstring
* Fix other formatting
Co-authored-by: krishnakalyan3 <skalyan@cloudera.com>

2a02d7f5

06 Nov, 2020 1 commit
- [Doc] Group filtering in functinal.rst (#1005) · 4b4b8bf6
  moto authored Nov 06, 2020
  
  4b4b8bf6
05 Nov, 2020 1 commit
- [Doc] Remove sox_effects util from top level module (#1001) · 076052f1
  moto authored Nov 05, 2020
  
  076052f1
27 Oct, 2020 2 commits

Remove legacy sox effects (#977) · 0076ab07
moto authored Oct 27, 2020

0076ab07

Switch the default backend to the ones with new interfaces (#978) · fa2e4fd4

moto authored Oct 27, 2020

Refer to #903 for the overview of planned I/O changes.

* Change the default backend from `"sox"(deprecated)` to `"sox_io"`
* Change the default interface of `"soundfile"` backend to the one identical to `"sox_io"` backend.
* Deprecate torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE
* Update documentations
    * Re-order backends (default first)
    * Update overhaul timeline (removed 0.7.0)
    * Simplify `"soundfile"` backend description

fa2e4fd4

19 Oct, 2020 1 commit

Update index.rst (#968) · ba1698ba

Brian Johnson authored Oct 19, 2020

Adds introductory context and links to the PyTorch Libraries to audio docs.

ba1698ba

09 Oct, 2020 1 commit
- [doc] Update backend docstring/documentation (#935) · e17c2634
  moto authored Oct 09, 2020
  
  e17c2634
02 Oct, 2020 1 commit
- Update docstrings/documentations of all the datasets (#931) · e3d1d746
  moto authored Oct 02, 2020
  
  e3d1d746
01 Oct, 2020 1 commit
- Update model documentation (#933) · 1df9e201
  moto authored Oct 01, 2020
  
  1df9e201
15 Sep, 2020 1 commit
- Add tedlium dataset (#882) · 914a846d
  Jaime Ferrando Huertas authored Sep 15, 2020
  
  914a846d
20 Aug, 2020 1 commit

Update VCTK_092 interface and add tests (#875) · 2205cc9e

JianwuXu authored Aug 20, 2020

* Tweak docstring, audio_ext, load method signature and constructor of VCTK_092

* Add test for VCTK_092 dataset.

2205cc9e

19 Aug, 2020 1 commit

Add VCTK_092 dataset (#812) · 4bfebd85

Abhishek Dubey authored Aug 19, 2020



* Added version 0.92 of VCTK dataset
Signed-off-by: Abhishek Dubey <abhi.dubey011999@gmail.com>

4bfebd85

30 Jul, 2020 1 commit

Remove istft (#841) · dab7f64b

Jeremy Chen authored Jul 30, 2020



* `istft` has been migrated to `pytorch`, and `torchaudio.functional.istft` has been deprecated in 0.6.0 release. This PR removes it
Co-authored-by: Jeremy Chen <jeremyyy@fb.com>

dab7f64b

29 Jul, 2020 1 commit
- Add model name in docs (#836) · de1cb83d
  jimchen90 authored Jul 29, 2020
```
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>
```
  de1cb83d
20 Jul, 2020 2 commits

Update documentation and fix docstrings (#788) · 2381dd89

moto authored Jul 20, 2020

- Addresses #549 #638 #786 
- Add `torchaudio` top level module doc
- Separate `torchaudio` top level module doc from `index.html`
- Add `backend` module doc.
- Remove `-> None` from function signature as it adds noise to documentation
- Changed function argument name of `torchaudio.backend.sox_io_backend.save` from `tensor` to `src`, so that it matches with the reset of backends.
- Tweak bunch of docstrings

2381dd89

Add LibriTTS dataset (#790) · 4b8aad7a

jimchen90 authored Jul 20, 2020



* Add libritts

Add LibriTTS dataset draft

* Add libritts

Use two separate ids for utterance_id.

* Update output form

Use full_id as utterance_id.

* Update format

Add space and test black format

* Update test method

* Add audio and text test

Generate audio and test files on-the-fly in test 

* Update format

* Fix test error and remove assets libritts

The test error is fixed by sorting the file in 4th element instead of 2nd element in samples. Since the files are generated on-the-fly, so the the libritts files in assets are removed.

* Add seed in `get_whitenoise` function

* Change utterance to text

Change `_utterance` to `_text`.
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

4b8aad7a

16 Jul, 2020 1 commit

Add Torchscript sox effects (#760) · 60a8e23d

moto authored Jul 15, 2020

* Add sox_utils module

* Make init/shutdown thread safe

* Add sox effects implementation

* Add test for sox effects

* Update docstrings and add examples

60a8e23d

10 Jun, 2020 1 commit

Add cmu_arctic dataset (#710) · 55b5c80c

jimchen90 authored Jun 10, 2020



* Add cmu_arctic dataset

* add dataset name

* update audio test file with whitenoise.wav file

* add test text file

* update text method and file name

* update comment

* change datasets order in doc

* add line length
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

55b5c80c

03 Jun, 2020 1 commit

Add Bass with Biquad (#661) · a466b3c2

jimchen90 authored Jun 03, 2020



* Add bass with biquad

* Update functional.py

Add the normalization coefficients

* Update test_sox_compatibility.py

In test_sox_compatibility.py file, I add two bass tests: one test sets gain = 30, atol = 1e-4, the other sets gain = 40, atol = 1.5e-4. The details can be seen in pytorch#676

* Update torchscript_consistency_impl.py

Add torchscript test

* Add flake8 test
Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>

a466b3c2

02 Jun, 2020 2 commits

Added the popular GTZAN dataset: (#668) · b0367251

Emmanouil Theofanis Chourdakis authored Jun 03, 2020



* Added the popular GTZAN dataset:

* Added the GTZAN class in torchaudio.datasets using the same format as the rest of the datasets.
* Added the appropriate test function in test_datasets.py.
* Added the GTZAN class in the datasets.rst documentation file.

* Addressed review issues in PR #668

* Added dummy noise .wav in `test/assets/`
* Removed transforms of input and output from the dataset
  `__init__` function, as well as the corresponding methods.
* Replaced rendundant `filtered` and `subset` methods from
  class initialization and also changed the corresponding
  assertion message.

* Fixed E303: too many blank lines error

* Added GTZAN to __init__.__all__

* Fixed incorrectly not importing GTZAN

* removed duplicate warning

* lint
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

b0367251

Add flanger to functional.py (#651) · 9e27cf3d

Bhargav Kathivarapu authored Jun 02, 2020



* Add flanger to functional
Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>

* Add random seed
Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>

* fix flanger
Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>

* shape

* Change bool arguments to strings
Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>

* Refactor tests
Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

9e27cf3d

01 May, 2020 1 commit
- Add phaser to functional (#587) · 8e813596
  Bhargav Kathivarapu authored May 02, 2020
```
Signed-off-by: Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>
```
  8e813596
28 Apr, 2020 2 commits

Add model Wav2Letter (#462) · d678357f

Tomás Osório authored Apr 28, 2020

* add wav2letter model

* add unit_test to model

* add docstrings

* add documentation

* fix minor error, change logic on forward

* update padding same with ceil

* add inline typing and minor fixes to docstrings

* remove python2

* add formula do docstrings, change param name

* add test with mfcc, add pytest

* fix bug, update docstrings

* change parameter name

d678357f

Port sox::vad (#578) · 3ecc7016

Artyom Astafurov authored Apr 28, 2020

* initial test, stub function, transform and docstring

* add draft working implementation, update docstrings

* merge VadSate into Vad calss, move Channel into Vad class

* remove functional stub for vad

* add wav file for test

* refactor _measure() to improve performance

* rename argument

* replace copy_ with assignment

* refactor init, update documentation, update test for readability

* clean up default values

* move code from transforms.py to funtional.py and integrate state into a function

* remove Channel state class

* fix calcuation of a flush point

* make multiple channels work

* clean up multi-channel, update test

* rename variables and re-org arguments for _measure

* fix linting errors

* add torchscript consistency test and fix errors

* support and test batch consistency, fix normalization

* update documentation, switch torchscript consistancy test to use transform to improve coverage

* fix linting errors

* remove un-used imports

* address PR comments

* add doc references into rst

3ecc7016

27 Apr, 2020 1 commit
- Update documentation (#568) · 3012050d
  Vincent QB authored Apr 27, 2020
```
* formatting.

* update datasets.
```
  3012050d
22 Apr, 2020 1 commit

Add overdrive to functional (#569) · e7cb18c1

Bhargav Kathivarapu authored Apr 23, 2020



* Add overdrive to functional

* Minor change to overdrive

* Minor change to overdrive

* minor flake8 changes

* changes to make overdrive generic
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

e7cb18c1

20 Apr, 2020 1 commit

Add dcshift to functional (#558) · 91e59231

Bhargav Kathivarapu authored Apr 20, 2020

* Add dcshift to functional

* Doc string change and remove inplace clamp

* Minor Fix to dcshit and separate sox test refactoring

* Minor change to limiter_gain type

* adding dcshift to __all__ in functional

91e59231

17 Apr, 2020 1 commit

add cmvn (#540) · b42d6100

wanglong001 authored Apr 17, 2020



* add cmvn

* Update transforms.rst

add cmvn

* Correct the format

* Correct the format

* Correct the format

* add test unit and cmvn change to cmn

* fix bug
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

b42d6100

16 Apr, 2020 1 commit

Add contrast to functional (#551) · 8a742e0f

Bhargav Kathivarapu authored Apr 17, 2020

* Add contrast to functional

* add tests for contrast and update functional.rst

* Minor changes to sox and batch tests for contrast

8a742e0f

24 Mar, 2020 1 commit

Add Vol Transformation (#468) · 11fb22aa

Tomás Osório authored Mar 24, 2020

* Add Vol with gain_type amplitude

* add gain in db and add tests

* add gain_type "power" and tests

* add functional DB_to_amplitude

* simplify

* remove functional

* improve docstring

* add to documentation

11fb22aa

10 Mar, 2020 1 commit

Add fade (#449) · 9efc3503

Tomás Osório authored Mar 10, 2020



* add basics for Fade

* add fade possibilities: at start, end or both

* add different types of fade

* add docstrings, add overriding possibility

* remove unnecessary logic

* correct typing

* agnostic to batch size or n_channels

* add batch test to Fade

* add transform to options

* add test_script_module

* add coherency with test batch

* remove extra step for waveform_length

* update docstring

* add test to compare fade with sox

* change name of fade_shape

* update test fade vs sox with new nomenclature for fade_shape

* add Documentation
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

9efc3503

28 Feb, 2020 1 commit

Add test for InverseMelScale (#448) · babc24af

moto authored Feb 28, 2020



* Inverse Mel Scale Implementation

* Inverse Mel Scale Docs

* Better working version.

* GPU fix

* These shouldn't go on git..

* Even better one, but does not support JITability.

* Remove JITability test

* Flake8

* n_stft is a must

* minor clean up of initialization

* Add librosa consistency test

This PR follows up #366 and adds test for `InverseMelScale` (and `MelScale`) for librosa compatibility.

For `MelScale` compatibility test;
1. Generate spectrogram
2. Feed the spectrogram to `torchaudio.transforms.MelScale` instance
3. Feed the spectrogram to `librosa.feature.melspectrogram` function.
4. Compare the result from 2 and 3 elementwise.
Element-wise numerical comparison is possible because under the hood their implementations use the same algorith.

For `InverseMelScale` compatibility test, it is more elaborated than that.
1. Generate the original spectrogram
2. Convert the original spectrogram to Mel scale using `torchaudio.transforms.MelScale` instance
3. Reconstruct spectrogram using torchaudio implementation
3.1. Feed the Mel spectrogram to `torchaudio.transforms.InverseMelScale` instance and get reconstructed spectrogram.
3.2. Compute the sum of element-wise P1 distance of the original spectrogram and that from 3.1.
4. Reconstruct spectrogram using librosa
4.1. Feed the Mel spectrogram to `librosa.feature.inverse.mel_to_stft` function and get reconstructed spectrogram.
4.2. Compute the sum of element-wise P1 distance of the original spectrogram and that from 4.1. (this is the reference.)
5. Check that resulting P1 distance are in a roughly same value range.

Element-wise numerical comparison is not possible due to the difference algorithms used to compute the inverse. The reconstructed spectrograms can have some values vary in magnitude.
Therefore the strategy here is to check that P1 distance (reconstruction loss) is not that different from the value obtained using `librosa`. For this purpose, threshold was empirically chosen

```
print('p1 dist (orig <-> ta):', torch.dist(spec_orig, spec_ta, p=1))
print('p1 dist (orig <-> lr):', torch.dist(spec_orig, spec_lr, p=1))
>>> p1 dist (orig <-> ta): tensor(1482.1917)
>>> p1 dist (orig <-> lr): tensor(1420.7103)
```

This value can vary based on the length and the kind of the signal being processed, so it was handpicked.

* Address review feedbacks

* Support arbitrary batch dimensions.

* Add batch test

* Use view for batch

* fix sgd

* Use negative indices and update docstring

* Update threshold
Co-authored-by: Charles J.Y. Yoon <jaeyeun97@gmail.com>

babc24af

25 Feb, 2020 1 commit
- Add allpass filter to functional (#444) · 2cf59c41
  moto authored Feb 25, 2020
  
  2cf59c41
26 Dec, 2019 1 commit

Griffin-Lim Transformation Implementation (#365) · 4a934693

Charles J.Y. Yoon authored Dec 27, 2019



* Griffin-Lim Transformation Implementation

* Griffin-Lim Docs

* Remove f-string from backwards compatibility

* iSTFT is now jit-able.

* Comment changes

* Functional Implementation & now jitable

* flake8

* Doc & GPU Fix

* Librosa comparison test

* test directly griffinlim's output. tighter atol.

* matching signature to docstring.
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

4a934693