Commits · 11fb22aae80a60fd867ce87ff8795256b0f733cd · OpenDAS / Torchaudio

24 Mar, 2020 1 commit

Add Vol Transformation (#468) · 11fb22aa

Tomás Osório authored Mar 24, 2020

* Add Vol with gain_type amplitude

* add gain in db and add tests

* add gain_type "power" and tests

* add functional DB_to_amplitude

* simplify

* remove functional

* improve docstring

* add to documentation

11fb22aa

10 Mar, 2020 1 commit

Add fade (#449) · 9efc3503

Tomás Osório authored Mar 10, 2020



* add basics for Fade

* add fade possibilities: at start, end or both

* add different types of fade

* add docstrings, add overriding possibility

* remove unnecessary logic

* correct typing

* agnostic to batch size or n_channels

* add batch test to Fade

* add transform to options

* add test_script_module

* add coherency with test batch

* remove extra step for waveform_length

* update docstring

* add test to compare fade with sox

* change name of fade_shape

* update test fade vs sox with new nomenclature for fade_shape

* add Documentation
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

9efc3503

28 Feb, 2020 1 commit

Add test for InverseMelScale (#448) · babc24af

moto authored Feb 28, 2020



* Inverse Mel Scale Implementation

* Inverse Mel Scale Docs

* Better working version.

* GPU fix

* These shouldn't go on git..

* Even better one, but does not support JITability.

* Remove JITability test

* Flake8

* n_stft is a must

* minor clean up of initialization

* Add librosa consistency test

This PR follows up #366 and adds test for `InverseMelScale` (and `MelScale`) for librosa compatibility.

For `MelScale` compatibility test;
1. Generate spectrogram
2. Feed the spectrogram to `torchaudio.transforms.MelScale` instance
3. Feed the spectrogram to `librosa.feature.melspectrogram` function.
4. Compare the result from 2 and 3 elementwise.
Element-wise numerical comparison is possible because under the hood their implementations use the same algorith.

For `InverseMelScale` compatibility test, it is more elaborated than that.
1. Generate the original spectrogram
2. Convert the original spectrogram to Mel scale using `torchaudio.transforms.MelScale` instance
3. Reconstruct spectrogram using torchaudio implementation
3.1. Feed the Mel spectrogram to `torchaudio.transforms.InverseMelScale` instance and get reconstructed spectrogram.
3.2. Compute the sum of element-wise P1 distance of the original spectrogram and that from 3.1.
4. Reconstruct spectrogram using librosa
4.1. Feed the Mel spectrogram to `librosa.feature.inverse.mel_to_stft` function and get reconstructed spectrogram.
4.2. Compute the sum of element-wise P1 distance of the original spectrogram and that from 4.1. (this is the reference.)
5. Check that resulting P1 distance are in a roughly same value range.

Element-wise numerical comparison is not possible due to the difference algorithms used to compute the inverse. The reconstructed spectrograms can have some values vary in magnitude.
Therefore the strategy here is to check that P1 distance (reconstruction loss) is not that different from the value obtained using `librosa`. For this purpose, threshold was empirically chosen

```
print('p1 dist (orig <-> ta):', torch.dist(spec_orig, spec_ta, p=1))
print('p1 dist (orig <-> lr):', torch.dist(spec_orig, spec_lr, p=1))
>>> p1 dist (orig <-> ta): tensor(1482.1917)
>>> p1 dist (orig <-> lr): tensor(1420.7103)
```

This value can vary based on the length and the kind of the signal being processed, so it was handpicked.

* Address review feedbacks

* Support arbitrary batch dimensions.

* Add batch test

* Use view for batch

* fix sgd

* Use negative indices and update docstring

* Update threshold
Co-authored-by: Charles J.Y. Yoon <jaeyeun97@gmail.com>

babc24af

25 Feb, 2020 1 commit
- Add allpass filter to functional (#444) · 2cf59c41
  moto authored Feb 25, 2020
  
  2cf59c41
26 Dec, 2019 1 commit

Griffin-Lim Transformation Implementation (#365) · 4a934693

Charles J.Y. Yoon authored Dec 27, 2019



* Griffin-Lim Transformation Implementation

* Griffin-Lim Docs

* Remove f-string from backwards compatibility

* iSTFT is now jit-able.

* Comment changes

* Functional Implementation & now jitable

* flake8

* Doc & GPU Fix

* Librosa comparison test

* test directly griffinlim's output. tighter atol.

* matching signature to docstring.
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

4a934693

21 Nov, 2019 2 commits

Remove _docs.py (#349) · c74e580f

Vincent QB authored Nov 21, 2019

* since we no longer use decoration, this fixes #165.

* remove import of _docs.

c74e580f

Move augmentations in transforms (#348) · 99ed0521

Vincent QB authored Nov 21, 2019

* sync docs with functionals.

* Adding transforms to documentations. Moving augmentations in transforms.

99ed0521

18 Sep, 2019 1 commit

Make lfilter, and related filters, available (#275) · 8273c3f4

engineerchuan authored Sep 18, 2019

* Add basic low pass filtering
* Add highpass filtering
* More tests of IIR vs FIR
* Implement convolve function, add tests
* Move lfilter and convolve into functional, more tests
* added additional documentation for convolve and lfilter, renamed functional_filtering to functional_sox_convenience
* Follow naming convention for sample rate in functional
* fix failing vctk manifest test to account for adding more test audios into assets
* Adding documentation for lfilter, biquad, highpass_biquad, lowpass_biquad
* added matrix based implementation of lfilter
* adding python lfilter implementation
* factor out biquad, lowpass, highpass to sox compatibility

8273c3f4

16 Aug, 2019 1 commit
- Kaldi MFCC (#228) · a450cf81
  jamarshon authored Aug 16, 2019
  
  a450cf81
01 Aug, 2019 1 commit
- Removal of torchaudio.legacy · d8a47f4a
  jamarshon authored Aug 01, 2019
  
  d8a47f4a
29 Jul, 2019 1 commit
- Large re-amp on the torchaudio/docs (#166) · 95235f31
  jamarshon authored Jul 29, 2019
  
  95235f31
16 Jul, 2019 1 commit
- torch.functional Docs (#140) · 0902494e
  jamarshon authored Jul 16, 2019
  
  0902494e
11 Jul, 2019 1 commit
- Add Kaldi docs (#136) · 48707255
  jamarshon authored Jul 11, 2019
  
  48707255
22 May, 2019 2 commits
- Use common_utils to check for correct import in torchaudio/kaldi_io.py (#114) · 883f2428
  jamarshon authored May 22, 2019
  
  883f2428
- Add Kaldi IO as a dependency + put a wrapper to convert to Tensor + add test... · a422f3fe
  jamarshon authored May 22, 2019
```
Add Kaldi IO as a dependency + put  a wrapper to convert to Tensor + add test to check correct type (#111)
```
  a422f3fe
25 Dec, 2018 2 commits
- docs update and fixes from pr comments · 0e0d1e59
  David Pollack authored Dec 24, 2018
  
  0e0d1e59
- sox effects and documentation · 301e2e98
  David Pollack authored Sep 11, 2018
  
  301e2e98
18 Dec, 2017 1 commit
- improve README and add sphinx docs generator · 088d5674
  Soumith Chintala authored Dec 17, 2017
  
  088d5674