Commits · 9efc3503742fa942865c2e5662147209087d2ef4 · OpenDAS / Torchaudio

10 Mar, 2020 1 commit

Tomás Osório authored Mar 10, 2020



* add basics for Fade

* add fade possibilities: at start, end or both

* add different types of fade

* add docstrings, add overriding possibility

* remove unnecessary logic

* correct typing

* agnostic to batch size or n_channels

* add batch test to Fade

* add transform to options

* add test_script_module

* add coherency with test batch

* remove extra step for waveform_length

* update docstring

* add test to compare fade with sox

* change name of fade_shape

* update test fade vs sox with new nomenclature for fade_shape

* add Documentation
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

9efc3503

06 Mar, 2020 1 commit
- Change default value of dither (#453) · e108fe2a
  Vincent QB authored Mar 06, 2020
```
* change default value of dither.

* update doc.
```
  e108fe2a
05 Mar, 2020 3 commits
- Improve Docstrings in transfroms (#442) · 4936c9eb
  Vincent QB authored Mar 05, 2020
```
* get typing on Docstrings right

* Improve Documentation standardise
```
  4936c9eb
- phase_advance should be a buffer so it moves device correctly (#457) · f1a5503e
  Vincent QB authored Mar 05, 2020
```
* phase_advance should be a buffer so it moves device correctly

* flake8
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>
```
  f1a5503e
- Migrate TimeStretch and AmplitudeToDB to torch.nn.Module (#456) · db1e7da9
  Vincent QB authored Mar 05, 2020
```
* AmplitudeToDB to torch.nn.Module

* TimeStretch use torch.nn.Module
```
  db1e7da9
28 Feb, 2020 1 commit

Add test for InverseMelScale (#448) · babc24af

moto authored Feb 28, 2020



* Inverse Mel Scale Implementation

* Inverse Mel Scale Docs

* Better working version.

* GPU fix

* These shouldn't go on git..

* Even better one, but does not support JITability.

* Remove JITability test

* Flake8

* n_stft is a must

* minor clean up of initialization

* Add librosa consistency test

This PR follows up #366 and adds test for `InverseMelScale` (and `MelScale`) for librosa compatibility.

For `MelScale` compatibility test;
1. Generate spectrogram
2. Feed the spectrogram to `torchaudio.transforms.MelScale` instance
3. Feed the spectrogram to `librosa.feature.melspectrogram` function.
4. Compare the result from 2 and 3 elementwise.
Element-wise numerical comparison is possible because under the hood their implementations use the same algorith.

For `InverseMelScale` compatibility test, it is more elaborated than that.
1. Generate the original spectrogram
2. Convert the original spectrogram to Mel scale using `torchaudio.transforms.MelScale` instance
3. Reconstruct spectrogram using torchaudio implementation
3.1. Feed the Mel spectrogram to `torchaudio.transforms.InverseMelScale` instance and get reconstructed spectrogram.
3.2. Compute the sum of element-wise P1 distance of the original spectrogram and that from 3.1.
4. Reconstruct spectrogram using librosa
4.1. Feed the Mel spectrogram to `librosa.feature.inverse.mel_to_stft` function and get reconstructed spectrogram.
4.2. Compute the sum of element-wise P1 distance of the original spectrogram and that from 4.1. (this is the reference.)
5. Check that resulting P1 distance are in a roughly same value range.

Element-wise numerical comparison is not possible due to the difference algorithms used to compute the inverse. The reconstructed spectrograms can have some values vary in magnitude.
Therefore the strategy here is to check that P1 distance (reconstruction loss) is not that different from the value obtained using `librosa`. For this purpose, threshold was empirically chosen

```
print('p1 dist (orig <-> ta):', torch.dist(spec_orig, spec_ta, p=1))
print('p1 dist (orig <-> lr):', torch.dist(spec_orig, spec_lr, p=1))
>>> p1 dist (orig <-> ta): tensor(1482.1917)
>>> p1 dist (orig <-> lr): tensor(1420.7103)
```

This value can vary based on the length and the kind of the signal being processed, so it was handpicked.

* Address review feedbacks

* Support arbitrary batch dimensions.

* Add batch test

* Use view for batch

* fix sgd

* Use negative indices and update docstring

* Update threshold
Co-authored-by: Charles J.Y. Yoon <jaeyeun97@gmail.com>

babc24af

25 Feb, 2020 1 commit
- Add allpass filter to functional (#444) · 2cf59c41
  moto authored Feb 25, 2020
  
  2cf59c41
24 Feb, 2020 1 commit
- remove custom gcd command since python2.7 is deprecated. testing resample. (#441) · 3549c57b
  Vincent QB authored Feb 24, 2020
  
  3549c57b
22 Feb, 2020 1 commit

Adding Speech Command Dataset (#437) · 4d58bc46

Tomás Osório authored Feb 22, 2020



* add speechcommand dataset and test

* prepend the full path to each result

* add missing param on docstring in walk_files

* add file to run tests on SpeechCommand Dataset

* reduce logic

* update test on SpeechCommands

* correct the indentation on docstring walk_files

* flake8 compliance

* change tuple type returned. move path split logic in load item.

* typo in name.

* redundant file path.

* filter background noise.
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

4d58bc46

20 Feb, 2020 1 commit
- LJ Speech dataset (#439) · 32bae85c
  Taras Sereda authored Feb 20, 2020
```
* LJ Speech dataset

* refactoring

as per @vincentqb's suggestions
```
  32bae85c
14 Feb, 2020 1 commit
- batch resample transform (#435) · 445e14d1
  Vincent QB authored Feb 14, 2020
  
  445e14d1
12 Feb, 2020 1 commit
- adding dev-other. (#433) · ffeee199
  Vincent QB authored Feb 12, 2020
  
  ffeee199
29 Jan, 2020 2 commits

dither jit test (#417) · ac5dd79f

Vincent QB authored Jan 29, 2020

* workaround for bartlett_window https://github.com/pytorch/pytorch/issues/32358#issuecomment-576909755

* only change dtype.

ac5dd79f

.circleci: Bump python3.7 -> python3.8 (#397) · add9495e
Eli Uriegas authored Jan 29, 2020
```
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>
```
add9495e

22 Jan, 2020 2 commits
- jit/cuda test for complex norm. (#421) · f4f71436
  Vincent QB authored Jan 22, 2020
  
  f4f71436
- Revert "conditionally skip unsupported subTest tests for Python 2 (#386)" (#420) · 5894928d
  Vincent QB authored Jan 22, 2020
```
This reverts commit cdf5c83d.
```
  5894928d
17 Jan, 2020 1 commit
- replace reshape by view. (#409) · 60fd113c
  Vincent QB authored Jan 17, 2020
  
  60fd113c
16 Jan, 2020 3 commits

ci: Remove 2.7 tests (#413) · b32606d6

Eli Uriegas authored Jan 16, 2020



Python 2.7 was EOL on January 1, 2020

The last torchaudio release to support Python 2.7 was 0.4.0
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

b32606d6

move master to 0.5.0 (#414) · b0f180bc
Vincent QB authored Jan 16, 2020

b0f180bc

packaging: Install correct version of pytorch for pip (#412) · 2e4c2a1f

Eli Uriegas authored Jan 15, 2020



CUDA_SUFFIX was still being used here when it should've been swapped out
for PYTORCH_VERSION_SUFFIX, which is what's being used for conda below.
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>
(cherry picked from commit 009b115d074ac5fcca2cc34662fe814df63324c1)
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

2e4c2a1f

13 Jan, 2020 3 commits
- extend batch support (#391) · c4565245
  Vincent QB authored Jan 13, 2020
```
* extend batch support

closes #383

* function for batch test.

* set seed.

* adjust tolerance for griffinlim.
```
  c4565245
- Upgrading to UserWarning so that the user gets the warning. (#402) · 45498f26
  Vincent QB authored Jan 13, 2020
  
  45498f26
- [Bug Fix] fix power of spectrogram. makes power a float (#392) · 79b33187
  Vincent QB authored Jan 13, 2020
```
* fix power of spectrogram. makes power a float.

closes #389

* commenting out failing test.

* change skip test logic for librosa.
closes #373
```
  79b33187
09 Jan, 2020 3 commits

Move jitability test (#395) · 343d0220
Vincent QB authored Jan 09, 2020
```
* move test for scriptmodule.

* avoiding code duplication.
```
343d0220

.circleci: Remove if block, wasn't doing anything (#399) · 7e07693f

Eli Uriegas authored Jan 09, 2020



With the introduction of the `filter_branch` parameter to the `workflows`
function we no longer have a need to have this if block anymore per
@ezyang's assessment.
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

7e07693f

Update requirements for Windows (#398) · 73243090
peterjc123 authored Jan 10, 2020

73243090

08 Jan, 2020 1 commit

Add Windows CI (#394) · be5b2d56

peterjc123 authored Jan 09, 2020

* [WIP] Add Windows CI

* Remove cu_version

* checkout_merge -> checkout

* Add build script

* Switch backend to soundfile

* Remove soundfile as dependency

* Rename jobs

* Fix lint

be5b2d56

02 Jan, 2020 4 commits
- use standard naming. (#393) · 719a39de
  Vincent QB authored Jan 02, 2020
  
  719a39de
- conditionally skip unsupported subTest tests for Python 2 (#386) · cdf5c83d
  Karl Ostmo authored Jan 02, 2020
```
closes #387
```
  cdf5c83d
- Fix random seed for flaky test_griffinlim test (#388) · 479e666b
  Karl Ostmo authored Jan 02, 2020
```
closes #382
```
  479e666b
- Apply 'nightly' branch filter to binary uploads (#385) · e0f261f3
  Karl Ostmo authored Jan 02, 2020
```
Remove suspect logic.
```
  e0f261f3
27 Dec, 2019 2 commits

Adopt native-Python code generation convention (#378) · 42ffaf62

Karl Ostmo authored Dec 27, 2019

Closes #304

See rationale writeup: https://github.com/pytorch/vision/pull/1321#issuecomment-531033978

42ffaf62

Fix several errors in tests run by Travis (#380) · 9801caf6

Karl Ostmo authored Dec 27, 2019

* Declare file encoding to support special characters

* fix missing utf_8_encoder error in Travis tests

* Py 2.7 backwards-compat iterator

* ensure integer argument to torch.nn.functional.pad

* cast match.ceil result as integer

9801caf6

26 Dec, 2019 3 commits

create tensor directly on device. (#377) · 805d7922
Vincent QB authored Dec 26, 2019

805d7922

JIT resample waveform (#362) · 9409824f

Oktai Tatanov authored Dec 26, 2019



* test with jit.

* test passed after adding annotation, and removing get_default_dtype

* fix conversion error.

* moving test to transform.

* reverting to original test.

* move type.

* math.gcd added in python 3.5.
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

9409824f

Griffin-Lim Transformation Implementation (#365) · 4a934693

Charles J.Y. Yoon authored Dec 27, 2019



* Griffin-Lim Transformation Implementation

* Griffin-Lim Docs

* Remove f-string from backwards compatibility

* iSTFT is now jit-able.

* Comment changes

* Functional Implementation & now jitable

* flake8

* Doc & GPU Fix

* Librosa comparison test

* test directly griffinlim's output. tighter atol.

* matching signature to docstring.
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

4a934693

23 Dec, 2019 1 commit
- Module GPU test fixes (#369) · 34f3c12e
  Charles J.Y. Yoon authored Dec 24, 2019
```
* Fixed GPU tests
```
  34f3c12e
20 Dec, 2019 1 commit

Improve lfilter functional (#374) · f3365ecf

David Pollack authored Dec 20, 2019



* Simplify lfilter functional

* use `torch.clamp` instead of `torch.min(..., torch.max(...))`
* remove unneeded creation of ones tensor for previous method

The current lfilter function uses min and max to essentially do a clamp
function.  I changed the code to use clamp instead.  It is more readable
than the previous version.

FYI, if you want to keep the previous way, you could make a
broadcastable tensor of size 1 instead of creating a tensor the size of
the input.
Signed-off-by: David Pollack <david@da3.net>

* Parallelize waveform windows calculation

I've parallelized the calculation of the waveform windows and also
removed the inefficient calculation within the for-loop.
Signed-off-by: David Pollack <david@da3.net>

* Refactoring and minor readability changes
Signed-off-by: David Pollack <david@da3.net>

* Remove one more creation of a temporary tensor
Signed-off-by: David Pollack <david@da3.net>

f3365ecf

19 Dec, 2019 1 commit

Backend switch (#355) · 774ebc78

Vincent QB authored Dec 19, 2019

* move sox inside function calls.

* add backend switch mechanism.

* import sox at runtime, not import.

* add backend list.

* backend tests.

* creating hidden modules for backend.

* naming backend same as file: soundfile.

* remove docstring in backend file.

* test soundfile info.

* soundfile doesn't support int64.

* adding test for wav file.

* error with incorrect parameter instead of silent ignore.

* adding test across backend. using float32 as done in sox.

* backend guard decorator.

774ebc78

18 Dec, 2019 1 commit
- Fix MelScale test and documentation (#370) · 4887ff41
  Charles J.Y. Yoon authored Dec 19, 2019
```
* Fix MelScale test and documentation

* revert change to tests
```
  4887ff41