Commits · 8c1db721135e61d4a3dfbc4e2bbe05cd50cfded1 · OpenDAS / Torchaudio

"vscode:/vscode.git/clone" did not exist on "538a809f162aa7c2c0bf6e205463c718cd8be216"

25 Feb, 2022 4 commits

Add rtf_evd method to torchaudio.functional (#2230) · 86fe4fa7

Zhaoheng Ni authored Feb 25, 2022

Summary:
This PR adds `rtf_evd` method to `torchaudio.functional`.
The method computes the relative transfer function (RTF) or the steering vector by eigenvalue decomposition.
The input argument is the power spectral density (PSD) matrix of the target speech.

Pull Request resolved: https://github.com/pytorch/audio/pull/2230

Reviewed By: mthrok

Differential Revision: D34474188

Pulled By: nateanl

fbshipit-source-id: 888df4b187608ed3c2b7271b34d2231cdabb0134

86fe4fa7

Add mvdr_weights_rtf to torchaudio.functional (#2229) · 3566ffc5

Zhaoheng Ni authored Feb 25, 2022

Summary:
This PR adds ``mvdr_weights_rtf`` method to ``torchaudio.functional``.
It computes the MVDR weight matrix based on the solution that applies relative transfer function (RTF). See [the paper](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.725.673&rep=rep1&type=pdf) for the reference.
The input arguments are the complex-valued RTF Tensor of the target speech, power spectral density (PSD) matrix of noise, int or one-hot Tensor to indicate the reference channel, respectively.

Pull Request resolved: https://github.com/pytorch/audio/pull/2229

Reviewed By: mthrok

Differential Revision: D34474119

Pulled By: nateanl

fbshipit-source-id: 2d6f62cd0858f29ed6e4e03c23dcc11c816204e2

3566ffc5

Add mvdr_weights_souden to torchaudio.functional (#2228) · 5d06a369

Zhaoheng Ni authored Feb 25, 2022

Summary:
This PR adds ``mvdr_weights_souden`` method to ``torchaudio.functional``.
It computes the MVDR weight matrix based on the solution proposed by [``Souden et, al.``](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.725.673&rep=rep1&type=pdf).
The input arguments are the complex-valued power spectral density (PSD) matrix of the target speech, PSD matrix of noise, int or one-hot Tensor to indicate the reference channel, respectively.

Pull Request resolved: https://github.com/pytorch/audio/pull/2228

Reviewed By: mthrok

Differential Revision: D34474018

Pulled By: nateanl

fbshipit-source-id: 725df812f8f6e6cc81cc37e8c3cb0da2ab3b74fb

5d06a369

Add psd method to torchaudio.functional (#2227) · 07bd1aa3

Zhaoheng Ni authored Feb 25, 2022

Summary:
This PR adds ``psd`` method to ``torchaudio.functional``.
It computes the power spectral density (PSD) matrix of the complex-valued spectrum.
The method also supports normalization of Time-Frequency mask.

Pull Request resolved: https://github.com/pytorch/audio/pull/2227

Reviewed By: mthrok

Differential Revision: D34473908

Pulled By: nateanl

fbshipit-source-id: c1cfc584085d77881b35d41d76d39b26fca1dda9

07bd1aa3

17 Feb, 2022 1 commit

Refactor batch consistency test in functional (#2245) · 9cf59e75

Zhaoheng Ni authored Feb 17, 2022

Summary:
In batch_consistency tests, the `assert_batch_consistency` method only accepts single Tensor, which is not applicable to some methods. For example, `lfilter` and `filtfilt` requires three Tensors as the arguments, hence they don't follow `assert_batch_consistency` in the tests.
This PR refactors the test to accept a tuple of Tensors which have `batch` dimension. For the other arguments like `int` or `str`, they are given as `*args` after the tuple.

Pull Request resolved: https://github.com/pytorch/audio/pull/2245

Reviewed By: mthrok

Differential Revision: D34273035

Pulled By: nateanl

fbshipit-source-id: 0096b4f062fb4e983818e5374bed6efc7b15b056

9cf59e75

16 Feb, 2022 2 commits

Refactor torchscript consistency test in functional (#2246) · 87d79889

Zhaoheng Ni authored Feb 16, 2022

Summary:
In torchscript_consistency tests, the `func` in each test method only accepts one `tensor` as the argument, for the other arguments of `F.xyz` method, they need to be defined inside the `func`. If there is no `Tensor` argument in `F.xzy`, the tests use a `dummy` tensor which is not used anywhere. In this PR, we refactor ``_assert_consistency`` and ``_assert_consistency_complex`` to accept a tuple of inputs instead of just one `tensor`.

Pull Request resolved: https://github.com/pytorch/audio/pull/2246

Reviewed By: carolineechen

Differential Revision: D34273057

Pulled By: nateanl

fbshipit-source-id: a3900edb3b2c58638e513e1490279d771ebc3d0b

87d79889

Add complex dtype support in functional autograd test (#2244) · eeba91dc

Zhaoheng Ni authored Feb 16, 2022

Summary:
In autograd tests, to guarantee the precision, the dtype of Tensors are converted to `torch.float64` if they are real. However, the complex dtype is not considered. This PR adds `self.complex_dtype` support to the inputs.

Pull Request resolved: https://github.com/pytorch/audio/pull/2244

Reviewed By: mthrok

Differential Revision: D34272998

Pulled By: nateanl

fbshipit-source-id: e8698a74d7b8d99ee0fcb5f5cb5f2ffc8c80b9b5

eeba91dc

09 Feb, 2022 1 commit

Fix librosa calls (#2208) · e5d567c9

hwangjeff authored Feb 08, 2022

Summary:
Yesterday's release of librosa 0.9.0 made args keyword-only and changed default padding from "reflect" to "zero" for some functions. This PR adjusts callsites in our tutorials and tests accordingly.

Pull Request resolved: https://github.com/pytorch/audio/pull/2208

Reviewed By: mthrok

Differential Revision: D34099793

Pulled By: hwangjeff

fbshipit-source-id: 4e2642cdda8aae6d0a928befaf1bbb3873d229bc

e5d567c9

29 Dec, 2021 1 commit

Add parameter p to TimeMasking (#2090) · 1ec7ff73

hwangjeff authored Dec 29, 2021

Summary:
Adds parameter `p` to `TimeMasking` to allow for enforcing an upper bound on the proportion of time steps that it can mask. This behavior is consistent with the specifications provided in the SpecAugment paper (https://arxiv.org/abs/1904.08779).

Pull Request resolved: https://github.com/pytorch/audio/pull/2090

Reviewed By: carolineechen

Differential Revision: D33344772

Pulled By: hwangjeff

fbshipit-source-id: 6ff65f5304e489fa1c23e15c3d96b9946229fdcf

1ec7ff73

23 Dec, 2021 1 commit

Apply arc lint to pytorch audio (#2096) · 5859923a

Joao Gomes authored Dec 23, 2021

Summary:
Pull Request resolved: https://github.com/pytorch/audio/pull/2096

run: `arc lint --apply-patches --paths-cmd 'hg files -I "./**/*.py"'`

Reviewed By: mthrok

Differential Revision: D33297351

fbshipit-source-id: 7bf5956edf0717c5ca90219f72414ff4eeaf5aa8

5859923a

04 Nov, 2021 1 commit
- Doc fixes (#1982) · c670898c
  Caroline Chen authored Nov 04, 2021
  
  c670898c
03 Nov, 2021 2 commits

[BC-Breaking] Drop pseudo complex support from phase_vocoder / TimeStretch (#1957) · d3e146fd
moto authored Nov 03, 2021
```
Following the plan #1337, this commit drops the support for pseudo complex type from `F.phase_vocoder` and `T.TimeStretch`.
```
d3e146fd

[BC-Breaking] Drop pseudo complex support from spectrogram (#1958) · 5ec6ada6

moto authored Nov 03, 2021

Following the plan #1337, this commit drops the support for pseudo complex type from 
`F.spectrogram` and `T.Spectrogram`.

It also deprecates the use of `return_complex` argument.

5ec6ada6

28 Oct, 2021 1 commit
- Remove F.complex_norm and T.ComplexNorm (#1942) · ab50909d
  S Harish authored Oct 28, 2021
  
  ab50909d
13 Oct, 2021 1 commit
- [BC-Breaking] Ensure integer input frequencies for resample (#1857) · 25a8adf6
  Caroline Chen authored Oct 13, 2021
  
  25a8adf6
02 Sep, 2021 1 commit

Put output tensor on proper device in `get_whitenoise()` (#1744) · feede97e

jayleverett authored Sep 02, 2021

* put output tensor on device in `get_whitenoise()`

* Update `get_spectrogram()` so that window uses same device as waveform

* put window on proper device in `test_griffinlim()`

feede97e

27 Aug, 2021 1 commit

Refactor scripting in test (#1727) · 595b37b6

moto authored Aug 27, 2021

Introduce a helper function `torch_script` that performs scripting in the recommended way.

595b37b6

20 Aug, 2021 1 commit

Add basic filtfilt implementation (#1681) · 496b381a

hwangjeff authored Aug 20, 2021



* Add basic filtfilt implementation

* Add filtfilt to functional package; add tests
Co-authored-by: V G <vladislav.goncharenko@phystech.edu>

496b381a

19 Aug, 2021 1 commit
- Move RNNT Loss out of prototype (#1711) · 2c115821
  Caroline Chen authored Aug 19, 2021
  
  2c115821
11 Aug, 2021 1 commit

Add InverseSpectrogram to transforms and functional (#1652) · 6e0af713

nateanl authored Aug 11, 2021



- Provide InverseSpectrogram module that corresponds to Spectrogram module
- Add length parameter to the forward method in transforms
Co-authored-by: dgenzel <dgenzel@fb.com>
Co-authored-by: Zhaoheng Ni <zni@fb.com>

6e0af713

10 Aug, 2021 1 commit
- Add batch support to lfilter (#1638) · 8094751f
  Chin-Yun Yu authored Aug 11, 2021
  
  8094751f
02 Aug, 2021 1 commit

Add melscale_fbanks and deprecate create_fb_matrix (#1653) · 83dc5ec7

Joel Frank authored Aug 02, 2021

- Renamed torchaudio.functional.create_fb_matrix to torchaudio.functional.melscale_fbanks.
- Added interface with a warning for create_fb_matrix

83dc5ec7

29 Jul, 2021 1 commit
- Add LFCC feature to transforms (#1611) · 86370639
  Joel Frank authored Jul 29, 2021
```
Summary:
- Add linear_fbank method
- Add LFCC in transforms
```
  86370639
21 Jul, 2021 1 commit
- Add filterbanks support to lfilter (#1587) · aa0dd03b
  Chin-Yun Yu authored Jul 22, 2021
  
  aa0dd03b
16 Jul, 2021 1 commit
- Add PitchShift to functional and transform (#1629) · f5dbb002
  nateanl authored Jul 16, 2021
  
  f5dbb002
25 Jun, 2021 1 commit
- Add edit_distance · 6bfd83b4
  yangarbiter authored Jun 25, 2021
  
  6bfd83b4
04 Jun, 2021 2 commits

[BC-Breaking] Default to native complex type when returning raw spect… (#1549) · 5432a3f5

moto authored Jun 04, 2021

* [BC-Breaking] Default to native complex type when returning raw spectrogram

Part of https://github.com/pytorch/audio/issues/1337 .

- This code changes the return type of spectrogram to be native complex dtype,
when (and only when) returning raw (complex-valued) spectrogram.
- Change `return_complex=False` to `return_complex=True` in spectrogram ops.
- `return_complex` is only effective when `power` is `None`. It is ignored for
cases where `power` is not `None`. Because the returned Tensor is power spectrogram,
which is real-valued Tensors.

5432a3f5

Migrate resample tests from kaldi to functional (#1520) · 15a7f78c
Caroline Chen authored Jun 03, 2021

15a7f78c

01 Jun, 2021 1 commit
- Ensure resampling identity is unchanged (#1537) · fad19fab
  Caroline Chen authored Jun 01, 2021
  
  fad19fab
22 May, 2021 1 commit

fbsync (#1524) · ae9560da

parmeet authored May 22, 2021

* Remove `class FunctionalComplex` header accidentally re-introduced in #1490

ae9560da

20 May, 2021 1 commit
- Add F.resample torchscript test (#1516) · 7763ed87
  Caroline Chen authored May 20, 2021
  
  7763ed87
11 May, 2021 1 commit
- Add warning for non-integer resampling frequencies (#1490) · 4b2de71f
  Caroline Chen authored May 11, 2021
  
  4b2de71f
06 May, 2021 2 commits
- Support higher order derivatives for `F.lfilter` (#1441) · 723e9a52
  Chin-Yun Yu authored May 07, 2021
  
  723e9a52
- Merge test classes for complex (#1491) · 7d45851d
  moto authored May 06, 2021
  
  7d45851d
03 May, 2021 1 commit

Ensure axis masking operations are not in-place (#1481) · 7fd5fce4

Caroline Chen authored May 03, 2021

It was reported in #1478 that spectrogram masking operations were done in-place and modified the original input tensors. This PR fixes this behavior and adds tests to ensure that the input tensor is not changed.

7fd5fce4

26 Apr, 2021 1 commit
- Run functional tests on GPU as well as CPU (#1475) · b5d80279
  Mark Saroufim authored Apr 26, 2021
  
  b5d80279
19 Apr, 2021 1 commit
- Refactor functional test (#1463) · b059f087
  dhthompson authored Apr 19, 2021
```
- Put functional test logic into one place, `functional_impl.py`
- Tidy imports
```
  b059f087
15 Apr, 2021 1 commit
- Fixed floor_divide deprecation warnings seen in pytest output (#1455) · 48630302
  Prabhat Roy authored Apr 15, 2021
```
* Fixed floor_divide deprecation warnings seen in pytest output

* Fixed warning in test_flanger_triangle_linear
```
  48630302
14 Apr, 2021 1 commit
- Save/load TorchScript object in test (#1446) · 5c696b50
  moto authored Apr 14, 2021
  
  5c696b50
13 Apr, 2021 1 commit

Remove VAD from batch consistency tests (#1451) · 749c0e39

Jcaw authored Apr 13, 2021

The VAD function trims the input tensor to the first instance of voice
activity on any channel or item. Trimming batches this way may be
undesirable as the item with earliest activity will dominate. Either
way, the batch behaviour does not match the itemwise behaviour.

The VAD batch consistency tests currently pass out of coincidence, but
they specify incorrect behaviour. This commit removes them.

749c0e39