- 15 Sep, 2022 1 commit
-
-
moto authored
Summary: Preparation for the adoptation of `autosummary`. Replace `:footcite:` with `:cite:` and introduce dedicated reference page, as `:footcite:` does not work well with `autosummary`. Example: https://output.circle-artifacts.com/output/job/4da47ba6-d9c7-418e-b5b0-e9f8a146a6c3/artifacts/0/docs/datasets.html#cmuarctic https://output.circle-artifacts.com/output/job/4da47ba6-d9c7-418e-b5b0-e9f8a146a6c3/artifacts/0/docs/references.html Pull Request resolved: https://github.com/pytorch/audio/pull/2676 Reviewed By: carolineechen Differential Revision: D39509431 Pulled By: mthrok fbshipit-source-id: e6003dd01ec3eff3d598054690f61de8ee31ac9a
-
- 03 Aug, 2022 1 commit
-
-
bshall authored
Summary: I took a stab at implementing the ITU-R BS.1770-4 loudness recommendation (closes https://github.com/pytorch/audio/issues/1205). To give some more details: - I've implemented K-weighting following csteinmetz1 instead of BrechtDeMan since it fit well with torchaudio's already implemented filters (`treble_biquad` and `highpass_biquad`). - I've added four audio files to test compliance with the recommendation. These are linked in [this pdf](https://www.itu.int/dms_pub/itu-r/opb/rep/R-REP-BS.2217-2-2016-PDF-E.pdf). There are many more test files there but I didn't want to bog down the assets directory with too many files. Let me know if I should add or remove anything. - I've kept many of the constant internal to the function (e.g. the block duration, overlap, and the absolute threshold gamma). I'm not sure if these should be exposed in the signature. - I've implemented support for up to 5 channels (following both csteinmetz1 and BrechtDeMan). The recommendation includes weights for up to 24 channels. Is there any convention for how many channels to support? I hope this is helpful! looking forward to hearing from you. Pull Request resolved: https://github.com/pytorch/audio/pull/2472 Reviewed By: hwangjeff Differential Revision: D38389155 Pulled By: carolineechen fbshipit-source-id: fcc86d864c04ab2bedaa9acd941ebc4478ca6904
-
- 26 Feb, 2022 1 commit
-
-
Zhaoheng Ni authored
Summary: This PR adds ``apply_beamforming`` method to ``torchaudio.functional``. The method employs the beamforming weight to the multi-channel noisy spectrum to obtain the single-channel enhanced spectrum. The input arguments are the complex-valued beamforming weight Tensor and the multi-channel noisy spectrum. Pull Request resolved: https://github.com/pytorch/audio/pull/2232 Reviewed By: mthrok Differential Revision: D34474561 Pulled By: nateanl fbshipit-source-id: 2910251a8f111e65375dfb50495b6a415113f06d
-
- 25 Feb, 2022 5 commits
-
-
Zhaoheng Ni authored
Summary: This PR adds ``rtf_power`` method to ``torchaudio.functional``. The method computes the relative transfer function (RTF) or the steering vector by [the power iteration method](https://onlinelibrary.wiley.com/doi/abs/10.1002/zamm.19290090206). [This paper](https://arxiv.org/pdf/2011.15003.pdf) describes the power iteration method in English. The input arguments are the complex-valued power spectral density (PSD) matrix of the target speech, PSD matrix of noise, int or one-hot Tensor to indicate the reference channel, number of iterations, respectively. Pull Request resolved: https://github.com/pytorch/audio/pull/2231 Reviewed By: mthrok Differential Revision: D34474503 Pulled By: nateanl fbshipit-source-id: 47011427ec4373f808755f0e8eff1efca57655eb
-
Zhaoheng Ni authored
Summary: This PR adds `rtf_evd` method to `torchaudio.functional`. The method computes the relative transfer function (RTF) or the steering vector by eigenvalue decomposition. The input argument is the power spectral density (PSD) matrix of the target speech. Pull Request resolved: https://github.com/pytorch/audio/pull/2230 Reviewed By: mthrok Differential Revision: D34474188 Pulled By: nateanl fbshipit-source-id: 888df4b187608ed3c2b7271b34d2231cdabb0134
-
Zhaoheng Ni authored
Summary: This PR adds ``mvdr_weights_rtf`` method to ``torchaudio.functional``. It computes the MVDR weight matrix based on the solution that applies relative transfer function (RTF). See [the paper](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.725.673&rep=rep1&type=pdf) for the reference. The input arguments are the complex-valued RTF Tensor of the target speech, power spectral density (PSD) matrix of noise, int or one-hot Tensor to indicate the reference channel, respectively. Pull Request resolved: https://github.com/pytorch/audio/pull/2229 Reviewed By: mthrok Differential Revision: D34474119 Pulled By: nateanl fbshipit-source-id: 2d6f62cd0858f29ed6e4e03c23dcc11c816204e2
-
Zhaoheng Ni authored
Summary: This PR adds ``mvdr_weights_souden`` method to ``torchaudio.functional``. It computes the MVDR weight matrix based on the solution proposed by [``Souden et, al.``](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.725.673&rep=rep1&type=pdf). The input arguments are the complex-valued power spectral density (PSD) matrix of the target speech, PSD matrix of noise, int or one-hot Tensor to indicate the reference channel, respectively. Pull Request resolved: https://github.com/pytorch/audio/pull/2228 Reviewed By: mthrok Differential Revision: D34474018 Pulled By: nateanl fbshipit-source-id: 725df812f8f6e6cc81cc37e8c3cb0da2ab3b74fb
-
Zhaoheng Ni authored
Summary: This PR adds ``psd`` method to ``torchaudio.functional``. It computes the power spectral density (PSD) matrix of the complex-valued spectrum. The method also supports normalization of Time-Frequency mask. Pull Request resolved: https://github.com/pytorch/audio/pull/2227 Reviewed By: mthrok Differential Revision: D34473908 Pulled By: nateanl fbshipit-source-id: c1cfc584085d77881b35d41d76d39b26fca1dda9
-
- 10 Nov, 2021 1 commit
-
-
Krishna Kalyan authored
-
- 04 Nov, 2021 1 commit
-
-
moto authored
-
- 28 Oct, 2021 1 commit
-
-
S Harish authored
-
- 27 Oct, 2021 1 commit
-
-
S Harish authored
-
- 26 Oct, 2021 1 commit
-
-
S Harish authored
-
- 01 Sep, 2021 1 commit
-
-
yangarbiter authored
-
- 20 Aug, 2021 2 commits
-
-
Caroline Chen authored
-
hwangjeff authored
* Add basic filtfilt implementation * Add filtfilt to functional package; add tests Co-authored-by:V G <vladislav.goncharenko@phystech.edu>
-
- 19 Aug, 2021 1 commit
-
-
Caroline Chen authored
-
- 14 Aug, 2021 1 commit
-
-
nateanl authored
-
- 02 Aug, 2021 1 commit
-
-
Joel Frank authored
- Renamed torchaudio.functional.create_fb_matrix to torchaudio.functional.melscale_fbanks. - Added interface with a warning for create_fb_matrix
-
- 29 Jul, 2021 1 commit
-
-
Joel Frank authored
Summary: - Add linear_fbank method - Add LFCC in transforms
-
- 16 Jul, 2021 1 commit
-
-
nateanl authored
-
- 03 Jun, 2021 1 commit
-
-
moto authored
* Use `bibtex` for paper citations. * add `override.css` for fixing back reference. * wav2vec2 * wav2letter * convtasnet * deepspeech * rnnt-loss * griffinlim * Fix broken references in `filtering`. * Fix note in soundfile backends. * Tweak wav2vec2 example. * Removes unused `pytorch_theme.css`
-
- 02 Jun, 2021 1 commit
-
-
Caroline Chen authored
-
- 22 Mar, 2021 1 commit
-
-
Caroline Chen authored
This PR additionally adds batching to kaldi compliance resample interface.
-
- 01 Mar, 2021 1 commit
-
-
moto authored
-
- 26 Feb, 2021 1 commit
-
-
Vincent QB authored
-
- 12 Feb, 2021 1 commit
-
-
moto authored
-
- 04 Dec, 2020 1 commit
-
-
Krishna Kalyan authored
* Add griffinlim and DB_to_amplitude * Fix Dataset docstring * Fix other formatting Co-authored-by:krishnakalyan3 <skalyan@cloudera.com>
-
- 06 Nov, 2020 1 commit
-
-
moto authored
-
- 30 Jul, 2020 1 commit
-
-
Jeremy Chen authored
* `istft` has been migrated to `pytorch`, and `torchaudio.functional.istft` has been deprecated in 0.6.0 release. This PR removes it Co-authored-by:Jeremy Chen <jeremyyy@fb.com>
-
- 03 Jun, 2020 1 commit
-
-
jimchen90 authored
* Add bass with biquad * Update functional.py Add the normalization coefficients * Update test_sox_compatibility.py In test_sox_compatibility.py file, I add two bass tests: one test sets gain = 30, atol = 1e-4, the other sets gain = 40, atol = 1.5e-4. The details can be seen in pytorch#676 * Update torchscript_consistency_impl.py Add torchscript test * Add flake8 test Co-authored-by:Ji Chen <jimchen90@devfair0160.h2.fair>
-
- 02 Jun, 2020 1 commit
-
-
Bhargav Kathivarapu authored
* Add flanger to functional Signed-off-by:
Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com> * Add random seed Signed-off-by:
Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com> * fix flanger Signed-off-by:
Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com> * shape * Change bool arguments to strings Signed-off-by:
Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com> * Refactor tests Signed-off-by:
Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com> Co-authored-by:
Vincent QB <vincentqb@users.noreply.github.com>
-
- 01 May, 2020 1 commit
-
-
Bhargav Kathivarapu authored
Signed-off-by:Bhargav Kathivarapu <bhargavkathivarapu31@gmail.com>
-
- 28 Apr, 2020 1 commit
-
-
Artyom Astafurov authored
* initial test, stub function, transform and docstring * add draft working implementation, update docstrings * merge VadSate into Vad calss, move Channel into Vad class * remove functional stub for vad * add wav file for test * refactor _measure() to improve performance * rename argument * replace copy_ with assignment * refactor init, update documentation, update test for readability * clean up default values * move code from transforms.py to funtional.py and integrate state into a function * remove Channel state class * fix calcuation of a flush point * make multiple channels work * clean up multi-channel, update test * rename variables and re-org arguments for _measure * fix linting errors * add torchscript consistency test and fix errors * support and test batch consistency, fix normalization * update documentation, switch torchscript consistancy test to use transform to improve coverage * fix linting errors * remove un-used imports * address PR comments * add doc references into rst
-
- 27 Apr, 2020 1 commit
-
-
Vincent QB authored
* formatting. * update datasets.
-
- 22 Apr, 2020 1 commit
-
-
Bhargav Kathivarapu authored
* Add overdrive to functional * Minor change to overdrive * Minor change to overdrive * minor flake8 changes * changes to make overdrive generic Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
- 20 Apr, 2020 1 commit
-
-
Bhargav Kathivarapu authored
* Add dcshift to functional * Doc string change and remove inplace clamp * Minor Fix to dcshit and separate sox test refactoring * Minor change to limiter_gain type * adding dcshift to __all__ in functional
-
- 17 Apr, 2020 1 commit
-
-
wanglong001 authored
* add cmvn * Update transforms.rst add cmvn * Correct the format * Correct the format * Correct the format * add test unit and cmvn change to cmn * fix bug Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
- 16 Apr, 2020 1 commit
-
-
Bhargav Kathivarapu authored
* Add contrast to functional * add tests for contrast and update functional.rst * Minor changes to sox and batch tests for contrast
-
- 25 Feb, 2020 1 commit
-
-
moto authored
-