1. 03 Aug, 2022 1 commit
    • bshall's avatar
      An implemenation of the ITU-R BS.1770-4 loudness recommendation (#2472) · 946b180a
      bshall authored
      Summary:
      I took a stab at implementing the ITU-R BS.1770-4 loudness recommendation (closes https://github.com/pytorch/audio/issues/1205). To give some more details:
      - I've implemented K-weighting following csteinmetz1 instead of BrechtDeMan since it fit well with torchaudio's already implemented filters (`treble_biquad` and `highpass_biquad`).
      - I've added four audio files to test compliance with the recommendation. These are linked in [this pdf](https://www.itu.int/dms_pub/itu-r/opb/rep/R-REP-BS.2217-2-2016-PDF-E.pdf). There are many more test files there but I didn't want to bog down the assets directory with too many files. Let me know if I should add or remove anything.
      - I've kept many of the constant internal to the function (e.g. the block duration, overlap, and the absolute threshold gamma). I'm not sure if these should be exposed in the signature.
      - I've implemented support for up to 5 channels (following both csteinmetz1 and BrechtDeMan). The recommendation includes weights for up to 24 channels. Is there any convention for how many channels to support?
      
      I hope this is helpful! looking forward to hearing from you.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2472
      
      Reviewed By: hwangjeff
      
      Differential Revision: D38389155
      
      Pulled By: carolineechen
      
      fbshipit-source-id: fcc86d864c04ab2bedaa9acd941ebc4478ca6904
      946b180a
  2. 26 Feb, 2022 1 commit
    • Zhaoheng Ni's avatar
      Add apply_beamforming to torchaudio.functional (#2232) · 9c56ffb4
      Zhaoheng Ni authored
      Summary:
      This PR adds ``apply_beamforming`` method to ``torchaudio.functional``.
      The method employs the beamforming weight to the multi-channel noisy spectrum to obtain the single-channel enhanced spectrum.
      The input arguments are the complex-valued beamforming weight Tensor and the multi-channel noisy spectrum.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2232
      
      Reviewed By: mthrok
      
      Differential Revision: D34474561
      
      Pulled By: nateanl
      
      fbshipit-source-id: 2910251a8f111e65375dfb50495b6a415113f06d
      9c56ffb4
  3. 25 Feb, 2022 5 commits
  4. 10 Nov, 2021 1 commit
  5. 04 Nov, 2021 1 commit
  6. 28 Oct, 2021 1 commit
  7. 27 Oct, 2021 1 commit
  8. 26 Oct, 2021 1 commit
  9. 01 Sep, 2021 1 commit
  10. 20 Aug, 2021 2 commits
  11. 19 Aug, 2021 1 commit
  12. 14 Aug, 2021 1 commit
  13. 02 Aug, 2021 1 commit
  14. 29 Jul, 2021 1 commit
  15. 16 Jul, 2021 1 commit
  16. 03 Jun, 2021 1 commit
    • moto's avatar
      Update docs (#1550) · 0166a851
      moto authored
      * Use `bibtex` for paper citations.
        * add `override.css` for fixing back reference.
        * wav2vec2
        * wav2letter
        * convtasnet
        * deepspeech
        * rnnt-loss
        * griffinlim
      * Fix broken references in `filtering`.
      * Fix note in soundfile backends.
      * Tweak wav2vec2 example.
      * Removes unused `pytorch_theme.css`
      0166a851
  17. 02 Jun, 2021 1 commit
  18. 22 Mar, 2021 1 commit
  19. 01 Mar, 2021 1 commit
  20. 26 Feb, 2021 1 commit
  21. 12 Feb, 2021 1 commit
  22. 04 Dec, 2020 1 commit
  23. 06 Nov, 2020 1 commit
  24. 30 Jul, 2020 1 commit
  25. 03 Jun, 2020 1 commit
    • jimchen90's avatar
      Add Bass with Biquad (#661) · a466b3c2
      jimchen90 authored
      
      
      * Add bass with biquad
      
      * Update functional.py
      
      Add the normalization coefficients
      
      * Update test_sox_compatibility.py
      
      In test_sox_compatibility.py file, I add two bass tests: one test sets gain = 30, atol = 1e-4, the other sets gain = 40, atol = 1.5e-4. The details can be seen in pytorch#676
      
      * Update torchscript_consistency_impl.py
      
      Add torchscript test
      
      * Add flake8 test
      Co-authored-by: default avatarJi Chen <jimchen90@devfair0160.h2.fair>
      a466b3c2
  26. 02 Jun, 2020 1 commit
  27. 01 May, 2020 1 commit
  28. 28 Apr, 2020 1 commit
    • Artyom Astafurov's avatar
      Port sox::vad (#578) · 3ecc7016
      Artyom Astafurov authored
      * initial test, stub function, transform and docstring
      
      * add draft working implementation, update docstrings
      
      * merge VadSate into Vad calss, move Channel into Vad class
      
      * remove functional stub for vad
      
      * add wav file for test
      
      * refactor _measure() to improve performance
      
      * rename argument
      
      * replace copy_ with assignment
      
      * refactor init, update documentation, update test for readability
      
      * clean up default values
      
      * move code from transforms.py to funtional.py and integrate state into a function
      
      * remove Channel state class
      
      * fix calcuation of a flush point
      
      * make multiple channels work
      
      * clean up multi-channel, update test
      
      * rename variables and re-org arguments for _measure
      
      * fix linting errors
      
      * add torchscript consistency test and fix errors
      
      * support and test batch consistency, fix normalization
      
      * update documentation, switch torchscript consistancy test to use transform to improve coverage
      
      * fix linting errors
      
      * remove un-used imports
      
      * address PR comments
      
      * add doc references into rst
      3ecc7016
  29. 27 Apr, 2020 1 commit
  30. 22 Apr, 2020 1 commit
  31. 20 Apr, 2020 1 commit
    • Bhargav Kathivarapu's avatar
      Add dcshift to functional (#558) · 91e59231
      Bhargav Kathivarapu authored
      * Add dcshift to functional
      
      * Doc string change and remove inplace clamp
      
      * Minor Fix to dcshit and separate sox test refactoring
      
      * Minor change to limiter_gain type
      
      * adding dcshift to __all__ in functional
      91e59231
  32. 17 Apr, 2020 1 commit
  33. 16 Apr, 2020 1 commit
  34. 25 Feb, 2020 1 commit
  35. 21 Nov, 2019 1 commit