1. 26 Feb, 2021 1 commit
  2. 24 Feb, 2021 1 commit
  3. 17 Feb, 2021 1 commit
  4. 12 Feb, 2021 1 commit
  5. 11 Feb, 2021 1 commit
  6. 11 Jan, 2021 1 commit
  7. 04 Dec, 2020 1 commit
  8. 06 Nov, 2020 1 commit
  9. 05 Nov, 2020 1 commit
  10. 27 Oct, 2020 2 commits
    • Remove legacy sox effects (#977) · 0076ab07
      moto authored
      0076ab07
    • Switch the default backend to the ones with new interfaces (#978) · fa2e4fd4
      moto authored
      Refer to #903 for the overview of planned I/O changes.
      
      * Change the default backend from `"sox"` (deprecated) to `"sox_io"`
      * Change the default interface of the `"soundfile"` backend to the one identical to the `"sox_io"` backend
      * Deprecate `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE`
      * Update documentation
          * Re-order backends (default first)
          * Update overhaul timeline (removed 0.7.0)
          * Simplify `"soundfile"` backend description
      fa2e4fd4
  11. 19 Oct, 2020 1 commit
  12. 09 Oct, 2020 1 commit
  13. 02 Oct, 2020 1 commit
  14. 01 Oct, 2020 1 commit
  15. 15 Sep, 2020 1 commit
  16. 20 Aug, 2020 1 commit
  17. 19 Aug, 2020 1 commit
  18. 30 Jul, 2020 1 commit
  19. 29 Jul, 2020 1 commit
  20. 20 Jul, 2020 2 commits
    • Update documentation and fix docstrings (#788) · 2381dd89
      moto authored
      - Addresses #549 #638 #786 
      - Add `torchaudio` top level module doc
      - Separate `torchaudio` top level module doc from `index.html`
      - Add `backend` module doc.
      - Remove `-> None` from function signature as it adds noise to documentation
      - Change the argument name of `torchaudio.backend.sox_io_backend.save` from `tensor` to `src`, so that it matches the rest of the backends.
      - Tweak a bunch of docstrings
      2381dd89
    • Add LibriTTS dataset (#790) · 4b8aad7a
      jimchen90 authored
      
      
      * Add libritts
      
      Add LibriTTS dataset draft
      
      * Add libritts
      
      Use two separate ids for utterance_id.
      
      * Update output form
      
      Use full_id as utterance_id.
      
      * Update format
      
      Add space and test black format
      
      * Update test method
      
      * Add audio and text test
      
      Generate audio and text files on-the-fly in the test
      
      * Update format
      
      * Fix test error and remove assets libritts
      
      The test error is fixed by sorting the files by the 4th element instead of the 2nd element in samples. Since the files are now generated on-the-fly, the libritts files in assets are removed.
      
      * Add seed in `get_whitenoise` function
      
      * Change utterance to text
      
      Change `_utterance` to `_text`.
      Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>
      4b8aad7a
  21. 16 Jul, 2020 1 commit
    • Add Torchscript sox effects (#760) · 60a8e23d
      moto authored
      * Add sox_utils module
      
      * Make init/shutdown thread safe
      
      * Add sox effects implementation
      
      * Add test for sox effects
      
      * Update docstrings and add examples
      60a8e23d
  22. 10 Jun, 2020 1 commit
  23. 03 Jun, 2020 1 commit
    • Add Bass with Biquad (#661) · a466b3c2
      jimchen90 authored
      
      
      * Add bass with biquad
      
      * Update functional.py
      
      Add the normalization coefficients
      
      * Update test_sox_compatibility.py
      
      Add two bass tests to test_sox_compatibility.py: one with gain = 30 and atol = 1e-4, the other with gain = 40 and atol = 1.5e-4. See pytorch#676 for details.
      
      * Update torchscript_consistency_impl.py
      
      Add torchscript test
      
      * Add flake8 test
      Co-authored-by: Ji Chen <jimchen90@devfair0160.h2.fair>
      a466b3c2
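For readers unfamiliar with the bass effect, the following is a minimal sketch of a low-shelf biquad, assuming the standard Audio EQ Cookbook formulas. The commit does not show its code, so the function names, the default `central_freq` and `q` values, and the choice to divide every coefficient by `a0` (one possible reading of "normalization coefficients") are all assumptions made for illustration.

```python
import math


def low_shelf_coefficients(sample_rate, gain_db, central_freq=100.0, q=0.707):
    """Return (b, a) biquad coefficients for a low-shelf (bass) filter.

    Hypothetical helper based on the Audio EQ Cookbook low-shelf design;
    not taken from the commit itself.
    """
    w0 = 2.0 * math.pi * central_freq / sample_rate
    alpha = math.sin(w0) / (2.0 * q)
    amp = 10.0 ** (gain_db / 40.0)  # square root of the linear shelf gain
    cos_w0 = math.cos(w0)
    slope = 2.0 * math.sqrt(amp) * alpha

    b0 = amp * ((amp + 1) - (amp - 1) * cos_w0 + slope)
    b1 = 2 * amp * ((amp - 1) - (amp + 1) * cos_w0)
    b2 = amp * ((amp + 1) - (amp - 1) * cos_w0 - slope)
    a0 = (amp + 1) + (amp - 1) * cos_w0 + slope
    a1 = -2 * ((amp - 1) + (amp + 1) * cos_w0)
    a2 = (amp + 1) + (amp - 1) * cos_w0 - slope

    # Normalize so that the leading denominator coefficient is 1.
    b = (b0 / a0, b1 / a0, b2 / a0)
    a = (1.0, a1 / a0, a2 / a0)
    return b, a


def dc_gain(b, a):
    """Linear gain of the filter at 0 Hz (z = 1)."""
    return sum(b) / sum(a)


def nyquist_gain(b, a):
    """Linear gain of the filter at the Nyquist frequency (z = -1)."""
    return (b[0] - b[1] + b[2]) / (a[0] - a[1] + a[2])
```

Under these assumptions the filter boosts low frequencies by `gain_db` and leaves the top of the spectrum untouched, which is a quick sanity check before comparing against sox output.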
  24. 02 Jun, 2020 2 commits
  25. 01 May, 2020 1 commit
  26. 28 Apr, 2020 2 commits
    • Add model Wav2Letter (#462) · d678357f
      Tomás Osório authored
      * add wav2letter model
      
      * add unit_test to model
      
      * add docstrings
      
      * add documentation
      
      * fix minor error, change logic on forward
      
      * update padding same with ceil
      
      * add inline typing and minor fixes to docstrings
      
      * remove python2
      
      * add formula to docstrings, change param name
      
      * add test with mfcc, add pytest
      
      * fix bug, update docstrings
      
      * change parameter name
      d678357f
    • Port sox::vad (#578) · 3ecc7016
      Artyom Astafurov authored
      * initial test, stub function, transform and docstring
      
      * add draft working implementation, update docstrings
      
      * merge VadState into Vad class, move Channel into Vad class
      
      * remove functional stub for vad
      
      * add wav file for test
      
      * refactor _measure() to improve performance
      
      * rename argument
      
      * replace copy_ with assignment
      
      * refactor init, update documentation, update test for readability
      
      * clean up default values
      
      * move code from transforms.py to functional.py and integrate state into a function
      
      * remove Channel state class
      
      * fix calculation of a flush point
      
      * make multiple channels work
      
      * clean up multi-channel, update test
      
      * rename variables and re-org arguments for _measure
      
      * fix linting errors
      
      * add torchscript consistency test and fix errors
      
      * support and test batch consistency, fix normalization
      
      * update documentation, switch torchscript consistency test to use transform to improve coverage
      
      * fix linting errors
      
      * remove un-used imports
      
      * address PR comments
      
      * add doc references into rst
      3ecc7016
  27. 27 Apr, 2020 1 commit
  28. 22 Apr, 2020 1 commit
  29. 20 Apr, 2020 1 commit
    • Add dcshift to functional (#558) · 91e59231
      Bhargav Kathivarapu authored
      * Add dcshift to functional
      
      * Doc string change and remove inplace clamp
      
      * Minor fix to dcshift and separate sox test refactoring
      
      * Minor change to limiter_gain type
      
      * adding dcshift to __all__ in functional
      91e59231
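As a rough illustration of what a DC shift does, here is a minimal sketch that adds a constant offset to every sample and clamps the result to [-1, 1] without modifying the input. It assumes the simple path only: the `limiter_gain` option mentioned in the commit is deliberately left out, and the function name and list-based signature are hypothetical.

```python
def dc_shift(samples, shift):
    """Return a new list with `shift` added to each sample, clamped to [-1, 1].

    Hypothetical sketch of the no-limiter case; the input list is left
    unchanged rather than being clamped in place.
    """
    if not -2.0 <= shift <= 2.0:
        raise ValueError("shift is expected to lie in [-2, 2]")
    return [max(-1.0, min(1.0, s + shift)) for s in samples]
```

For example, `dc_shift([0.0, 0.5, 0.9], 0.25)` moves the first two samples up by 0.25 and pins the last one at the upper bound of 1.0.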
  30. 17 Apr, 2020 1 commit
  31. 16 Apr, 2020 1 commit
  32. 24 Mar, 2020 1 commit
    • Add Vol Transformation (#468) · 11fb22aa
      Tomás Osório authored
      * Add Vol with gain_type amplitude
      
      * add gain in db and add tests
      
      * add gain_type "power" and tests
      
      * add functional DB_to_amplitude
      
      * simplify
      
      * remove functional
      
      * improve docstring
      
      * add to documentation
      11fb22aa
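The three gain types named in this commit can be pictured as three ways of deriving one multiplicative factor. The sketch below is an assumption-laden illustration, not the code from the commit: the function name, the list-based signature, and the final clamp to [-1, 1] are hypothetical.

```python
import math


def apply_vol(samples, gain, gain_type="amplitude"):
    """Scale samples by a gain given as an amplitude ratio, a power ratio, or decibels.

    Hypothetical sketch; the clamp to [-1, 1] is an assumption.
    """
    if gain_type == "amplitude":
        factor = gain  # direct multiplier
    elif gain_type == "db":
        factor = 10.0 ** (gain / 20.0)  # decibels to amplitude ratio
    elif gain_type == "power":
        factor = math.sqrt(gain)  # power ratio to amplitude ratio
    else:
        raise ValueError(f"unknown gain_type: {gain_type!r}")
    return [max(-1.0, min(1.0, s * factor)) for s in samples]
```

With these definitions an amplitude gain of 2, a power gain of 4, and a gain of about 6.02 dB all roughly double the signal.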
  33. 10 Mar, 2020 1 commit
    • Add fade (#449) · 9efc3503
      Tomás Osório authored
      
      
      * add basics for Fade
      
      * add fade possibilities: at start, end or both
      
      * add different types of fade
      
      * add docstrings, add overriding possibility
      
      * remove unnecessary logic
      
      * correct typing
      
      * agnostic to batch size or n_channels
      
      * add batch test to Fade
      
      * add transform to options
      
      * add test_script_module
      
      * add coherency with test batch
      
      * remove extra step for waveform_length
      
      * update docstring
      
      * add test to compare fade with sox
      
      * change name of fade_shape
      
      * update test fade vs sox with new nomenclature for fade_shape
      
      * add Documentation
      Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>
      9efc3503
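To make "fade at start, end or both" and "different types of fade" concrete, here is a minimal sketch that builds a rising gain curve and applies it to the head and, mirrored, to the tail of a signal. The shape names (`linear`, `quarter_sine`, `half_sine`), the function names, and the handling of very short fades are assumptions for illustration; the commit itself does not list them.

```python
import math


def fade_in_curve(length, shape="linear"):
    """Gain curve rising from 0 to 1 over `length` samples (hypothetical helper)."""
    if length < 2:
        return [1.0] * length  # too short to fade; leave samples unchanged
    curve = []
    for i in range(length):
        t = i / (length - 1)  # position in [0, 1]
        if shape == "linear":
            curve.append(t)
        elif shape == "quarter_sine":
            curve.append(math.sin(t * math.pi / 2))
        elif shape == "half_sine":
            curve.append(math.sin(t * math.pi - math.pi / 2) / 2 + 0.5)
        else:
            raise ValueError(f"unknown fade shape: {shape!r}")
    return curve


def apply_fade(samples, fade_in_len=0, fade_out_len=0, shape="linear"):
    """Apply a fade-in at the start and/or a fade-out at the end of `samples`."""
    n = len(samples)
    if fade_in_len > n or fade_out_len > n:
        raise ValueError("fade length exceeds signal length")
    gains = [1.0] * n
    for i, g in enumerate(fade_in_curve(fade_in_len, shape)):
        gains[i] *= g
    # The fade-out is the fade-in curve reversed, aligned to the end.
    for i, g in enumerate(reversed(fade_in_curve(fade_out_len, shape))):
        gains[n - fade_out_len + i] *= g
    return [s * g for s, g in zip(samples, gains)]
```

Setting either length to 0 disables that side, so the same function covers fading at the start only, at the end only, or both.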
  34. 28 Feb, 2020 1 commit
    • Add test for InverseMelScale (#448) · babc24af
      moto authored
      
      
      * Inverse Mel Scale Implementation
      
      * Inverse Mel Scale Docs
      
      * Better working version.
      
      * GPU fix
      
      * These shouldn't go on git..
      
      * Even better one, but does not support JITability.
      
      * Remove JITability test
      
      * Flake8
      
      * n_stft is a must
      
      * minor clean up of initialization
      
      * Add librosa consistency test
      
      This PR follows up #366 and adds tests for `InverseMelScale` (and `MelScale`) for librosa compatibility.
      
      For the `MelScale` compatibility test:
      1. Generate spectrogram
      2. Feed the spectrogram to `torchaudio.transforms.MelScale` instance
      3. Feed the spectrogram to `librosa.feature.melspectrogram` function.
      4. Compare the result from 2 and 3 elementwise.
      Element-wise numerical comparison is possible because, under the hood, both implementations use the same algorithm.
      
      The `InverseMelScale` compatibility test is more elaborate:
      1. Generate the original spectrogram
      2. Convert the original spectrogram to Mel scale using `torchaudio.transforms.MelScale` instance
      3. Reconstruct spectrogram using torchaudio implementation
      3.1. Feed the Mel spectrogram to `torchaudio.transforms.InverseMelScale` instance and get reconstructed spectrogram.
      3.2. Compute the sum of element-wise P1 distance of the original spectrogram and that from 3.1.
      4. Reconstruct spectrogram using librosa
      4.1. Feed the Mel spectrogram to `librosa.feature.inverse.mel_to_stft` function and get reconstructed spectrogram.
      4.2. Compute the sum of element-wise P1 distance of the original spectrogram and that from 4.1. (this is the reference.)
      5. Check that the resulting P1 distances fall in roughly the same value range.
      
      Element-wise numerical comparison is not possible because the two implementations use different algorithms to compute the inverse. Some values in the reconstructed spectrograms can differ in magnitude.
      Therefore the strategy here is to check that the P1 distance (reconstruction loss) is not far from the value obtained using `librosa`. The threshold for this check was chosen empirically.
      
      ```
      print('p1 dist (orig <-> ta):', torch.dist(spec_orig, spec_ta, p=1))
      print('p1 dist (orig <-> lr):', torch.dist(spec_orig, spec_lr, p=1))
      # Output:
      # p1 dist (orig <-> ta): tensor(1482.1917)
      # p1 dist (orig <-> lr): tensor(1420.7103)
      ```
      
      This value can vary based on the length and the kind of the signal being processed, so it was handpicked.
      
      * Address review feedback
      
      * Support arbitrary batch dimensions.
      
      * Add batch test
      
      * Use view for batch
      
      * fix sgd
      
      * Use negative indices and update docstring
      
      * Update threshold
      Co-authored-by: Charles J.Y. Yoon <jaeyeun97@gmail.com>
      babc24af
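The `InverseMelScale` comparison strategy described in this commit message (compare reconstruction losses rather than individual elements) can be sketched without any audio library. The helper names and the 10% relative tolerance below are hypothetical; the commit only says the threshold was chosen empirically. The two distances plugged in at the end are the ones printed in the message.

```python
def p1_distance(spec_a, spec_b):
    """Sum of element-wise absolute differences between two 2-D spectrograms."""
    if len(spec_a) != len(spec_b):
        raise ValueError("spectrograms must have the same shape")
    total = 0.0
    for row_a, row_b in zip(spec_a, spec_b):
        if len(row_a) != len(row_b):
            raise ValueError("spectrograms must have the same shape")
        total += sum(abs(x - y) for x, y in zip(row_a, row_b))
    return total


def roughly_same_range(dist, reference_dist, rel_tol=0.1):
    """True if `dist` lies within `rel_tol` of the reference distance.

    `rel_tol` is an illustrative value, not the threshold used in the PR.
    """
    return abs(dist - reference_dist) <= rel_tol * reference_dist


# Distances reported in the commit message: torchaudio vs. librosa reconstruction.
dist_ta = 1482.1917
dist_lr = 1420.7103
comparable = roughly_same_range(dist_ta, dist_lr)
```

The design choice is to treat the librosa reconstruction loss as the reference and only require the torchaudio loss to be of similar size, since the two inverses are computed by different algorithms.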
  35. 25 Feb, 2020 1 commit
  36. 26 Dec, 2019 1 commit