1. 22 Jul, 2020 1 commit
    • moto's avatar
      Add smoke tests to sox_io and sox_effects (#806) · daa0007a
      moto authored
      Currently all the tests in `sox_io_backend` and `sox_effects` (for new SoX effects implementation) requires additional `sox`, and this prevents running test in environment where `sox` command is not available even though `torchaudio` extension is available (such as fb internal). This PR adds smoke tests for these modules, which just runs functions to see if they do not crash.
      daa0007a
  2. 17 Jul, 2020 1 commit
    • Emmanouil Theofanis Chourdakis's avatar
      Changed GTZAN so that it only traverses filenames belonging to the dataset (#791) · 47eb1e6a
      Emmanouil Theofanis Chourdakis authored
      * Addressed review issues in PR #668
      
      * Changed GTZAN so that it only traverses filenames belonging to the dataset
      
      Now, instead of walking the whole directory and subdirectories of the dataset
      GTZAN only looks for files under a `genre`/`genre`.`5 digit number`.wav format, where `genre` is an allowed GTZAN genre label.
      This allows moving or removing files from the dataset (e.g. for fixing duplication or mislabeling issues).
      47eb1e6a
  3. 16 Jul, 2020 3 commits
  4. 12 Jul, 2020 1 commit
  5. 01 Jul, 2020 1 commit
  6. 10 Jun, 2020 1 commit
  7. 08 Jun, 2020 1 commit
  8. 04 Jun, 2020 2 commits
  9. 03 Jun, 2020 1 commit
  10. 02 Jun, 2020 1 commit
    • Emmanouil Theofanis Chourdakis's avatar
      Added the popular GTZAN dataset: (#668) · b0367251
      Emmanouil Theofanis Chourdakis authored
      
      
      * Added the popular GTZAN dataset:
      
      * Added the GTZAN class in torchaudio.datasets using the same format as the rest of the datasets.
      * Added the appropriate test function in test_datasets.py.
      * Added the GTZAN class in the datasets.rst documentation file.
      
      * Addressed review issues in PR #668
      
      * Added dummy noise .wav in `test/assets/`
      * Removed transforms of input and output from the dataset
        `__init__` function, as well as the corresponding methods.
      * Replaced rendundant `filtered` and `subset` methods from
        class initialization and also changed the corresponding
        assertion message.
      
      * Fixed E303: too many blank lines error
      
      * Added GTZAN to __init__.__all__
      
      * Fixed incorrectly not importing GTZAN
      
      * removed duplicate warning
      
      * lint
      Co-authored-by: default avatarVincent QB <vincentqb@users.noreply.github.com>
      b0367251
  11. 30 Apr, 2020 1 commit
  12. 28 Apr, 2020 1 commit
    • Artyom Astafurov's avatar
      Port sox::vad (#578) · 3ecc7016
      Artyom Astafurov authored
      * initial test, stub function, transform and docstring
      
      * add draft working implementation, update docstrings
      
      * merge VadSate into Vad calss, move Channel into Vad class
      
      * remove functional stub for vad
      
      * add wav file for test
      
      * refactor _measure() to improve performance
      
      * rename argument
      
      * replace copy_ with assignment
      
      * refactor init, update documentation, update test for readability
      
      * clean up default values
      
      * move code from transforms.py to funtional.py and integrate state into a function
      
      * remove Channel state class
      
      * fix calcuation of a flush point
      
      * make multiple channels work
      
      * clean up multi-channel, update test
      
      * rename variables and re-org arguments for _measure
      
      * fix linting errors
      
      * add torchscript consistency test and fix errors
      
      * support and test batch consistency, fix normalization
      
      * update documentation, switch torchscript consistancy test to use transform to improve coverage
      
      * fix linting errors
      
      * remove un-used imports
      
      * address PR comments
      
      * add doc references into rst
      3ecc7016
  13. 03 Apr, 2020 1 commit
    • Tomás Osório's avatar
      Fix common voice dataset (#498) · 9b288109
      Tomás Osório authored
      * fix download
      
      * fix reading tsv archive
      
      * add new languages
      
      * maintain same structure as other datasets
      
      * update CommonVoice Tests
      
      * fix
      
      * change directory name
      
      * remove extra line
      9b288109
  14. 30 Mar, 2020 1 commit
    • Vincent QB's avatar
      testing with sox only when sox is available (#419) · d63d851e
      Vincent QB authored
      * testing with sox only when sox is available.
      
      * use wav instead of mp3 for testing functions.
      
      * typo.
      
      * guard against not sox.
      
      * backends definition.
      
      * grouping backend functions into a separate file.
      
      * remove duplicated code.
      
      * requires sox.
      
      * replace by wav, requires sox.
      
      * require with scope.
      
      * undo alignment.
      
      * requires sox for these two, because of mp3.
      
      * no longer need first mp3.
      
      * cleaning.
      
      * new wav version of file.
      
      * flake8.
      
      * remove unnecessary load.
      
      * flake8.
      
      * lint.
      
      * lint.
      
      * revert formatting of file.
      
      * merging into common_utils.
      
      * docstring.
      
      * rename to common_utils.
      d63d851e
  15. 22 Feb, 2020 1 commit
    • Tomás Osório's avatar
      Adding Speech Command Dataset (#437) · 4d58bc46
      Tomás Osório authored
      
      
      * add speechcommand dataset and test
      
      * prepend the full path to each result
      
      * add missing param on docstring in walk_files
      
      * add file to run tests on SpeechCommand Dataset
      
      * reduce logic
      
      * update test on SpeechCommands
      
      * correct the indentation on docstring walk_files
      
      * flake8 compliance
      
      * change tuple type returned. move path split logic in load item.
      
      * typo in name.
      
      * redundant file path.
      
      * filter background noise.
      Co-authored-by: default avatarVincent QB <vincentqb@users.noreply.github.com>
      4d58bc46
  16. 20 Feb, 2020 1 commit
  17. 19 Dec, 2019 1 commit
    • Vincent QB's avatar
      Backend switch (#355) · 774ebc78
      Vincent QB authored
      * move sox inside function calls.
      
      * add backend switch mechanism.
      
      * import sox at runtime, not import.
      
      * add backend list.
      
      * backend tests.
      
      * creating hidden modules for backend.
      
      * naming backend same as file: soundfile.
      
      * remove docstring in backend file.
      
      * test soundfile info.
      
      * soundfile doesn't support int64.
      
      * adding test for wav file.
      
      * error with incorrect parameter instead of silent ignore.
      
      * adding test across backend. using float32 as done in sox.
      
      * backend guard decorator.
      774ebc78
  18. 06 Nov, 2019 1 commit
  19. 30 Oct, 2019 1 commit
  20. 29 Oct, 2019 1 commit
  21. 23 Oct, 2019 1 commit
  22. 18 Sep, 2019 1 commit
    • engineerchuan's avatar
      Make lfilter, and related filters, available (#275) · 8273c3f4
      engineerchuan authored
      * Add basic low pass filtering
      * Add highpass filtering
      * More tests of IIR vs FIR
      * Implement convolve function, add tests
      * Move lfilter and convolve into functional, more tests
      * added additional documentation for convolve and lfilter, renamed functional_filtering to functional_sox_convenience
      * Follow naming convention for sample rate in functional
      * fix failing vctk manifest test to account for adding more test audios into assets
      * Adding documentation for lfilter, biquad, highpass_biquad, lowpass_biquad
      * added matrix based implementation of lfilter
      * adding python lfilter implementation
      * factor out biquad, lowpass, highpass to sox compatibility
      8273c3f4
  23. 16 Aug, 2019 1 commit
  24. 16 Jul, 2019 2 commits
  25. 02 Jul, 2019 1 commit
  26. 18 Jun, 2019 1 commit
  27. 22 May, 2019 1 commit
  28. 08 May, 2018 2 commits
  29. 26 Apr, 2018 2 commits
  30. 12 Aug, 2017 1 commit
    • David Pollack's avatar
      basic transforms · 3433b9b6
      David Pollack authored
      vctk dataset
      
      add label processing
      
      added tests
      
      chunk dataset
      
      simple caching scheme
      
      caching fixed
      
      vctk downsample
      
      yesno dataset
      3433b9b6