1. 08 Jun, 2023 5 commits
    • moto's avatar
      Remove CCI badge from README (#3420) · a7fea8a6
      moto authored
      Summary:
      CI jobs are migrated from CCI to GHA
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3420
      
      Differential Revision: D46548562
      
      Pulled By: mthrok
      
      fbshipit-source-id: d7e17201e8b256efaa54543e445a0f139aa549b2
      a7fea8a6
    • moto's avatar
      Clean up CI scripts (#3407) · f0803152
      moto authored
      Summary:
      - Moving the unit test scripts from .circleci to .github
      - Remove docker file for unit test base
      - Use the Conda from Docker image in Linux jobs.
      
      Remaining follow-up items
      
      - Reuse the unittest script in Linux GPU job like done in Linux CPU job.
      
      The unit test script needs to be fixed to be used for Linux GPU job
      in new GHA workflow. Keeping it as a separate follow-up work item.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3407
      
      Differential Revision: D46498263
      
      Pulled By: mthrok
      
      fbshipit-source-id: d8256717a55bb4257151d819d3b2ebd453601eac
      f0803152
    • Kuba Rad's avatar
      Optimize Torchaudio Vad (#3382) · 1e117f57
      Kuba Rad authored
      Summary:
      Pull Request resolved: https://github.com/pytorch/audio/pull/3382
      
      The voice activity detector function was unoptimized, confusingly written, and buggy.
      
      The optimizations created here allow for the function to run roughly 17x faster.
      The main optimizations were to loop over windows of audio rather than individual audio samples. Reducing the number of copies also helped.
      
      There was an off by one error where the array slice referenced was [1: 16001] (for the default settings) instead of [0: 16000]
      
      Reviewed By: hwangjeff
      
      Differential Revision: D44749359
      
      fbshipit-source-id: c76c9412e70cdc6fcd527d113603c88f78480558
      1e117f57
    • moto's avatar
      Merge all the lint/style checks to pre-commit hook (#3414) · c3ca2562
      moto authored
      Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3414
      
      Differential Revision: D46536717
      
      Pulled By: mthrok
      
      fbshipit-source-id: 505bdcdd1b59ca9fe5afc2c8516a0a821e2b8d7e
      c3ca2562
    • atalman's avatar
      [Nova] Add cache ffmpeg before building (#3417) · 5ca03f42
      atalman authored
      Summary:
      [Nova] Add cache ffmpeg before building
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3417
      
      Reviewed By: mthrok
      
      Differential Revision: D46537892
      
      Pulled By: atalman
      
      fbshipit-source-id: 9f8dc0ecfc305c3b378557d46f89a5d7de67a165
      5ca03f42
  2. 07 Jun, 2023 2 commits
  3. 06 Jun, 2023 4 commits
  4. 05 Jun, 2023 1 commit
  5. 04 Jun, 2023 1 commit
  6. 03 Jun, 2023 1 commit
  7. 02 Jun, 2023 3 commits
    • moto's avatar
      [BC-Breaking] Remove compute_kaldi_pitch (#3368) · 5bbbb1d5
      moto authored
      Summary:
      This commit removes compute_kaldi_pitch function and the underlying Kaldi integration from torchaudio.
      
      Kaldi pitch function was added in a short period of time by integrating the original Kaldi implementation, instead of reimplementing it in PyTorch.
      
      The Kaldi integration employed a hack which replaces the base vector/matrix implementation of Kaldi with PyTorch Tensor so that there is only one blas library within torchaudio.
      
      Recently, we are making torchaudio more lean, and we don't see a wide adoption of kaldi_pitch feature, so we decided to remove them.
      
      See some of the discussion https://github.com/pytorch/audio/issues/1269
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3368
      
      Differential Revision: D46406176
      
      Pulled By: mthrok
      
      fbshipit-source-id: ee5e24d825188f379979ddccd680c7323b119b1e
      5bbbb1d5
    • moto's avatar
      Update data augmentation tutorial (#3375) · 2ba36b47
      moto authored
      Summary:
      Replace sox_effects with `torchaudio.io.AudioEffector`
      
      1. To show case the new and better feature
      2. To prepare for the upcoming removal of file-like support object
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3375
      
      Reviewed By: nateanl
      
      Differential Revision: D46379016
      
      Pulled By: mthrok
      
      fbshipit-source-id: 70f24b62494204949f327f6ac6c49f315c9ee315
      2ba36b47
    • Moto Hira's avatar
      Revert D46059199: [audio][PR] Use dlopen for FFmpeg · ab7a39f7
      Moto Hira authored
      Differential Revision:
      D46059199
      
      Original commit changeset: 4493a5fd8a4c
      
      Original Phabricator Diff: D46059199
      
      fbshipit-source-id: 71cde3f8cd870d1ad9114e3e87cdd1ba564441c0
      ab7a39f7
  8. 01 Jun, 2023 8 commits
    • moto's avatar
      Use dlopen for FFmpeg (#3353) · b14ced1a
      moto authored
      Summary:
      This commit changes the way FFmpeg extension is built and used.
      Instead of linking (LGPL) FFmpeg libraries to torchaudio at build time,
      It uses dlopen to search and link them at run time.
      
      For dlopen-ing, we use PyTorch's `at::DynamicLibrary` class, which provides
      portable wrapper.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3353
      
      Differential Revision: D46059199
      
      Pulled By: mthrok
      
      fbshipit-source-id: 4493a5fd8a4c802178d20276522f5334d637307d
      b14ced1a
    • moto's avatar
      [BC-breaking] Remove file-like object support from sox_io backend (#3035) · bc54ac8a
      moto authored
      Summary:
      This commit removes file-like obejct support so that we can remove custom patch
      
      The motivation and plan is outlined in https://github.com/pytorch/audio/issues/2950.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3035
      
      Reviewed By: hwangjeff
      
      Differential Revision: D44695647
      
      Pulled By: mthrok
      
      fbshipit-source-id: 13af0234e288c041bc7b490e1f967f85ce7eb8ec
      bc54ac8a
    • Omkar Salpekar's avatar
      [Nova] Deleting Remaining CircleCI jobs (#3399) · cc89f743
      Omkar Salpekar authored
      Summary:
      This job completely deletes the CircleCI `config.yml`. Here is what was remaining in the config at the point of deletion:
      
      Used Jobs:
      * **Lint** - Now running on Nova - see https://github.com/pytorch/audio/actions/runs/5144082942 for an example run on the latest PR in trunk
      * **CircleCI Consistency** - Not needed anymore now if there is no CCI config.
      
      Unused Jobs:
      * **build-ffmpeg-$OS** - For the build jobs, we are already building FFMPEG from source as part of the Nova workflows.
      * **download-third-parties** - This is caching. We currently do not have caching in Nova jobs, but atalman is working on adding support for this as a future optimization.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3399
      
      Reviewed By: mthrok
      
      Differential Revision: D46363921
      
      Pulled By: osalpekar
      
      fbshipit-source-id: 8abf5b0c1612c3492908fb2f5797e6b0a3c70766
      cc89f743
    • moto's avatar
      Fix style issue (#3398) · c7ac1aff
      moto authored
      Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3398
      
      Reviewed By: nateanl
      
      Differential Revision: D46354862
      
      Pulled By: mthrok
      
      fbshipit-source-id: b86dcdfeff8ed9db87b0b78eca20f6f18117e97e
      c7ac1aff
    • moto's avatar
      Fix apply_codec to use named file (#3397) · 1dfac469
      moto authored
      Summary:
      Follow-up https://github.com/pytorch/audio/issues/3386 The intended change was to use path of temporary file, instead of file-like object
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3397
      
      Reviewed By: hwangjeff
      
      Differential Revision: D46346189
      
      Pulled By: mthrok
      
      fbshipit-source-id: 44da799c6587bcb63a118a6313b7299bad742a40
      1dfac469
    • moto's avatar
      Refactor arg mapping in ffmpeg save function (#3387) · b99e5f46
      moto authored
      Summary:
      The arguments of TorchAudio's save function ("format", "bits_per_sample" and "encoding")
      are not one-to-one mapping to the arguments of FFmpeg encoding.
      
      For example, to use vorbis codec, FFmpeg expects "ogg" container/extension with "vorbis"
      encoder. It does not recognize "vorbis" extension like TorchAudio (libsox) does.
      
      This commit refactors the logic to parse/map the arguments.
      
      As a result it now properly works with vorbis and mp3 extension.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3387
      
      Reviewed By: hwangjeff
      
      Differential Revision: D46328787
      
      Pulled By: mthrok
      
      fbshipit-source-id: 36f993952a062bfec58a8b51be6aa86297571f90
      b99e5f46
    • moto's avatar
      Update and deprecate apply_codec function (#3386) · d6dd497c
      moto authored
      Summary:
      To prepare for the upcoming removal of file-like object support from sox_io backend,
      this commit changes apply_codec function to use tempfile.
      
      `apply_codec` function is now deprecated and users are encourated to use `torchaudio.io.AudioEffector`.
      We will not remove the function itself, but will remove the entry from the doc.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3386
      
      Reviewed By: hwangjeff
      
      Differential Revision: D46330610
      
      Pulled By: mthrok
      
      fbshipit-source-id: 3071bdefa05b4cbb9f00629bef50f0981eae89b4
      d6dd497c
    • Omkar Salpekar's avatar
      Delete CCI Linux and MacOS Unittest Jobs (#3391) · d5d94b7e
      Omkar Salpekar authored
      Summary:
      Deprecates the Linux and MacOS Unittest jobs now that they've been running on Nova for over a week.
      
      Aside: There was also a stylecheck job that was dependent on the Linux Unittest job. I also put up https://github.com/pytorch/audio/pull/3390 to move that stylecheck job to Nova. I'm happy to reintroduce the CCI stylecheck job standalone in CCI if we want the Nova version to run on main for a week.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3391
      
      Reviewed By: mthrok
      
      Differential Revision: D46324198
      
      Pulled By: osalpekar
      
      fbshipit-source-id: 2115748e153c5dee1a38db2b6230acebc4f56927
      d5d94b7e
  9. 31 May, 2023 6 commits
  10. 30 May, 2023 3 commits
  11. 29 May, 2023 1 commit
  12. 27 May, 2023 1 commit
    • moto's avatar
      Fix AudioEffector for mulaw (#3372) · af932cc7
      moto authored
      Summary:
      When encoding audio with mulaw, the resulting data does not have header, and the StreamReader defaults to 16k Hz, which can strech/shrink the resulting waveform.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3372
      
      Reviewed By: hwangjeff
      
      Differential Revision: D46234772
      
      Pulled By: mthrok
      
      fbshipit-source-id: 942c89a8cfe29b0b6f57b3e5b6c9dfd3524ca552
      af932cc7
  13. 26 May, 2023 4 commits