- 08 Jun, 2023 5 commits
-
-
moto authored
Summary: CI jobs are migrated from CCI to GHA Pull Request resolved: https://github.com/pytorch/audio/pull/3420 Differential Revision: D46548562 Pulled By: mthrok fbshipit-source-id: d7e17201e8b256efaa54543e445a0f139aa549b2
-
moto authored
Summary: - Move the unit test scripts from .circleci to .github - Remove the Docker file for the unit test base - Use Conda from the Docker image in Linux jobs. Remaining follow-up items: - Reuse the unit test script in the Linux GPU job, as is done in the Linux CPU job. The script needs to be fixed before it can be used for the Linux GPU job in the new GHA workflow, so this is kept as a separate follow-up work item. Pull Request resolved: https://github.com/pytorch/audio/pull/3407 Differential Revision: D46498263 Pulled By: mthrok fbshipit-source-id: d8256717a55bb4257151d819d3b2ebd453601eac
-
Kuba Rad authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3382 The voice activity detector function was unoptimized, confusingly written, and buggy. The optimizations made here allow the function to run roughly 17x faster. The main optimization was to loop over windows of audio rather than individual audio samples. Reducing the number of copies also helped. There was an off-by-one error where the array slice referenced was [1:16001] (for the default settings) instead of [0:16000]. Reviewed By: hwangjeff Differential Revision: D44749359 fbshipit-source-id: c76c9412e70cdc6fcd527d113603c88f78480558
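The off-by-one slice and the window-based iteration described in the summary above can be illustrated with a minimal sketch in plain Python. The helper `frame_signal` and its `window_size` parameter are hypothetical names chosen for illustration and are not taken from the torchaudio implementation; non-overlapping windows are an assumption, and the length 16000 is the default-settings value quoted in the summary.

```python
def frame_signal(samples, window_size):
    """Split `samples` into consecutive, non-overlapping windows.

    Hypothetical helper for illustration only; the real VAD code differs.
    """
    return [
        samples[start:start + window_size]
        for start in range(0, len(samples) - window_size + 1, window_size)
    ]


samples = list(range(32000))

# Buggy slice: skips sample 0 and reaches one sample too far.
buggy = samples[1:16001]
# Corrected slice: the first 16000 samples, starting at index 0.
fixed = samples[0:16000]

# Iterating over whole windows instead of individual samples.
windows = frame_signal(samples, 16000)
```

Both slices have the same length, which is why this kind of error is easy to miss: only the starting sample differs.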
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3414 Differential Revision: D46536717 Pulled By: mthrok fbshipit-source-id: 505bdcdd1b59ca9fe5afc2c8516a0a821e2b8d7e
-
atalman authored
Summary: [Nova] Add cache ffmpeg before building Pull Request resolved: https://github.com/pytorch/audio/pull/3417 Reviewed By: mthrok Differential Revision: D46537892 Pulled By: atalman fbshipit-source-id: 9f8dc0ecfc305c3b378557d46f89a5d7de67a165
-
- 07 Jun, 2023 2 commits
-
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3415 Differential Revision: D46526437 Pulled By: mthrok fbshipit-source-id: f78d19c19d7e68f67712412de35d9ed50f47263b
-
moto authored
Summary: To investigate https://github.com/pytorch/audio/issues/3411 Pull Request resolved: https://github.com/pytorch/audio/pull/3418 Differential Revision: D46535891 Pulled By: mthrok fbshipit-source-id: b90bba399eb54f9f0ae073bd590cd8a46054ed7e
-
- 06 Jun, 2023 4 commits
-
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3410 Differential Revision: D46496786 Pulled By: mthrok fbshipit-source-id: e517b273c40b340f39ce7db7ab1be1c3eb5f2059
-
atalman authored
Summary: We are using Project Nova workflows now. These are not required. Same as: https://github.com/pytorch/vision/pull/7656 Pull Request resolved: https://github.com/pytorch/audio/pull/3409 Reviewed By: mthrok Differential Revision: D46494331 Pulled By: atalman fbshipit-source-id: a642ae55b75482918e0afb7c55dc876bc8356e70
-
Moto Hira authored
Differential Revision: D46126226 Original commit changeset: 42cb52b19d91 Original Phabricator Diff: D46126226 fbshipit-source-id: 372b2526d9e196e37e014f1556bf117d29bb1ac6
-
Zhaoheng Ni authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3365 The current design of forced_align accepts a 2D Tensor for `log_probs` and a 1D Tensor for `targets`. To keep the API simple, this PR changes it to support only batched Tensors (a 3D Tensor for `log_probs` and a 2D Tensor for `targets`). Reviewed By: vineelpratap Differential Revision: D46126226 fbshipit-source-id: 42cb52b19d91bbff7dc040ccf60350545d75b3a2
-
- 05 Jun, 2023 1 commit
-
-
moto authored
Summary: Follow up of: https://github.com/pytorch/audio/pull/3368 Remove files and lines no longer used. Pull Request resolved: https://github.com/pytorch/audio/pull/3403 Differential Revision: D46441462 Pulled By: mthrok fbshipit-source-id: 11b881ec4b24fa0d625c6aee9f4bd91f637f9923
-
- 04 Jun, 2023 1 commit
-
-
Zhaoheng Ni authored
Summary: There are some BC-breaking changes between the pytorch_lightning and lightning libraries. This PR adapts to those changes to support the latest lightning library. Pull Request resolved: https://github.com/pytorch/audio/pull/3396 Reviewed By: mthrok Differential Revision: D46345206 Pulled By: nateanl fbshipit-source-id: 59469c15dc5fe5466a99a5b5380eb4f98c2c633f
-
- 03 Jun, 2023 1 commit
-
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3402 This is a second attempt at https://github.com/pytorch/audio/pull/3353. The basic logic to enable dlopen for FFmpeg libraries is the same. It uses `at::DynamicLibrary`, which allows torchaudio to be compiled without linking FFmpeg libraries. This time, a DLOPEN_FFMPEG option has been added to enable the feature, so that users can disable it and keep using build-time linking. Please refer to stub.h for more technical detail. Differential Revision: D46403783 fbshipit-source-id: ca3db57ff6bdc50c8c225d22f12f3e76c6dc3f16
-
- 02 Jun, 2023 3 commits
-
-
moto authored
Summary: This commit removes the compute_kaldi_pitch function and the underlying Kaldi integration from torchaudio. The Kaldi pitch function was added in a short period of time by integrating the original Kaldi implementation, instead of reimplementing it in PyTorch. The Kaldi integration employed a hack which replaces the base vector/matrix implementation of Kaldi with PyTorch Tensor so that there is only one BLAS library within torchaudio. We are currently making torchaudio leaner, and the kaldi_pitch feature has not seen wide adoption, so we decided to remove both. See some of the discussion at https://github.com/pytorch/audio/issues/1269 Pull Request resolved: https://github.com/pytorch/audio/pull/3368 Differential Revision: D46406176 Pulled By: mthrok fbshipit-source-id: ee5e24d825188f379979ddccd680c7323b119b1e
-
moto authored
Summary: Replace sox_effects with `torchaudio.io.AudioEffector` 1. To showcase the new and better feature 2. To prepare for the upcoming removal of file-like object support Pull Request resolved: https://github.com/pytorch/audio/pull/3375 Reviewed By: nateanl Differential Revision: D46379016 Pulled By: mthrok fbshipit-source-id: 70f24b62494204949f327f6ac6c49f315c9ee315
-
Moto Hira authored
Differential Revision: D46059199 Original commit changeset: 4493a5fd8a4c Original Phabricator Diff: D46059199 fbshipit-source-id: 71cde3f8cd870d1ad9114e3e87cdd1ba564441c0
-
- 01 Jun, 2023 8 commits
-
-
moto authored
Summary: This commit changes the way the FFmpeg extension is built and used. Instead of linking (LGPL) FFmpeg libraries to torchaudio at build time, it uses dlopen to search for and link them at run time. For dlopen-ing, we use PyTorch's `at::DynamicLibrary` class, which provides a portable wrapper. Pull Request resolved: https://github.com/pytorch/audio/pull/3353 Differential Revision: D46059199 Pulled By: mthrok fbshipit-source-id: 4493a5fd8a4c802178d20276522f5334d637307d
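The difference between build-time linking and run-time loading described in the summary above can be sketched by analogy in Python, whose standard `ctypes` module loads shared libraries at run time. This is not the torchaudio mechanism, which is C++ code built on `at::DynamicLibrary`; the helper name `load_optional_library` is hypothetical.

```python
import ctypes
import ctypes.util


def load_optional_library(name):
    """Try to load a shared library at run time; return None if unavailable.

    Hypothetical helper: an analogy for run-time loading, not torchaudio code.
    """
    # Search the platform's usual library locations for `name`.
    path = ctypes.util.find_library(name)
    if path is None:
        return None
    try:
        return ctypes.CDLL(path)
    except OSError:
        # The file was found but could not be loaded.
        return None


# A missing library is reported at run time rather than failing the build.
missing = load_optional_library("surely_missing_library_xyz123")
```

The design trade-off is that a missing dependency surfaces when the feature is first used rather than when the package is built or imported.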
-
moto authored
Summary: This commit removes file-like object support so that we can remove the custom patch. The motivation and plan are outlined in https://github.com/pytorch/audio/issues/2950. Pull Request resolved: https://github.com/pytorch/audio/pull/3035 Reviewed By: hwangjeff Differential Revision: D44695647 Pulled By: mthrok fbshipit-source-id: 13af0234e288c041bc7b490e1f967f85ce7eb8ec
-
Omkar Salpekar authored
Summary: This job completely deletes the CircleCI `config.yml`. Here is what was remaining in the config at the point of deletion: Used Jobs: * **Lint** - Now running on Nova - see https://github.com/pytorch/audio/actions/runs/5144082942 for an example run on the latest PR in trunk * **CircleCI Consistency** - Not needed anymore now if there is no CCI config. Unused Jobs: * **build-ffmpeg-$OS** - For the build jobs, we are already building FFMPEG from source as part of the Nova workflows. * **download-third-parties** - This is caching. We currently do not have caching in Nova jobs, but atalman is working on adding support for this as a future optimization. Pull Request resolved: https://github.com/pytorch/audio/pull/3399 Reviewed By: mthrok Differential Revision: D46363921 Pulled By: osalpekar fbshipit-source-id: 8abf5b0c1612c3492908fb2f5797e6b0a3c70766
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3398 Reviewed By: nateanl Differential Revision: D46354862 Pulled By: mthrok fbshipit-source-id: b86dcdfeff8ed9db87b0b78eca20f6f18117e97e
-
moto authored
Summary: Follow-up https://github.com/pytorch/audio/issues/3386 The intended change was to use path of temporary file, instead of file-like object Pull Request resolved: https://github.com/pytorch/audio/pull/3397 Reviewed By: hwangjeff Differential Revision: D46346189 Pulled By: mthrok fbshipit-source-id: 44da799c6587bcb63a118a6313b7299bad742a40
-
moto authored
Summary: The arguments of TorchAudio's save function ("format", "bits_per_sample" and "encoding") do not map one-to-one to the arguments of FFmpeg encoding. For example, to use the vorbis codec, FFmpeg expects the "ogg" container/extension with the "vorbis" encoder. It does not recognize a "vorbis" extension like TorchAudio (libsox) does. This commit refactors the logic to parse/map the arguments. As a result it now works properly with the vorbis and mp3 extensions. Pull Request resolved: https://github.com/pytorch/audio/pull/3387 Reviewed By: hwangjeff Differential Revision: D46328787 Pulled By: mthrok fbshipit-source-id: 36f993952a062bfec58a8b51be6aa86297571f90
-
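The vorbis example in the preceding commit summary (PR 3387) can be sketched as a small lookup. The function `resolve_container_and_encoder` is a hypothetical name, not the refactored torchaudio code, and only the "vorbis" entry comes from the summary; the pass-through default for other formats is an assumption.

```python
def resolve_container_and_encoder(format_name):
    """Map a save-style format name to a (container, encoder) pair.

    Hypothetical helper; only the "vorbis" entry comes from the commit summary.
    An encoder of None means "let the container pick its default encoder".
    """
    overrides = {
        # FFmpeg expects the "ogg" container with the "vorbis" encoder.
        "vorbis": ("ogg", "vorbis"),
    }
    return overrides.get(format_name, (format_name, None))


vorbis_pair = resolve_container_and_encoder("vorbis")
wav_pair = resolve_container_and_encoder("wav")
```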
moto authored
Summary: To prepare for the upcoming removal of file-like object support from the sox_io backend, this commit changes the apply_codec function to use tempfile. The `apply_codec` function is now deprecated and users are encouraged to use `torchaudio.io.AudioEffector`. We will not remove the function itself, but will remove the entry from the doc. Pull Request resolved: https://github.com/pytorch/audio/pull/3386 Reviewed By: hwangjeff Differential Revision: D46330610 Pulled By: mthrok fbshipit-source-id: 3071bdefa05b4cbb9f00629bef50f0981eae89b4
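The idea of passing a temporary file path instead of a file-like object, as described in the summary above, can be sketched with the standard library alone. Here Python's `wave` module stands in for the real codec, and `roundtrip_through_tempfile` is a hypothetical name; this is not the `apply_codec` implementation.

```python
import os
import tempfile
import wave


def roundtrip_through_tempfile(pcm_bytes, sample_rate):
    """Write 16-bit mono PCM to a temporary WAV file, then read it back by path.

    Hypothetical helper; `wave` is a stand-in for the actual encoder/decoder.
    """
    with tempfile.TemporaryDirectory() as tmpdir:
        path = os.path.join(tmpdir, "audio.wav")
        with wave.open(path, "wb") as writer:
            writer.setnchannels(1)
            writer.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
            writer.setframerate(sample_rate)
            writer.writeframes(pcm_bytes)
        # The reader only needs the path, not an in-memory file object.
        with wave.open(path, "rb") as reader:
            return reader.readframes(reader.getnframes()), reader.getframerate()


data = b"\x00\x01" * 100
decoded, rate = roundtrip_through_tempfile(data, 16000)
```

The temporary directory is removed when the `with` block exits, so no file is left behind.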
-
Omkar Salpekar authored
Summary: Deprecates the Linux and MacOS Unittest jobs now that they've been running on Nova for over a week. Aside: There was also a stylecheck job that was dependent on the Linux Unittest job. I also put up https://github.com/pytorch/audio/pull/3390 to move that stylecheck job to Nova. I'm happy to reintroduce the stylecheck job as a standalone job in CCI if we want the Nova version to run on main for a week. Pull Request resolved: https://github.com/pytorch/audio/pull/3391 Reviewed By: mthrok Differential Revision: D46324198 Pulled By: osalpekar fbshipit-source-id: 2115748e153c5dee1a38db2b6230acebc4f56927
-
- 31 May, 2023 6 commits
-
-
Omkar Salpekar authored
Summary: Introducing the stylecheck job on Nova. It seems like it is failing on trunk, but the functionality of this job itself is working and it fails with the same error as it does on trunk with CCI. Pull Request resolved: https://github.com/pytorch/audio/pull/3390 Reviewed By: mthrok Differential Revision: D46324223 Pulled By: osalpekar fbshipit-source-id: 1324202e53569d610559ef6f1b90cb5c364e6909
-
Omkar Salpekar authored
Summary: See title. If all is well, we can deprecate the CCI job in a few days. Pull Request resolved: https://github.com/pytorch/audio/pull/3341 Reviewed By: mthrok Differential Revision: D46324265 Pulled By: osalpekar fbshipit-source-id: bc706c6ae4285d4085dc5f0223ea41d8fc290f1c
-
moto authored
Summary: Set the directory of the JUnit test XML file to the one that test-infra picks up, so that the results appear in the summary. Example: https://github.com/pytorch/audio/actions/runs/5136305988 Pull Request resolved: https://github.com/pytorch/audio/pull/3394 Differential Revision: D46328832 Pulled By: mthrok fbshipit-source-id: f0b5020a911ca4ec09345a965bdec769300859f0
-
atalman authored
Summary: Nova - Deprecate windows circleci unit tests Pull Request resolved: https://github.com/pytorch/audio/pull/3393 Reviewed By: malfet Differential Revision: D46315608 Pulled By: atalman fbshipit-source-id: 3d7b5d0618b9d2e12e5f97e21d7becdc61d85c69
-
atalman authored
Summary: Windows GPU workflows Pull Request resolved: https://github.com/pytorch/audio/pull/3364 Reviewed By: mthrok Differential Revision: D46292403 Pulled By: atalman fbshipit-source-id: ee3c6f8082ca77bdc1ffdb930c59fa5a9cb25a4a
-
Jeff Hwang authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3379 Fixes `RNNTBeamSearch.infer`'s docstring and removes unused import from tutorial. Reviewed By: mthrok Differential Revision: D46227174 fbshipit-source-id: 7c1c3f05a6476cb0437622dea6f3ae6cb3ea9468
-
- 30 May, 2023 3 commits
-
-
atalman authored
Summary: Disable failing GPU unit test. See associated issue: https://github.com/pytorch/audio/issues/3376 Pull Request resolved: https://github.com/pytorch/audio/pull/3384 Reviewed By: mthrok Differential Revision: D46279324 Pulled By: atalman fbshipit-source-id: 3a606bb992e0261451f48d1fb458e054f7fd5583
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3389 Use const references more widely in the sox source code. Differential Revision: D46264068 fbshipit-source-id: 809d34a6e16f621c856d4278ef7ce45a5868a717
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3383 This commit consolidates the `torchaudio::sox_*` namespaces into `torchaudio::sox`. It also puts the Pybind11 registration and TorchBind registration into an anonymous namespace. Differential Revision: D46257367 fbshipit-source-id: 0f0f181eaa72036916e223263daf4b7c298fca0d
-
- 29 May, 2023 1 commit
-
-
Omkar Salpekar authored
Summary: Continuing with the job migrations from CCI to Nova, this PR introduces the Windows CPU Unittest job as a Nova workflow. The job is passing: https://github.com/pytorch/audio/actions/runs/5094569687/jobs/9159020192?pr=3329. Pull Request resolved: https://github.com/pytorch/audio/pull/3329 Reviewed By: huydhn Differential Revision: D46265649 Pulled By: atalman fbshipit-source-id: 7659dfbcc8ad400f2e109ff64530e1f768e82ef9
-
- 27 May, 2023 1 commit
-
-
moto authored
Summary: When encoding audio with mulaw, the resulting data does not have a header, and the StreamReader defaults to 16k Hz, which can stretch/shrink the resulting waveform. Pull Request resolved: https://github.com/pytorch/audio/pull/3372 Reviewed By: hwangjeff Differential Revision: D46234772 Pulled By: mthrok fbshipit-source-id: 942c89a8cfe29b0b6f57b3e5b6c9dfd3524ca552
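Why a wrong default sample rate stretches or shrinks headerless audio, as described in the summary above, follows from simple arithmetic: duration is the sample count divided by the sample rate. In the sketch below the 8000 Hz source rate is a hypothetical value chosen for illustration; only the 16k Hz default comes from the summary.

```python
def playback_duration(num_samples, assumed_sample_rate):
    """Duration in seconds when `num_samples` are interpreted at a given rate."""
    return num_samples / assumed_sample_rate


num_samples = 8000  # one second of audio, assuming it was recorded at 8000 Hz

true_duration = playback_duration(num_samples, 8000)
# Without a header, the decoder falls back to 16000 Hz and misreads the length.
misread_duration = playback_duration(num_samples, 16000)
```

Under these assumptions the misread waveform lasts half as long as the original, which is the shrinking effect the commit addresses.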
-
- 26 May, 2023 4 commits
-
-
moto authored
Summary: g722 format only supports 16k Hz, but AVCodec does not list this. The implementation does not insert resampling and the resulting audio can be slowed down or sped up. Pull Request resolved: https://github.com/pytorch/audio/pull/3373 Reviewed By: hwangjeff Differential Revision: D46233181 Pulled By: mthrok fbshipit-source-id: 902b3f862a8f7269dc35bc871e868b0e78326c6c
-
Huy Do authored
Summary: CUDA 11.7 uses cuDNN 8.5.0; 11.8 uses 8.7.0; 12.1 uses 8.8.1. Otherwise, the Windows vision job (8.5.0) would overwrite the cuDNN version set up by PyTorch (8.7.0), leading to flaky failures such as https://github.com/pytorch/pytorch/actions/runs/5088860652/jobs/9146641450 ``` RuntimeError: cuDNN version incompatibility: PyTorch was compiled against (8, 7, 0) but found runtime version (8, 5, 0). PyTorch already comes bundled with cuDNN. One option to resolving this error is to ensure PyTorch can find the bundled cuDNN. ``` Pull Request resolved: https://github.com/pytorch/audio/pull/3380 Reviewed By: atalman Differential Revision: D46236286 Pulled By: huydhn fbshipit-source-id: 9ca12d5068c3029688347d52c5c284488f33728d
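The version pairing listed in the summary above can be written as a lookup table. This is a minimal sketch; the names `CUDNN_FOR_CUDA` and `cudnn_version_for` are hypothetical and do not come from the CI scripts, while the three version pairs are the ones quoted in the summary.

```python
# CUDA version -> cuDNN version, as listed in the commit summary.
CUDNN_FOR_CUDA = {
    "11.7": "8.5.0",
    "11.8": "8.7.0",
    "12.1": "8.8.1",
}


def cudnn_version_for(cuda_version):
    """Return the cuDNN version paired with a CUDA version (hypothetical helper)."""
    try:
        return CUDNN_FOR_CUDA[cuda_version]
    except KeyError:
        raise ValueError(
            f"no cuDNN version recorded for CUDA {cuda_version}"
        ) from None


selected = cudnn_version_for("11.8")
```

Failing loudly on an unknown CUDA version avoids silently installing a mismatched cuDNN, which is the kind of error the quoted RuntimeError reports.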
-
atalman authored
Summary: Use CUDA 11.8 for CircleCI tests; 11.7 was deprecated. Pull Request resolved: https://github.com/pytorch/audio/pull/3381 Reviewed By: osalpekar Differential Revision: D46236223 Pulled By: atalman fbshipit-source-id: 6d6a8e09603807a07241f31c1bd1e6d3a2b67d9d
-
Zhaoheng Ni authored
Summary: The tests failed for several bundles. Remove them for now; they will be re-added once the root cause is identified. Pull Request resolved: https://github.com/pytorch/audio/pull/3378 Reviewed By: atalman Differential Revision: D46230884 Pulled By: nateanl fbshipit-source-id: 42056a29b2ec2335268b273d3e37fb517035be92
-