- 16 Feb, 2023 2 commits
-
-
moto authored
Summary: Flashlight Text decoder is now available on PyPI and KenLM support is being added at https://github.com/flashlight/text/pull/43 Once this work is merged, we can rely on the official distribution of Flashlight Text package, so we are adding deprecation warning. Once the decoder is fully available, one can install it with ``` pip install flashlight-text pip install git+https://github.com/kpu/kenlm.git ``` Pull Request resolved: https://github.com/pytorch/audio/pull/3055 Reviewed By: hwangjeff, nateanl Differential Revision: D43239150 Pulled By: mthrok fbshipit-source-id: 728cb208b8403100cd4ccd80c6295d454756b414
-
hwangjeff authored
Summary: Adds I/O backend dispatcher that routes I/O requests to FFmpeg, SoX, or Soundfile backend, per library availability. It allows users to specify a backend mapped to a media library, i.e. one of `["ffmpeg", "sox", "soundfile"]`, to use via keyword argument, with FFmpeg being the default. Environment variable `TORCHAUDIO_USE_BACKEND_DISPATCHER` gates enablement of the dispatcher; specifically, if `TORCHAUDIO_USE_BACKEND_DISPATCHER` is explicitly set to `1`, importing TorchAudio makes it accessible via `torchaudio.info`, `torchaudio.load`, and `torchaudio.save`. Pull Request resolved: https://github.com/pytorch/audio/pull/3015 Reviewed By: mthrok Differential Revision: D43258649 Pulled By: hwangjeff fbshipit-source-id: 8f12e4e56b9fa3f0814dd3fed3e1783ab23a53a1
-
- 15 Feb, 2023 5 commits
-
-
Cole Li authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3056 Task #2 from https://github.com/pytorch/audio/issues/2835 Reviewed By: mthrok Differential Revision: D42854156 fbshipit-source-id: e1b3bd992c91fedc55f30a814e16efd7c51e0c80
-
hwangjeff authored
Summary: Relaxes input dimension matching constraint on `convolve` to enable broadcasting for inputs. Pull Request resolved: https://github.com/pytorch/audio/pull/3061 Reviewed By: mthrok Differential Revision: D43298078 Pulled By: hwangjeff fbshipit-source-id: a6cc36674754523b88390fac0a05f06562921319
-
Jeff Hwang authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3058 Adds FFmpeg-based save function. Reviewed By: mthrok Differential Revision: D43264858 fbshipit-source-id: ae3f89012bc2520f3de11af65348ba8f77f0acff
-
hwangjeff authored
Summary: Updates tutorial "Audio Data Augmentation" to use two of the newly introduced data augmentation operators in beta: `torchaudio.functional.fftconvolve` and `torchaudio.functional.add_noise`. Pull Request resolved: https://github.com/pytorch/audio/pull/3062 Reviewed By: mthrok Differential Revision: D43298120 Pulled By: hwangjeff fbshipit-source-id: 09ca736a5c67242568515d600b7d31eab32c2df1
-
moto authored
Summary: * Mention context manager in StreamWriter * Add FFmpeg as optional dependency Pull Request resolved: https://github.com/pytorch/audio/pull/3064 Reviewed By: hwangjeff Differential Revision: D43307818 Pulled By: mthrok fbshipit-source-id: 86339d973aba85e090f520e08af65b5d736e3d18
-
- 14 Feb, 2023 4 commits
-
-
Omkar Salpekar authored
Summary: Add triggers for RC branches and tags to all build workflows. This will ensure that the release-candidate builds will run with `CHANNEL=test`. Pull Request resolved: https://github.com/pytorch/audio/pull/3057 Reviewed By: atalman Differential Revision: D43279657 Pulled By: osalpekar fbshipit-source-id: 5abf3994b9b4a4897f53c540bd1db6c3d624b3e0
-
Zhaoheng Ni authored
Summary: - Rename the current `ssl` example to `self_supervised_learning` - Add README to demonstrate how to run the recipe with hubert task Pull Request resolved: https://github.com/pytorch/audio/pull/3060 Reviewed By: mthrok Differential Revision: D43287868 Pulled By: nateanl fbshipit-source-id: 10352682485ef147ca32f4c4c9f9cde995444aa0
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3053 Reviewed By: nateanl Differential Revision: D43238766 Pulled By: mthrok fbshipit-source-id: 4f82878b1c97b0e6a35af75855849b86200e6061
-
Zhaoheng Ni authored
Summary: replicate of https://github.com/pytorch/audio/issues/2644 Pull Request resolved: https://github.com/pytorch/audio/pull/2880 Reviewed By: mthrok Differential Revision: D41633911 Pulled By: nateanl fbshipit-source-id: 73cf145d75c389e996aafe96571ab86dc21f86e5
-
- 11 Feb, 2023 1 commit
-
-
moto authored
Summary: Par https://github.com/pytorch/audio/issues/3040 and https://github.com/pytorch/audio/issues/3041, it turned out Google Colab now has FFmpeg with GPU decoder/encoder preinstalled, and installing FFmpeg manually corrups the environment. This commit updates the tutorial by extracting and moving the how-to-install part to installation/build section. closes https://github.com/pytorch/audio/issues/3041 closes https://github.com/pytorch/audio/issues/3040 Pull Request resolved: https://github.com/pytorch/audio/pull/3050 Reviewed By: nateanl Differential Revision: D43166054 Pulled By: mthrok fbshipit-source-id: 32667f292a796344d5fcde86e8231e15ad904e58
-
- 10 Feb, 2023 1 commit
-
-
Wei Wang authored
Summary: So far Linux and MacOS were tested to work fine out of the box. This PR is created to verify this -- disabled windows jobs and configs for now. Pull Request resolved: https://github.com/pytorch/audio/pull/3039 Reviewed By: osalpekar Differential Revision: D43174745 Pulled By: weiwangmeta fbshipit-source-id: 81766905256e03c5a01cb5448a350f5d409ca4b8
-
- 09 Feb, 2023 3 commits
-
-
moto authored
Summary: - Add documentation - Tweak docsrting - Fix import Pull Request resolved: https://github.com/pytorch/audio/pull/3051 Reviewed By: weiwangmeta, atalman, nateanl Differential Revision: D43166081 Pulled By: mthrok fbshipit-source-id: 7d77aa34a6318a64824626cff8372f8b9aebf6f9
-
moto authored
Summary: Commit b4c66d1f broke all the CIs. The new policy changes the timestamp of configuration files of third party libraries, which triggers re-configuration which requires extra tools. This commit fixes it by reverting the old behavior. Also this adds guard for older cmake versions. Pull Request resolved: https://github.com/pytorch/audio/pull/3046 Reviewed By: atalman Differential Revision: D43133536 Pulled By: mthrok fbshipit-source-id: 357055c8c1b53e593b8b7880f2045e13512c7a8f
-
DanilBaibak authored
Summary: We don't need the presence of physical HW to compile with CUDA. This is a follow up PR regarding `USE_ROCM` for issue https://github.com/pytorch/audio/issues/2979. Pull Request resolved: https://github.com/pytorch/audio/pull/3008 Reviewed By: malfet Differential Revision: D42708862 Pulled By: DanilBaibak fbshipit-source-id: 90cedc80a2d180ca1e0912ad5b644398182417b8
-
- 08 Feb, 2023 4 commits
-
-
moto authored
Summary: Instead of raising an error when lazy import happens, this method allows to import features, and raises an error when the feature is being used. This makes it easy to adopt the same error mechanism across different modules. It is how it's done for sox-related features. Pull Request resolved: https://github.com/pytorch/audio/pull/3028 Reviewed By: xiaohui-zhang Differential Revision: D42966976 Pulled By: mthrok fbshipit-source-id: 423dfe0b8a3970cd07f20e841c794c7f2809f993
-
moto authored
Summary: The first step to migrate doc build to GHA. Pull Request resolved: https://github.com/pytorch/audio/pull/3043 Reviewed By: xiaohui-zhang Differential Revision: D43110816 Pulled By: mthrok fbshipit-source-id: 91de5f3ac567188e7030f14c2827a202a1901f1a
-
moto authored
Summary: Currently, for each third party library checked out with ExternalProject_Add, the following warning is shown. This commit set the policy so that the warning is not shown. ``` CMake Warning (dev) at ci_env/lib/python3.10/site-packages/cmake/data/share/cmake-3.25/Modules/ExternalProject.cmake:3075 (message): The DOWNLOAD_EXTRACT_TIMESTAMP option was not given and policy CMP0135 is not set. The policy's OLD behavior will be used. When using a URL download, the timestamps of extracted files should preferably be that of the time of extraction, otherwise code that depends on the extracted contents might not be rebuilt if the URL changes. The OLD behavior preserves the timestamps from the archive instead, but this is usually not what you want. Update your project to the NEW behavior or specify the DOWNLOAD_EXTRACT_TIMESTAMP option with a value of true to avoid this robustness issue. ``` Pull Request resolved: https://github.com/pytorch/audio/pull/3044 Reviewed By: xiaohui-zhang Differential Revision: D43110818 Pulled By: mthrok fbshipit-source-id: d2e20c9fdbbeeedb5ad546fe32dbda28c5bdd431
-
DanilBaibak authored
Summary: Switch to Nova M1 Conda Pull Request resolved: https://github.com/pytorch/audio/pull/2908 Reviewed By: seemethere, osalpekar Differential Revision: D43093605 Pulled By: DanilBaibak fbshipit-source-id: 9e44f26cfb87e277c3808ee59f50218b4629e86e
-
- 07 Feb, 2023 2 commits
-
-
moto authored
Summary: Add a section about installation/build https://output.circle-artifacts.com/output/job/f121cd38-68f3-47a3-ac29-c7b0cfe94c77/artifacts/0/docs/installation.html <img width="1102" alt="Screenshot 2023-02-06 at 6 13 50 PM" src="https://user-images.githubusercontent.com/855818/217108551-622b117b-209e-4776-b5d6-d6934c8126a4.png"> https://output.circle-artifacts.com/output/job/f121cd38-68f3-47a3-ac29-c7b0cfe94c77/artifacts/0/docs/build.html <img width="1072" alt="Screenshot 2023-02-06 at 6 13 57 PM" src="https://user-images.githubusercontent.com/855818/217108568-c125cdc2-9d6a-4c1d-a155-2cee40c9dac6.png"> Pull Request resolved: https://github.com/pytorch/audio/pull/3038 Reviewed By: hwangjeff, nateanl Differential Revision: D43083469 Pulled By: mthrok fbshipit-source-id: e0b5b76dbf706552dd60ae26ea40ebc98627e3b0
-
juan.azcarreta.ortiz authored
Summary: Allows user to play audio through the device speaker. Pull Request resolved: https://github.com/pytorch/audio/pull/3026 Test Plan: Created a new test that mocks a call to the write audio chunk method from StreamWriter. To run the test: `pytest test/torchaudio_unittest/io/_playback_test.py` Reviewed By: mthrok Differential Revision: D43082062 Pulled By: jazcarretao fbshipit-source-id: 01a85b32ce925687a633d1208d15d54556e89dd8
-
- 06 Feb, 2023 1 commit
-
-
atalman authored
Summary: Switch circleci jobs from cu116 to cu117 Pull Request resolved: https://github.com/pytorch/audio/pull/3034 Reviewed By: DanilBaibak Differential Revision: D43042385 Pulled By: atalman fbshipit-source-id: 636e3d86d66a6091d13d731238550d800e77ccc8
-
- 04 Feb, 2023 2 commits
-
-
Tristan Rice authored
Summary: This adds 2 10 bit pix formats one for CPU and one for CUDA. This allows for training on HDR/10bit video datasets. Pull Request resolved: https://github.com/pytorch/audio/pull/3023 Test Plan: ```py r = StreamReader( reader, format='hevc', ) stream = r.add_video_stream( frames_per_chunk=-1, decoder="hevc_cuvid", hw_accel="cuda", ) frame = next(r.stream()) ``` ```py r = StreamReader( reader, format='hevc', ) stream = r.add_video_stream( frames_per_chunk=-1, filter_desc="format=rgb48le", ) frame = next(r.stream()) ```  Reviewed By: xiaohui-zhang Differential Revision: D43019191 Pulled By: mthrok fbshipit-source-id: fe4359e525b24c8b856dfdf3d2f8596871566350
-
Chin-Yun Yu authored
Summary: close https://github.com/pytorch/audio/issues/1408 . Pull Request resolved: https://github.com/pytorch/audio/pull/3018 Reviewed By: xiaohui-zhang Differential Revision: D42961853 Pulled By: mthrok fbshipit-source-id: b9f847986e0afe416e7817ce4790e42cc0f83ee1
-
- 03 Feb, 2023 1 commit
-
-
moto authored
Summary: Add GitHub Action-based GPU test jobs. - It seems that there is 2 hour upper cap so only running CUDA/GPU tests. - Since Kaldi related features are not available, they are disabled. Pull Request resolved: https://github.com/pytorch/audio/pull/3029 Reviewed By: hwangjeff Differential Revision: D42983800 Pulled By: mthrok fbshipit-source-id: 47fefe39c635d1c73ad6799ddacefd2666fe5403
-
- 02 Feb, 2023 2 commits
-
-
moto authored
Summary: If FFmpeg is not available, sox_io cannot fallback to it. In such case, we use a fallback function, just to raise an error with easy-to-understand message. Turned out that the number of arguments this function receives is wrong. This commit fixes it. Pull Request resolved: https://github.com/pytorch/audio/pull/3032 Reviewed By: hwangjeff Differential Revision: D42966930 Pulled By: mthrok fbshipit-source-id: c2c969c7f8db4119ae965a715d65c10f6ac6087c
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3033 Reviewed By: hwangjeff Differential Revision: D42966938 Pulled By: mthrok fbshipit-source-id: 4889735c244690889f02bf57212489ad333389f7
-
- 01 Feb, 2023 5 commits
-
-
atalman authored
Summary: Remove cuda 11.6 from CI replace with 11.7 Following the Release readme here: https://github.com/pytorch/pytorch/blob/master/RELEASE.md#release-compatibility-matrix Pull Request resolved: https://github.com/pytorch/audio/pull/3031 Reviewed By: mthrok Differential Revision: D42937626 Pulled By: atalman fbshipit-source-id: 7e01c56ec0eefbef9ad455d013fced9012febe82
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3027 To support older NumPy, removing `numpy.typing`. Reviewed By: nateanl Differential Revision: D42924428 fbshipit-source-id: af1a370b5baf00c63a088f172dbc2190d414bdf1
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3030 A part of StreamWriter refactoring (Note: this ignores all push blocking failures!) Reviewed By: hwangjeff Differential Revision: D42905959 fbshipit-source-id: ba8add3ce549c70c3775640840e41ace06b0ef65
-
moto authored
Summary: Adding C++ documentation. (C++ APIs are categorized as prototype, though it's used by Python beta APIs.) https://output.circle-artifacts.com/output/job/69654229-a99e-4b15-9ce0-7bc6bcf01101/artifacts/0/docs/libtorchaudio.html <img width="1202" alt="Screenshot 2023-01-31 at 11 48 47 AM" src="https://user-images.githubusercontent.com/855818/215828167-d23032f8-9e40-4413-b5b1-5cbd12d705e9.png"> Pull Request resolved: https://github.com/pytorch/audio/pull/2994 Reviewed By: hwangjeff Differential Revision: D42876621 Pulled By: mthrok fbshipit-source-id: d8b8d610b87ec766501baa88b7506368a9905a6a
-
Wei Wang authored
Summary: https://github.com/pytorch/pytorch/pull/93155 Core has dropped python3.7 Pull Request resolved: https://github.com/pytorch/audio/pull/3020 Reviewed By: mthrok Differential Revision: D42902346 Pulled By: weiwangmeta fbshipit-source-id: 07ab1aff0e128c5960d87e5fa29e341310dea388
-
- 31 Jan, 2023 1 commit
-
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3021 When input format and encode format is different in StreamWriter, filter for format conversion is inserted. A temporary AVFilter (`dst_frame`) is used for this case, but FilterGraph handles the memory allocation, so there is no need to perform allocation by ourselves. This `dst_frame` is otherwise not used, so we do not have to allocate memory at all. This commit removes the unnecessary memory allocation at all. Reviewed By: xiaohui-zhang Differential Revision: D42865042 fbshipit-source-id: 2673b06de1e905dc73a11e2ec1cc6ce7b525d451
-
- 30 Jan, 2023 2 commits
-
-
Yan Li authored
Summary: Currently there will be a few errors when this tutorial is run with a CUDA device. The reasons being: - The source audio waveform is not properly moved to the GPU. The `to()` method is not in-place for Tensors, so we need to assign the return value of the method call to the variable (otherwise the Tensor would still be on the CPU). - When performing further analysis and displaying of the output audio, we need to move them back from the GPU to the CPU. This is because some of the functions we call require the Tensor to be on the CPU (e.g. `stft()` and `bss_eval_sources()`). Pull Request resolved: https://github.com/pytorch/audio/pull/3017 Reviewed By: mthrok Differential Revision: D42828526 Pulled By: nateanl fbshipit-source-id: c28bc855e79e3363a011f4a35a69aae1764e7762
-
moto authored
Summary: We often need to look at which FFmpeg was found and linked when debugging an issue. Version number is often not enough but there is no easy way to find where the library was found either. This commit adds utility function that prints the build time configuration. It helps to distinguish if the linked FFmpeg is the one from binary distribution built in CI or locally built. Pull Request resolved: https://github.com/pytorch/audio/pull/3014 Reviewed By: hwangjeff Differential Revision: D42794952 Pulled By: mthrok fbshipit-source-id: 91ed358fde8cfe9d6d950f34742b1722e729cf4e
-
- 27 Jan, 2023 3 commits
-
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3013 Namespace clean up before publishing the torchaudio C++ API as prototype. Reviewed By: hwangjeff Differential Revision: D42699903 fbshipit-source-id: 8a9eed0390dfa4a152124b42f2b927dbdd3e23d2
-
DanilBaibak authored
Summary: Switch to Nova Linux Conda build. Pull Request resolved: https://github.com/pytorch/audio/pull/2899 Reviewed By: seemethere, osalpekar, mthrok Differential Revision: D42416835 Pulled By: DanilBaibak fbshipit-source-id: 70886c4ff6f3243b80059be9385269cc0f2d4764
-
hwangjeff authored
Summary: Moves `AddNoise`, `Convolve`, `FFTConvolve`, `Speed`, `SpeedPerturbation`, `Deemphasis`, and `Preemphasis` out of `torchaudio.prototype.transforms` and into `torchaudio.transforms`. Pull Request resolved: https://github.com/pytorch/audio/pull/3009 Reviewed By: xiaohui-zhang, mthrok Differential Revision: D42730322 Pulled By: hwangjeff fbshipit-source-id: 43739ac31437150d3127e51eddc0f0bba5facb15
-
- 26 Jan, 2023 1 commit
-
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3007 Simplify the construction of StreamReader/Writer in C++. Currently these classes require client code to build AVFormatContext manually. This is tedious and not user freindly. Some client code actually uses the same helper function that TorchAudio codebase uses. This commit moves the helper logic inside of the constructor of StreamReader/Writer, so that the signatures of these constructors are easy to use and similar to Python interface. Reviewed By: xiaohui-zhang Differential Revision: D42662520 fbshipit-source-id: d95e5236810c48d7d9bd2d89c05d4f60a44b3ba1
-