- 14 Feb, 2023 2 commits
-
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3053 Reviewed By: nateanl Differential Revision: D43238766 Pulled By: mthrok fbshipit-source-id: 4f82878b1c97b0e6a35af75855849b86200e6061
-
Zhaoheng Ni authored
Summary: replicate of https://github.com/pytorch/audio/issues/2644 Pull Request resolved: https://github.com/pytorch/audio/pull/2880 Reviewed By: mthrok Differential Revision: D41633911 Pulled By: nateanl fbshipit-source-id: 73cf145d75c389e996aafe96571ab86dc21f86e5
-
- 11 Feb, 2023 1 commit
-
-
moto authored
Summary: Par https://github.com/pytorch/audio/issues/3040 and https://github.com/pytorch/audio/issues/3041, it turned out Google Colab now has FFmpeg with GPU decoder/encoder preinstalled, and installing FFmpeg manually corrups the environment. This commit updates the tutorial by extracting and moving the how-to-install part to installation/build section. closes https://github.com/pytorch/audio/issues/3041 closes https://github.com/pytorch/audio/issues/3040 Pull Request resolved: https://github.com/pytorch/audio/pull/3050 Reviewed By: nateanl Differential Revision: D43166054 Pulled By: mthrok fbshipit-source-id: 32667f292a796344d5fcde86e8231e15ad904e58
-
- 10 Feb, 2023 1 commit
-
-
Wei Wang authored
Summary: So far Linux and MacOS were tested to work fine out of the box. This PR is created to verify this -- disabled windows jobs and configs for now. Pull Request resolved: https://github.com/pytorch/audio/pull/3039 Reviewed By: osalpekar Differential Revision: D43174745 Pulled By: weiwangmeta fbshipit-source-id: 81766905256e03c5a01cb5448a350f5d409ca4b8
-
- 09 Feb, 2023 3 commits
-
-
moto authored
Summary: - Add documentation - Tweak docsrting - Fix import Pull Request resolved: https://github.com/pytorch/audio/pull/3051 Reviewed By: weiwangmeta, atalman, nateanl Differential Revision: D43166081 Pulled By: mthrok fbshipit-source-id: 7d77aa34a6318a64824626cff8372f8b9aebf6f9
-
moto authored
Summary: Commit b4c66d1f broke all the CIs. The new policy changes the timestamp of configuration files of third party libraries, which triggers re-configuration which requires extra tools. This commit fixes it by reverting the old behavior. Also this adds guard for older cmake versions. Pull Request resolved: https://github.com/pytorch/audio/pull/3046 Reviewed By: atalman Differential Revision: D43133536 Pulled By: mthrok fbshipit-source-id: 357055c8c1b53e593b8b7880f2045e13512c7a8f
-
DanilBaibak authored
Summary: We don't need the presence of physical HW to compile with CUDA. This is a follow up PR regarding `USE_ROCM` for issue https://github.com/pytorch/audio/issues/2979. Pull Request resolved: https://github.com/pytorch/audio/pull/3008 Reviewed By: malfet Differential Revision: D42708862 Pulled By: DanilBaibak fbshipit-source-id: 90cedc80a2d180ca1e0912ad5b644398182417b8
-
- 08 Feb, 2023 4 commits
-
-
moto authored
Summary: Instead of raising an error when lazy import happens, this method allows to import features, and raises an error when the feature is being used. This makes it easy to adopt the same error mechanism across different modules. It is how it's done for sox-related features. Pull Request resolved: https://github.com/pytorch/audio/pull/3028 Reviewed By: xiaohui-zhang Differential Revision: D42966976 Pulled By: mthrok fbshipit-source-id: 423dfe0b8a3970cd07f20e841c794c7f2809f993
-
moto authored
Summary: The first step to migrate doc build to GHA. Pull Request resolved: https://github.com/pytorch/audio/pull/3043 Reviewed By: xiaohui-zhang Differential Revision: D43110816 Pulled By: mthrok fbshipit-source-id: 91de5f3ac567188e7030f14c2827a202a1901f1a
-
moto authored
Summary: Currently, for each third party library checked out with ExternalProject_Add, the following warning is shown. This commit set the policy so that the warning is not shown. ``` CMake Warning (dev) at ci_env/lib/python3.10/site-packages/cmake/data/share/cmake-3.25/Modules/ExternalProject.cmake:3075 (message): The DOWNLOAD_EXTRACT_TIMESTAMP option was not given and policy CMP0135 is not set. The policy's OLD behavior will be used. When using a URL download, the timestamps of extracted files should preferably be that of the time of extraction, otherwise code that depends on the extracted contents might not be rebuilt if the URL changes. The OLD behavior preserves the timestamps from the archive instead, but this is usually not what you want. Update your project to the NEW behavior or specify the DOWNLOAD_EXTRACT_TIMESTAMP option with a value of true to avoid this robustness issue. ``` Pull Request resolved: https://github.com/pytorch/audio/pull/3044 Reviewed By: xiaohui-zhang Differential Revision: D43110818 Pulled By: mthrok fbshipit-source-id: d2e20c9fdbbeeedb5ad546fe32dbda28c5bdd431
-
DanilBaibak authored
Summary: Switch to Nova M1 Conda Pull Request resolved: https://github.com/pytorch/audio/pull/2908 Reviewed By: seemethere, osalpekar Differential Revision: D43093605 Pulled By: DanilBaibak fbshipit-source-id: 9e44f26cfb87e277c3808ee59f50218b4629e86e
-
- 07 Feb, 2023 2 commits
-
-
moto authored
Summary: Add a section about installation/build https://output.circle-artifacts.com/output/job/f121cd38-68f3-47a3-ac29-c7b0cfe94c77/artifacts/0/docs/installation.html <img width="1102" alt="Screenshot 2023-02-06 at 6 13 50 PM" src="https://user-images.githubusercontent.com/855818/217108551-622b117b-209e-4776-b5d6-d6934c8126a4.png"> https://output.circle-artifacts.com/output/job/f121cd38-68f3-47a3-ac29-c7b0cfe94c77/artifacts/0/docs/build.html <img width="1072" alt="Screenshot 2023-02-06 at 6 13 57 PM" src="https://user-images.githubusercontent.com/855818/217108568-c125cdc2-9d6a-4c1d-a155-2cee40c9dac6.png"> Pull Request resolved: https://github.com/pytorch/audio/pull/3038 Reviewed By: hwangjeff, nateanl Differential Revision: D43083469 Pulled By: mthrok fbshipit-source-id: e0b5b76dbf706552dd60ae26ea40ebc98627e3b0
-
juan.azcarreta.ortiz authored
Summary: Allows user to play audio through the device speaker. Pull Request resolved: https://github.com/pytorch/audio/pull/3026 Test Plan: Created a new test that mocks a call to the write audio chunk method from StreamWriter. To run the test: `pytest test/torchaudio_unittest/io/_playback_test.py` Reviewed By: mthrok Differential Revision: D43082062 Pulled By: jazcarretao fbshipit-source-id: 01a85b32ce925687a633d1208d15d54556e89dd8
-
- 06 Feb, 2023 1 commit
-
-
atalman authored
Summary: Switch circleci jobs from cu116 to cu117 Pull Request resolved: https://github.com/pytorch/audio/pull/3034 Reviewed By: DanilBaibak Differential Revision: D43042385 Pulled By: atalman fbshipit-source-id: 636e3d86d66a6091d13d731238550d800e77ccc8
-
- 04 Feb, 2023 2 commits
-
-
Tristan Rice authored
Summary: This adds 2 10 bit pix formats one for CPU and one for CUDA. This allows for training on HDR/10bit video datasets. Pull Request resolved: https://github.com/pytorch/audio/pull/3023 Test Plan: ```py r = StreamReader( reader, format='hevc', ) stream = r.add_video_stream( frames_per_chunk=-1, decoder="hevc_cuvid", hw_accel="cuda", ) frame = next(r.stream()) ``` ```py r = StreamReader( reader, format='hevc', ) stream = r.add_video_stream( frames_per_chunk=-1, filter_desc="format=rgb48le", ) frame = next(r.stream()) ```  Reviewed By: xiaohui-zhang Differential Revision: D43019191 Pulled By: mthrok fbshipit-source-id: fe4359e525b24c8b856dfdf3d2f8596871566350
-
Chin-Yun Yu authored
Summary: close https://github.com/pytorch/audio/issues/1408 . Pull Request resolved: https://github.com/pytorch/audio/pull/3018 Reviewed By: xiaohui-zhang Differential Revision: D42961853 Pulled By: mthrok fbshipit-source-id: b9f847986e0afe416e7817ce4790e42cc0f83ee1
-
- 03 Feb, 2023 1 commit
-
-
moto authored
Summary: Add GitHub Action-based GPU test jobs. - It seems that there is 2 hour upper cap so only running CUDA/GPU tests. - Since Kaldi related features are not available, they are disabled. Pull Request resolved: https://github.com/pytorch/audio/pull/3029 Reviewed By: hwangjeff Differential Revision: D42983800 Pulled By: mthrok fbshipit-source-id: 47fefe39c635d1c73ad6799ddacefd2666fe5403
-
- 02 Feb, 2023 2 commits
-
-
moto authored
Summary: If FFmpeg is not available, sox_io cannot fallback to it. In such case, we use a fallback function, just to raise an error with easy-to-understand message. Turned out that the number of arguments this function receives is wrong. This commit fixes it. Pull Request resolved: https://github.com/pytorch/audio/pull/3032 Reviewed By: hwangjeff Differential Revision: D42966930 Pulled By: mthrok fbshipit-source-id: c2c969c7f8db4119ae965a715d65c10f6ac6087c
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3033 Reviewed By: hwangjeff Differential Revision: D42966938 Pulled By: mthrok fbshipit-source-id: 4889735c244690889f02bf57212489ad333389f7
-
- 01 Feb, 2023 5 commits
-
-
atalman authored
Summary: Remove cuda 11.6 from CI replace with 11.7 Following the Release readme here: https://github.com/pytorch/pytorch/blob/master/RELEASE.md#release-compatibility-matrix Pull Request resolved: https://github.com/pytorch/audio/pull/3031 Reviewed By: mthrok Differential Revision: D42937626 Pulled By: atalman fbshipit-source-id: 7e01c56ec0eefbef9ad455d013fced9012febe82
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3027 To support older NumPy, removing `numpy.typing`. Reviewed By: nateanl Differential Revision: D42924428 fbshipit-source-id: af1a370b5baf00c63a088f172dbc2190d414bdf1
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3030 A part of StreamWriter refactoring (Note: this ignores all push blocking failures!) Reviewed By: hwangjeff Differential Revision: D42905959 fbshipit-source-id: ba8add3ce549c70c3775640840e41ace06b0ef65
-
moto authored
Summary: Adding C++ documentation. (C++ APIs are categorized as prototype, though it's used by Python beta APIs.) https://output.circle-artifacts.com/output/job/69654229-a99e-4b15-9ce0-7bc6bcf01101/artifacts/0/docs/libtorchaudio.html <img width="1202" alt="Screenshot 2023-01-31 at 11 48 47 AM" src="https://user-images.githubusercontent.com/855818/215828167-d23032f8-9e40-4413-b5b1-5cbd12d705e9.png"> Pull Request resolved: https://github.com/pytorch/audio/pull/2994 Reviewed By: hwangjeff Differential Revision: D42876621 Pulled By: mthrok fbshipit-source-id: d8b8d610b87ec766501baa88b7506368a9905a6a
-
Wei Wang authored
Summary: https://github.com/pytorch/pytorch/pull/93155 Core has dropped python3.7 Pull Request resolved: https://github.com/pytorch/audio/pull/3020 Reviewed By: mthrok Differential Revision: D42902346 Pulled By: weiwangmeta fbshipit-source-id: 07ab1aff0e128c5960d87e5fa29e341310dea388
-
- 31 Jan, 2023 1 commit
-
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3021 When input format and encode format is different in StreamWriter, filter for format conversion is inserted. A temporary AVFilter (`dst_frame`) is used for this case, but FilterGraph handles the memory allocation, so there is no need to perform allocation by ourselves. This `dst_frame` is otherwise not used, so we do not have to allocate memory at all. This commit removes the unnecessary memory allocation at all. Reviewed By: xiaohui-zhang Differential Revision: D42865042 fbshipit-source-id: 2673b06de1e905dc73a11e2ec1cc6ce7b525d451
-
- 30 Jan, 2023 2 commits
-
-
Yan Li authored
Summary: Currently there will be a few errors when this tutorial is run with a CUDA device. The reasons being: - The source audio waveform is not properly moved to the GPU. The `to()` method is not in-place for Tensors, so we need to assign the return value of the method call to the variable (otherwise the Tensor would still be on the CPU). - When performing further analysis and displaying of the output audio, we need to move them back from the GPU to the CPU. This is because some of the functions we call require the Tensor to be on the CPU (e.g. `stft()` and `bss_eval_sources()`). Pull Request resolved: https://github.com/pytorch/audio/pull/3017 Reviewed By: mthrok Differential Revision: D42828526 Pulled By: nateanl fbshipit-source-id: c28bc855e79e3363a011f4a35a69aae1764e7762
-
moto authored
Summary: We often need to look at which FFmpeg was found and linked when debugging an issue. Version number is often not enough but there is no easy way to find where the library was found either. This commit adds utility function that prints the build time configuration. It helps to distinguish if the linked FFmpeg is the one from binary distribution built in CI or locally built. Pull Request resolved: https://github.com/pytorch/audio/pull/3014 Reviewed By: hwangjeff Differential Revision: D42794952 Pulled By: mthrok fbshipit-source-id: 91ed358fde8cfe9d6d950f34742b1722e729cf4e
-
- 27 Jan, 2023 3 commits
-
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3013 Namespace clean up before publishing the torchaudio C++ API as prototype. Reviewed By: hwangjeff Differential Revision: D42699903 fbshipit-source-id: 8a9eed0390dfa4a152124b42f2b927dbdd3e23d2
-
DanilBaibak authored
Summary: Switch to Nova Linux Conda build. Pull Request resolved: https://github.com/pytorch/audio/pull/2899 Reviewed By: seemethere, osalpekar, mthrok Differential Revision: D42416835 Pulled By: DanilBaibak fbshipit-source-id: 70886c4ff6f3243b80059be9385269cc0f2d4764
-
hwangjeff authored
Summary: Moves `AddNoise`, `Convolve`, `FFTConvolve`, `Speed`, `SpeedPerturbation`, `Deemphasis`, and `Preemphasis` out of `torchaudio.prototype.transforms` and into `torchaudio.transforms`. Pull Request resolved: https://github.com/pytorch/audio/pull/3009 Reviewed By: xiaohui-zhang, mthrok Differential Revision: D42730322 Pulled By: hwangjeff fbshipit-source-id: 43739ac31437150d3127e51eddc0f0bba5facb15
-
- 26 Jan, 2023 3 commits
-
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3007 Simplify the construction of StreamReader/Writer in C++. Currently these classes require client code to build AVFormatContext manually. This is tedious and not user freindly. Some client code actually uses the same helper function that TorchAudio codebase uses. This commit moves the helper logic inside of the constructor of StreamReader/Writer, so that the signatures of these constructors are easy to use and similar to Python interface. Reviewed By: xiaohui-zhang Differential Revision: D42662520 fbshipit-source-id: d95e5236810c48d7d9bd2d89c05d4f60a44b3ba1
-
hwangjeff authored
Summary: Passing functions as test parameters causes issues on some platforms. This PR updates the functional tests to pass functions by name instead. Pull Request resolved: https://github.com/pytorch/audio/pull/3011 Reviewed By: mthrok Differential Revision: D42748106 Pulled By: hwangjeff fbshipit-source-id: 4d81dabe4aff2293bc344a457a034a2d9af024e2
-
moto authored
Summary: These functions are called part of sox initialization, thus it is no longer needed. Pull Request resolved: https://github.com/pytorch/audio/pull/3010 Reviewed By: hwangjeff Differential Revision: D42744478 Pulled By: mthrok fbshipit-source-id: 17d715b328392397ec47d81a533a307aac22862d
-
- 24 Jan, 2023 1 commit
-
-
hwangjeff authored
Summary: Moves `add_noise`, `fftconvolve`, `convolve`, `speed`, `preemphasis`, and `deemphasis` out of `torchaudio.prototype.functional` and into `torchaudio.functional`. Pull Request resolved: https://github.com/pytorch/audio/pull/3001 Reviewed By: mthrok Differential Revision: D42688971 Pulled By: hwangjeff fbshipit-source-id: 43280bd3ffeccddae57f1092ac45afb64dd426cc
-
- 23 Jan, 2023 3 commits
-
-
Nikita Shulga authored
Summary: We don't need the presence of physical HW to compile with CUDA. Likely one of the causes of https://github.com/pytorch/audio/issues/2979 (i.e. in CircleCI builds USE_CUDA were defined by CI environment, so nobody ever checked the default, but this is not the case in Nova builds) Pull Request resolved: https://github.com/pytorch/audio/pull/3005 Test Plan: Check that `compute.cu` is mentioned in builds, for example see https://github.com/pytorch/audio/actions/runs/3990295262/jobs/6843771056#step:9:829 ``` [193/202] /usr/local/cuda-11.6/bin/nvcc -forward-unknown-to-host-compiler -DINCLUDE_KALDI -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_CUDA -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dlibtorchaudio_EXPORTS -I/__w/audio/audio/pytorch/audio -I/__w/audio/audio/pytorch/audio/third_party/kaldi/src -I/__w/audio/audio/pytorch/audio/third_party/kaldi/submodule/src -isystem=/__w/_temp/conda_environment_3990295262/lib/python3.7/site-packages/torch/include -isystem=/__w/_temp/conda_environment_3990295262/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -isystem=/usr/local/cuda-11.6/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_35,code=sm_35 -gencode arch=compute_50,code=sm_50 -gencode arch=compute_60,code=sm_60 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_50,code=compute_50 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=integer_sign_change,--diag_suppress=useless_using_declaration,--diag_suppress=set_but_not_used,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=implicit_return_from_non_void_function,--diag_suppress=unsigned_compare_with_zero,--diag_suppress=declared_but_not_referenced,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -Xcompiler=-fPIC -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++17 -MD -MT torchaudio/csrc/CMakeFiles/libtorchaudio.dir/rnnt/gpu/compute.cu.o -MF torchaudio/csrc/CMakeFiles/libtorchaudio.dir/rnnt/gpu/compute.cu.o.d -x cu -c /__w/audio/audio/pytorch/audio/torchaudio/csrc/rnnt/gpu/compute.cu -o torchaudio/csrc/CMakeFiles/libtorchaudio.dir/rnnt/gpu/compute.cu.o ``` Reviewed By: mthrok Differential Revision: D42687455 Pulled By: malfet fbshipit-source-id: c37ad58cc62439d1268865e9bf0bcb97079a529f
-
Moto Hira authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3002 This commit merges `pop_chunks` and `pop_chunks_with_metadata`. In #2975 (D42526945 (https://github.com/pytorch/audio/commit/0dd59e0dda22eabf54fc95ad8050094df239bd39)), we updated StreamReader so that it returns PTS. In that PR, we introduced `pop_chunks_with_metadata` method, so that the original `pop_chunks` method returns the same type and we could focus on the PTS logic in the code review. The commit is landed, now we merge the two methods, so that the original `pop_chunks` returns Tensor frames and metadata (PTS). Reviewed By: xiaohui-zhang Differential Revision: D42662321 fbshipit-source-id: 37ae088bc63fc516ea068698088925e8b31bc0a1
-
moto authored
Summary: This change fixes the issue where syntax highlighting is broken up par word. ## Plain Before <img width="243" alt="Screenshot 2023-01-20 at 1 28 48 PM" src="https://user-images.githubusercontent.com/855818/213778202-27ec8030-3f2f-4ef9-8210-bce7cfc3cb38.png"> After <img width="244" alt="Screenshot 2023-01-20 at 1 29 01 PM" src="https://user-images.githubusercontent.com/855818/213778231-61c52825-d63a-4913-b10d-a65f3b2cfbbb.png"> ## In articles Before <img width="786" alt="Screenshot 2023-01-20 at 1 34 12 PM" src="https://user-images.githubusercontent.com/855818/213779050-c21ba5e2-84b3-4935-bbab-6edcb7bc89ce.png"> After <img width="783" alt="Screenshot 2023-01-20 at 1 34 17 PM" src="https://user-images.githubusercontent.com/855818/213779069-f1406422-27a4-41cf-8ccd-5058f80860bd.png"> ## In tables Before <img width="813" alt="Screenshot 2023-01-20 at 1 27 35 PM" src="https://user-images.githubusercontent.com/855818/213778039-fede6f18-5a35-47f2-9e0b-a9be5716dc73.png"> After <img width="813" alt="Screenshot 2023-01-20 at 1 27 51 PM" src="https://user-images.githubusercontent.com/855818/213778073-e26275a9-d380-4601-aa92-84af7aeab00f.png"> Pull Request resolved: https://github.com/pytorch/audio/pull/3000 Reviewed By: xiaohui-zhang Differential Revision: D42642522 Pulled By: mthrok fbshipit-source-id: 6831bb90da005aff8d7f178ef768e967bc6d2640
-
- 22 Jan, 2023 1 commit
-
-
moto authored
Summary: This commit makes `StreamReader` report PTS (presentation time stamp) of the returned chunk as well. Example ```python from torchaudio.io import StreamReader s = StreamReader(...) s.add_video_stream(...) for (video_chunk, ) in s.stream(): # video_chunk is Torch tensor type but has extra attribute of PTS print(video_chunk.pts) # reports the PTS of the first frame of the video chunk. ``` For the backward compatibility, we introduce a `_ChunkTensor`, that is a composition of Tensor and metadata, but works like a normal tensor in PyTorch operations. The implementation of `_ChunkTensor` is based on [TrivialTensorViaComposition](https://github.com/albanD/subclass_zoo/blob/0eeb1d68fb59879029c610bc407f2997ae43ba0a/trivial_tensors.py#L83). It was also suggested to attach metadata directly to Tensor object, but the possibility to have the collision on torchaudio's metadata and new attributes introduced in PyTorch cannot be ignored, so we use Tensor subclass implementation. If any unexpected issue arise from metadata attribute name collision, client code can fetch the bare Tensor and continue. Pull Request resolved: https://github.com/pytorch/audio/pull/2975 Reviewed By: hwangjeff Differential Revision: D42526945 Pulled By: mthrok fbshipit-source-id: b4e9422e914ff328421b975120460f3001268f35
-
- 20 Jan, 2023 2 commits
-
-
moto authored
Summary: Extraction from https://github.com/pytorch/audio/issues/2994 Add docstrings to C++ StreamReader/Writer. Pull Request resolved: https://github.com/pytorch/audio/pull/2997 Reviewed By: nateanl Differential Revision: D42628016 Pulled By: mthrok fbshipit-source-id: b22c43b80997af4a9087142340c67bed28e54917
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2999 Reviewed By: hwangjeff Differential Revision: D42637618 Pulled By: mthrok fbshipit-source-id: 35a7976c316e3b3899ae9c2202f132f1a960b736
-