Commits · 73b29fc93b2de0b52573d2a966db633518aa1ae3 · OpenDAS / Torchaudio

14 Feb, 2023 2 commits

Redirect build instruction to official doc (#3053) · 73b29fc9

moto authored Feb 14, 2023

Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3053

Reviewed By: nateanl

Differential Revision: D43238766

Pulled By: mthrok

fbshipit-source-id: 4f82878b1c97b0e6a35af75855849b86200e6061

73b29fc9

Add simulate_rir_ism method for room impulse response simulation (#2880) · 8c5c9a9b

Zhaoheng Ni authored Feb 14, 2023

Summary:
replicate of https://github.com/pytorch/audio/issues/2644

Pull Request resolved: https://github.com/pytorch/audio/pull/2880

Reviewed By: mthrok

Differential Revision: D41633911

Pulled By: nateanl

fbshipit-source-id: 73cf145d75c389e996aafe96571ab86dc21f86e5

8c5c9a9b

11 Feb, 2023 1 commit

Update hardware accelerated video processing tutorial (#3050) · 3f02b898

moto authored Feb 10, 2023

Summary:
Par https://github.com/pytorch/audio/issues/3040 and https://github.com/pytorch/audio/issues/3041, it turned out Google Colab now has FFmpeg with GPU decoder/encoder preinstalled, and installing FFmpeg manually corrups the environment.

This commit updates the tutorial by extracting and moving the how-to-install part to installation/build section.

closes https://github.com/pytorch/audio/issues/3041
closes https://github.com/pytorch/audio/issues/3040

Pull Request resolved: https://github.com/pytorch/audio/pull/3050

Reviewed By: nateanl

Differential Revision: D43166054

Pulled By: mthrok

fbshipit-source-id: 32667f292a796344d5fcde86e8231e15ad904e58

3f02b898

10 Feb, 2023 1 commit

Add python 3.11 support for torchaudio and add workflow concurrency rule (#3039) · fadb5ae5

Wei Wang authored Feb 09, 2023

Summary:
So far Linux and MacOS were tested to work fine out of the box. This PR is created to verify this -- disabled windows jobs and configs for now.

Pull Request resolved: https://github.com/pytorch/audio/pull/3039

Reviewed By: osalpekar

Differential Revision: D43174745

Pulled By: weiwangmeta

fbshipit-source-id: 81766905256e03c5a01cb5448a350f5d409ca4b8

fadb5ae5

09 Feb, 2023 3 commits

Follow-up on audio playback function (#3051) · 91b05e2e

moto authored Feb 09, 2023

Summary:
- Add documentation
- Tweak docsrting
- Fix import

Pull Request resolved: https://github.com/pytorch/audio/pull/3051

Reviewed By: weiwangmeta, atalman, nateanl

Differential Revision: D43166081

Pulled By: mthrok

fbshipit-source-id: 7d77aa34a6318a64824626cff8372f8b9aebf6f9

91b05e2e

Follow-up fix policy set (#3046) · 70acff7a

moto authored Feb 09, 2023

Summary:
Commit b4c66d1f broke all the CIs.
The new policy changes the timestamp of configuration files of third party libraries,
which triggers re-configuration which requires extra tools.

This commit fixes it by reverting the old behavior.
Also this adds guard for older cmake versions.

Pull Request resolved: https://github.com/pytorch/audio/pull/3046

Reviewed By: atalman

Differential Revision: D43133536

Pulled By: mthrok

fbshipit-source-id: 357055c8c1b53e593b8b7880f2045e13512c7a8f

70acff7a

Updated USE_ROCM detection (#3008) · 05d597fa

DanilBaibak authored Feb 09, 2023

Summary:
We don't need the presence of physical HW to compile with CUDA.

This is a follow up PR regarding `USE_ROCM` for issue https://github.com/pytorch/audio/issues/2979.

Pull Request resolved: https://github.com/pytorch/audio/pull/3008

Reviewed By: malfet

Differential Revision: D42708862

Pulled By: DanilBaibak

fbshipit-source-id: 90cedc80a2d180ca1e0912ad5b644398182417b8

05d597fa

08 Feb, 2023 4 commits

Update the guard mechanism for FFmpeg-related features (#3028) · 98b3ac17

moto authored Feb 08, 2023

Summary:
Instead of raising an error when lazy import happens, this method allows to import features, and raises an error when the feature is being used.

This makes it easy to adopt the same error mechanism across different modules. It is how it's done for sox-related features.

Pull Request resolved: https://github.com/pytorch/audio/pull/3028

Reviewed By: xiaohui-zhang

Differential Revision: D42966976

Pulled By: mthrok

fbshipit-source-id: 423dfe0b8a3970cd07f20e841c794c7f2809f993

98b3ac17

Build doc on GHA (#3043) · a0f8af4b

moto authored Feb 08, 2023

Summary:
The first step to migrate doc build to GHA.

Pull Request resolved: https://github.com/pytorch/audio/pull/3043

Reviewed By: xiaohui-zhang

Differential Revision: D43110816

Pulled By: mthrok

fbshipit-source-id: 91de5f3ac567188e7030f14c2827a202a1901f1a

a0f8af4b

Suppres warning about archive timestamp (#3044) · b4c66d1f

moto authored Feb 08, 2023

Summary:
Currently, for each third party library checked out with ExternalProject_Add, the following warning is shown.

This commit set the policy so that the warning is not shown.

```
CMake Warning (dev) at ci_env/lib/python3.10/site-packages/cmake/data/share/cmake-3.25/Modules/ExternalProject.cmake:3075 (message):
  The DOWNLOAD_EXTRACT_TIMESTAMP option was not given and policy CMP0135 is
  not set.  The policy's OLD behavior will be used.  When using a URL
  download, the timestamps of extracted files should preferably be that of
  the time of extraction, otherwise code that depends on the extracted
  contents might not be rebuilt if the URL changes.  The OLD behavior
  preserves the timestamps from the archive instead, but this is usually not
  what you want.  Update your project to the NEW behavior or specify the
  DOWNLOAD_EXTRACT_TIMESTAMP option with a value of true to avoid this
  robustness issue.
```

Pull Request resolved: https://github.com/pytorch/audio/pull/3044

Reviewed By: xiaohui-zhang

Differential Revision: D43110818

Pulled By: mthrok

fbshipit-source-id: d2e20c9fdbbeeedb5ad546fe32dbda28c5bdd431

b4c66d1f

Switch to Nova MacOS Conda (#2908) · de54d864

DanilBaibak authored Feb 08, 2023

Summary:
Switch to Nova M1 Conda

Pull Request resolved: https://github.com/pytorch/audio/pull/2908

Reviewed By: seemethere, osalpekar

Differential Revision: D43093605

Pulled By: DanilBaibak

fbshipit-source-id: 9e44f26cfb87e277c3808ee59f50218b4629e86e

de54d864

07 Feb, 2023 2 commits

Add installation / build instruction to doc (#3038) · 3c121a59

moto authored Feb 07, 2023

Summary:
Add a section about installation/build

https://output.circle-artifacts.com/output/job/f121cd38-68f3-47a3-ac29-c7b0cfe94c77/artifacts/0/docs/installation.html
<img width="1102" alt="Screenshot 2023-02-06 at 6 13 50 PM" src="https://user-images.githubusercontent.com/855818/217108551-622b117b-209e-4776-b5d6-d6934c8126a4.png">

https://output.circle-artifacts.com/output/job/f121cd38-68f3-47a3-ac29-c7b0cfe94c77/artifacts/0/docs/build.html
<img width="1072" alt="Screenshot 2023-02-06 at 6 13 57 PM" src="https://user-images.githubusercontent.com/855818/217108568-c125cdc2-9d6a-4c1d-a155-2cee40c9dac6.png">

Pull Request resolved: https://github.com/pytorch/audio/pull/3038

Reviewed By: hwangjeff, nateanl

Differential Revision: D43083469

Pulled By: mthrok

fbshipit-source-id: e0b5b76dbf706552dd60ae26ea40ebc98627e3b0

3c121a59

Add playback function (#3026) · 2ead941e

juan.azcarreta.ortiz authored Feb 07, 2023

Summary:
Allows user to play audio through the
device speaker.

Pull Request resolved: https://github.com/pytorch/audio/pull/3026

Test Plan:
Created a new test that mocks a call to the write audio chunk method from StreamWriter. To run the test:

`pytest test/torchaudio_unittest/io/_playback_test.py`

Reviewed By: mthrok

Differential Revision: D43082062

Pulled By: jazcarretao

fbshipit-source-id: 01a85b32ce925687a633d1208d15d54556e89dd8

2ead941e

06 Feb, 2023 1 commit

Switch circleci jobs from cu116 to cu117 (#3034) · 9368f33b

atalman authored Feb 06, 2023

Summary:
Switch circleci jobs from cu116 to cu117

Pull Request resolved: https://github.com/pytorch/audio/pull/3034

Reviewed By: DanilBaibak

Differential Revision: D43042385

Pulled By: atalman

fbshipit-source-id: 636e3d86d66a6091d13d731238550d800e77ccc8

9368f33b

04 Feb, 2023 2 commits

Add rgb48le and CUDA p010 support (HDR/10bit) to StreamReader (#3023) · b7e173fa

Tristan Rice authored Feb 04, 2023

Summary:
This adds 2 10 bit pix formats one for CPU and one for CUDA. This allows for training on HDR/10bit video datasets.

Pull Request resolved: https://github.com/pytorch/audio/pull/3023

Test Plan:
```py
r = StreamReader(
    reader, format='hevc',
)
stream = r.add_video_stream(
    frames_per_chunk=-1,
    decoder="hevc_cuvid",
    hw_accel="cuda",
)
frame = next(r.stream())
```

```py
r = StreamReader(
    reader, format='hevc',
)
stream = r.add_video_stream(
    frames_per_chunk=-1,
    filter_desc="format=rgb48le",
)
frame = next(r.stream())
```

![audio-example](https://user-images.githubusercontent.com/909104/215696543-ed3dc5a3-3013-4a57-8b98-05aa4a5a9a7c.png)

Reviewed By: xiaohui-zhang

Differential Revision: D43019191

Pulled By: mthrok

fbshipit-source-id: fe4359e525b24c8b856dfdf3d2f8596871566350

b7e173fa

feat: cuda kernel for lfilter (#3018) · 4f201054

Chin-Yun Yu authored Feb 03, 2023

Summary:
close https://github.com/pytorch/audio/issues/1408 .

Pull Request resolved: https://github.com/pytorch/audio/pull/3018

Reviewed By: xiaohui-zhang

Differential Revision: D42961853

Pulled By: mthrok

fbshipit-source-id: b9f847986e0afe416e7817ce4790e42cc0f83ee1

4f201054

03 Feb, 2023 1 commit

Add Linux GPU unit tests on GHA (#3029) · 6bdd3830

moto authored Feb 02, 2023

Summary:
Add GitHub Action-based GPU test jobs.
- It seems that there is 2 hour upper cap so only running CUDA/GPU tests.
- Since Kaldi related features are not available, they are disabled.

Pull Request resolved: https://github.com/pytorch/audio/pull/3029

Reviewed By: hwangjeff

Differential Revision: D42983800

Pulled By: mthrok

fbshipit-source-id: 47fefe39c635d1c73ad6799ddacefd2666fe5403

6bdd3830

02 Feb, 2023 2 commits

Fix signature mismatch on _fail_info_fileobj (#3032) · 409c687f

moto authored Feb 02, 2023

Summary:
If FFmpeg is not available, sox_io cannot fallback to it. In such case, we use a fallback function, just to raise an error with easy-to-understand message.

Turned out that the number of arguments this function receives is wrong.

This commit fixes it.

Pull Request resolved: https://github.com/pytorch/audio/pull/3032

Reviewed By: hwangjeff

Differential Revision: D42966930

Pulled By: mthrok

fbshipit-source-id: c2c969c7f8db4119ae965a715d65c10f6ac6087c

409c687f

Add depreaction warnings to file-like object support in sox_io (#3033) · c63e9eb8

moto authored Feb 02, 2023

Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3033

Reviewed By: hwangjeff

Differential Revision: D42966938

Pulled By: mthrok

fbshipit-source-id: 4889735c244690889f02bf57212489ad333389f7

c63e9eb8

01 Feb, 2023 5 commits

Switch CI to CUDA 11.7 from CUDA 11.6 (#3031) · b28f9b34

atalman authored Feb 01, 2023

Summary:
Remove cuda 11.6 from CI replace with 11.7
Following the Release readme here: https://github.com/pytorch/pytorch/blob/master/RELEASE.md#release-compatibility-matrix

Pull Request resolved: https://github.com/pytorch/audio/pull/3031

Reviewed By: mthrok

Differential Revision: D42937626

Pulled By: atalman

fbshipit-source-id: 7e01c56ec0eefbef9ad455d013fced9012febe82

b28f9b34

Update prototype functional tests. (#3027) · 01ba0ac8

Moto Hira authored Feb 01, 2023

Summary:
Pull Request resolved: https://github.com/pytorch/audio/pull/3027

To support older NumPy, removing `numpy.typing`.

Reviewed By: nateanl

Differential Revision: D42924428

fbshipit-source-id: af1a370b5baf00c63a088f172dbc2190d414bdf1

01ba0ac8

Cleaning up private methods (#3030) · 49287210

Moto Hira authored Feb 01, 2023

Summary:
Pull Request resolved: https://github.com/pytorch/audio/pull/3030

A part of StreamWriter refactoring

(Note: this ignores all push blocking failures!)

Reviewed By: hwangjeff

Differential Revision: D42905959

fbshipit-source-id: ba8add3ce549c70c3775640840e41ace06b0ef65

49287210

Add C++ documentation (#2994) · f663cb28

moto authored Jan 31, 2023

Summary:
Adding C++ documentation. (C++ APIs are categorized as prototype, though it's used by Python beta APIs.)

https://output.circle-artifacts.com/output/job/69654229-a99e-4b15-9ce0-7bc6bcf01101/artifacts/0/docs/libtorchaudio.html

<img width="1202" alt="Screenshot 2023-01-31 at 11 48 47 AM" src="https://user-images.githubusercontent.com/855818/215828167-d23032f8-9e40-4413-b5b1-5cbd12d705e9.png">

Pull Request resolved: https://github.com/pytorch/audio/pull/2994

Reviewed By: hwangjeff

Differential Revision: D42876621

Pulled By: mthrok

fbshipit-source-id: d8b8d610b87ec766501baa88b7506368a9905a6a

f663cb28

Drop python 3.7 support (#3020) · 60af60a8

Wei Wang authored Jan 31, 2023

Summary:
https://github.com/pytorch/pytorch/pull/93155 Core has dropped python3.7

Pull Request resolved: https://github.com/pytorch/audio/pull/3020

Reviewed By: mthrok

Differential Revision: D42902346

Pulled By: weiwangmeta

fbshipit-source-id: 07ab1aff0e128c5960d87e5fa29e341310dea388

60af60a8

31 Jan, 2023 1 commit

Remove unnecessary AVFrame allocation (#3021) · 0709cadc

Moto Hira authored Jan 31, 2023

Summary:
Pull Request resolved: https://github.com/pytorch/audio/pull/3021

When input format and encode format is different in StreamWriter, filter for format conversion is inserted.

A temporary AVFilter (`dst_frame`) is used for this case,
but FilterGraph handles the memory allocation,
so there is no need to perform allocation by ourselves.

This `dst_frame` is otherwise not used, so we do not have to allocate memory at all.
This commit removes the unnecessary memory allocation at all.

Reviewed By: xiaohui-zhang

Differential Revision: D42865042

fbshipit-source-id: 2673b06de1e905dc73a11e2ec1cc6ce7b525d451

0709cadc

30 Jan, 2023 2 commits

Fix hybrid demucs tutorial for CUDA (#3017) · da9d1627

Yan Li authored Jan 30, 2023

Summary:
Currently there will be a few errors when this tutorial is run with a CUDA device.

The reasons being:
- The source audio waveform is not properly moved to the GPU. The `to()` method is not in-place for Tensors, so we need to assign the return value of the method call to the variable (otherwise the Tensor would still be on the CPU).
- When performing further analysis and displaying of the output audio, we need to move them back from the GPU to the CPU. This is because some of the functions we call require the Tensor to be on the CPU (e.g. `stft()` and `bss_eval_sources()`).

Pull Request resolved: https://github.com/pytorch/audio/pull/3017

Reviewed By: mthrok

Differential Revision: D42828526

Pulled By: nateanl

fbshipit-source-id: c28bc855e79e3363a011f4a35a69aae1764e7762

da9d1627

Add get_build_config ffmpeg utility function (#3014) · 635d8cff

moto authored Jan 29, 2023

Summary:
We often need to look at which FFmpeg was found and linked when debugging an issue.

Version number is often not enough but there is no easy way to find where the library was found either.

This commit adds utility function that prints the build time configuration.

It helps to distinguish if the linked FFmpeg is the one from binary distribution built in CI or locally built.

Pull Request resolved: https://github.com/pytorch/audio/pull/3014

Reviewed By: hwangjeff

Differential Revision: D42794952

Pulled By: mthrok

fbshipit-source-id: 91ed358fde8cfe9d6d950f34742b1722e729cf4e

635d8cff

27 Jan, 2023 3 commits

Replace torchaudio::ffmpeg with torchaudio::io (#3013) · 51aae466

Moto Hira authored Jan 27, 2023

Summary:
Pull Request resolved: https://github.com/pytorch/audio/pull/3013

Namespace clean up before publishing the torchaudio C++ API as prototype.

Reviewed By: hwangjeff

Differential Revision: D42699903

fbshipit-source-id: 8a9eed0390dfa4a152124b42f2b927dbdd3e23d2

51aae466

Switch to Nova Linux Conda build (#2899) · 12f960b2

DanilBaibak authored Jan 27, 2023

Summary:
Switch to Nova Linux Conda build.

Pull Request resolved: https://github.com/pytorch/audio/pull/2899

Reviewed By: seemethere, osalpekar, mthrok

Differential Revision: D42416835

Pulled By: DanilBaibak

fbshipit-source-id: 70886c4ff6f3243b80059be9385269cc0f2d4764

12f960b2

Move data augmentation transforms out of prototype (#3009) · b4cc0f33

hwangjeff authored Jan 26, 2023

Summary:
Moves `AddNoise`, `Convolve`, `FFTConvolve`, `Speed`, `SpeedPerturbation`, `Deemphasis`, and `Preemphasis` out of `torchaudio.prototype.transforms` and into `torchaudio.transforms`.

Pull Request resolved: https://github.com/pytorch/audio/pull/3009

Reviewed By: xiaohui-zhang, mthrok

Differential Revision: D42730322

Pulled By: hwangjeff

fbshipit-source-id: 43739ac31437150d3127e51eddc0f0bba5facb15

b4cc0f33

26 Jan, 2023 3 commits

Abstract away AVFormatContext from StreamReader/Writer constructor (#3007) · 7ea69e61

Moto Hira authored Jan 26, 2023

Summary:
Pull Request resolved: https://github.com/pytorch/audio/pull/3007

Simplify the construction of StreamReader/Writer in C++.

Currently these classes require client code to build AVFormatContext
manually. This is tedious and not user freindly.

Some client code actually uses the same helper function that
TorchAudio codebase uses.

This commit moves the helper logic inside of the constructor of
StreamReader/Writer, so that the signatures of these constructors
are easy to use and similar to Python interface.

Reviewed By: xiaohui-zhang

Differential Revision: D42662520

fbshipit-source-id: d95e5236810c48d7d9bd2d89c05d4f60a44b3ba1

7ea69e61

Remove function input parameters from data aug functional tests (#3011) · 2f5fcf4f

hwangjeff authored Jan 25, 2023

Summary:
Passing functions as test parameters causes issues on some platforms. This PR updates the functional tests to pass functions by name instead.

Pull Request resolved: https://github.com/pytorch/audio/pull/3011

Reviewed By: mthrok

Differential Revision: D42748106

Pulled By: hwangjeff

fbshipit-source-id: 4d81dabe4aff2293bc344a457a034a2d9af024e2

2f5fcf4f

Deprecate sox initialization/shutdown public API functions (#3010) · aa760caf

moto authored Jan 25, 2023

Summary:
These functions are called part of sox initialization, thus it is no longer needed.

Pull Request resolved: https://github.com/pytorch/audio/pull/3010

Reviewed By: hwangjeff

Differential Revision: D42744478

Pulled By: mthrok

fbshipit-source-id: 17d715b328392397ec47d81a533a307aac22862d

aa760caf

24 Jan, 2023 1 commit

Move data augmentation functions out of prototype (#3001) · 41b88314

hwangjeff authored Jan 23, 2023

Summary:
Moves `add_noise`, `fftconvolve`, `convolve`, `speed`, `preemphasis`, and `deemphasis` out of `torchaudio.prototype.functional` and into `torchaudio.functional`.

Pull Request resolved: https://github.com/pytorch/audio/pull/3001

Reviewed By: mthrok

Differential Revision: D42688971

Pulled By: hwangjeff

fbshipit-source-id: 43280bd3ffeccddae57f1092ac45afb64dd426cc

41b88314

23 Jan, 2023 3 commits

Tweak `USE_CUDA` detection (#3005) · 09e7d818

Nikita Shulga authored Jan 23, 2023

Summary:
We don't need the presence of physical HW to compile with CUDA.

Likely one of the causes of  https://github.com/pytorch/audio/issues/2979 (i.e. in CircleCI builds USE_CUDA were defined by CI environment, so nobody ever checked the default, but this is not the case in Nova builds)

Pull Request resolved: https://github.com/pytorch/audio/pull/3005

Test Plan:
Check that `compute.cu` is mentioned in builds, for example see https://github.com/pytorch/audio/actions/runs/3990295262/jobs/6843771056#step:9:829
```
[193/202] /usr/local/cuda-11.6/bin/nvcc -forward-unknown-to-host-compiler -DINCLUDE_KALDI -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_CUDA -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dlibtorchaudio_EXPORTS -I/__w/audio/audio/pytorch/audio -I/__w/audio/audio/pytorch/audio/third_party/kaldi/src -I/__w/audio/audio/pytorch/audio/third_party/kaldi/submodule/src -isystem=/__w/_temp/conda_environment_3990295262/lib/python3.7/site-packages/torch/include -isystem=/__w/_temp/conda_environment_3990295262/lib/python3.7/site-packages/torch/include/torch/csrc/api/include -isystem=/usr/local/cuda-11.6/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_35,code=sm_35 -gencode arch=compute_50,code=sm_50 -gencode arch=compute_60,code=sm_60 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_50,code=compute_50 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=integer_sign_change,--diag_suppress=useless_using_declaration,--diag_suppress=set_but_not_used,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=implicit_return_from_non_void_function,--diag_suppress=unsigned_compare_with_zero,--diag_suppress=declared_but_not_referenced,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -Xcompiler=-fPIC -D_GLIBCXX_USE_CXX11_ABI=0 -std=c++17 -MD -MT torchaudio/csrc/CMakeFiles/libtorchaudio.dir/rnnt/gpu/compute.cu.o -MF torchaudio/csrc/CMakeFiles/libtorchaudio.dir/rnnt/gpu/compute.cu.o.d -x cu -c /__w/audio/audio/pytorch/audio/torchaudio/csrc/rnnt/gpu/compute.cu -o torchaudio/csrc/CMakeFiles/libtorchaudio.dir/rnnt/gpu/compute.cu.o
```

Reviewed By: mthrok

Differential Revision: D42687455

Pulled By: malfet

fbshipit-source-id: c37ad58cc62439d1268865e9bf0bcb97079a529f

09e7d818

Merge pop_chunks methods (#3002) · 54196fd3

Moto Hira authored Jan 23, 2023

Summary:
Pull Request resolved: https://github.com/pytorch/audio/pull/3002

This commit merges `pop_chunks` and `pop_chunks_with_metadata`.

In #2975 (D42526945 (https://github.com/pytorch/audio/commit/0dd59e0dda22eabf54fc95ad8050094df239bd39)), we updated StreamReader so that it returns PTS.
In that PR, we introduced `pop_chunks_with_metadata` method, so that
the original `pop_chunks` method returns the same type and we could
focus on the PTS logic in the code review.

The commit is landed, now we merge the two methods, so that the original
`pop_chunks` returns Tensor frames and metadata (PTS).

Reviewed By: xiaohui-zhang

Differential Revision: D42662321

fbshipit-source-id: 37ae088bc63fc516ea068698088925e8b31bc0a1

54196fd3

Update highlighting in doc (#3000) · 1f9b9104

moto authored Jan 23, 2023

Summary:
This change fixes the issue where syntax highlighting is broken up par word.

## Plain
Before
<img width="243" alt="Screenshot 2023-01-20 at 1 28 48 PM" src="https://user-images.githubusercontent.com/855818/213778202-27ec8030-3f2f-4ef9-8210-bce7cfc3cb38.png">
After
<img width="244" alt="Screenshot 2023-01-20 at 1 29 01 PM" src="https://user-images.githubusercontent.com/855818/213778231-61c52825-d63a-4913-b10d-a65f3b2cfbbb.png">

## In articles
Before
<img width="786" alt="Screenshot 2023-01-20 at 1 34 12 PM" src="https://user-images.githubusercontent.com/855818/213779050-c21ba5e2-84b3-4935-bbab-6edcb7bc89ce.png">
After
<img width="783" alt="Screenshot 2023-01-20 at 1 34 17 PM" src="https://user-images.githubusercontent.com/855818/213779069-f1406422-27a4-41cf-8ccd-5058f80860bd.png">

## In tables
Before
<img width="813" alt="Screenshot 2023-01-20 at 1 27 35 PM" src="https://user-images.githubusercontent.com/855818/213778039-fede6f18-5a35-47f2-9e0b-a9be5716dc73.png">
After
<img width="813" alt="Screenshot 2023-01-20 at 1 27 51 PM" src="https://user-images.githubusercontent.com/855818/213778073-e26275a9-d380-4601-aa92-84af7aeab00f.png">

Pull Request resolved: https://github.com/pytorch/audio/pull/3000

Reviewed By: xiaohui-zhang

Differential Revision: D42642522

Pulled By: mthrok

fbshipit-source-id: 6831bb90da005aff8d7f178ef768e967bc6d2640

1f9b9104

22 Jan, 2023 1 commit

Make StreamReader return PTS (#2975) · 0dd59e0d

moto authored Jan 22, 2023

Summary:
This commit makes `StreamReader` report PTS (presentation time stamp) of the returned chunk as well.

Example

```python
from torchaudio.io import StreamReader

s = StreamReader(...)
s.add_video_stream(...)
for (video_chunk, ) in s.stream():
    # video_chunk is Torch tensor type but has extra attribute of PTS
    print(video_chunk.pts)  # reports the PTS of the first frame of the video chunk.
```

For the backward compatibility, we introduce a `_ChunkTensor`, that is a composition
of Tensor and metadata, but works like a normal tensor in PyTorch operations.

The implementation of `_ChunkTensor` is based on [TrivialTensorViaComposition](https://github.com/albanD/subclass_zoo/blob/0eeb1d68fb59879029c610bc407f2997ae43ba0a/trivial_tensors.py#L83).

It was also suggested to attach metadata directly to Tensor object,
but the possibility to have the collision on torchaudio's metadata and new attributes introduced in
PyTorch cannot be ignored, so we use Tensor subclass implementation.

If any unexpected issue arise from metadata attribute name collision, client code can
fetch the bare Tensor and continue.

Pull Request resolved: https://github.com/pytorch/audio/pull/2975

Reviewed By: hwangjeff

Differential Revision: D42526945

Pulled By: mthrok

fbshipit-source-id: b4e9422e914ff328421b975120460f3001268f35

0dd59e0d

20 Jan, 2023 2 commits

Document StreamReader/Writer C++ code (#2997) · de628226

moto authored Jan 20, 2023

Summary:
Extraction from https://github.com/pytorch/audio/issues/2994

Add docstrings to C++ StreamReader/Writer.

Pull Request resolved: https://github.com/pytorch/audio/pull/2997

Reviewed By: nateanl

Differential Revision: D42628016

Pulled By: mthrok

fbshipit-source-id: b22c43b80997af4a9087142340c67bed28e54917

de628226

Fix error message (#2999) · bcfa9eed

moto authored Jan 20, 2023

Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2999

Reviewed By: hwangjeff

Differential Revision: D42637618

Pulled By: mthrok

fbshipit-source-id: 35a7976c316e3b3899ae9c2202f132f1a960b736

bcfa9eed