- 19 Sep, 2023 1 commit
-
-
moto authored
Some changes in matplotlib 3.8.0 reject a torch.Tensor passed to the `plot` function.
-
- 13 Sep, 2023 2 commits
- 08 Sep, 2023 1 commit
-
-
Pingchuan Ma authored
* Simplify training step in av-asr recipe * Run pre-commit
-
- 07 Sep, 2023 2 commits
-
-
moto authored
-
Pingchuan Ma authored
-
- 05 Sep, 2023 2 commits
-
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3578 Reviewed By: nateanl Differential Revision: D48971591 Pulled By: mthrok fbshipit-source-id: a76cf8f1287c076d7b6919d901eda0ce832bdb7d
-
moto authored
Summary: The PR https://github.com/pytorch/audio/issues/3549 re-organized the backend implementations and deprecated direct access to torchaudio.backend. The change was supposed to be backward compatible while issuing a warning to users, but the implementation of the module-level `__getattr__` was not quite right. See the issue https://github.com/pyannote/pyannote-audio/pull/1456. This commit fixes it so that the following imports work:

```python
from torchaudio.backend.common import AudioMetaData
from torchaudio.backend import sox_io_backend
from torchaudio.backend.sox_io_backend import save, load, info
from torchaudio.backend import no_backend
from torchaudio.backend.no_backend import save, load, info
from torchaudio.backend import soundfile_backend
from torchaudio.backend.soundfile_backend import save, load, info
```

Pull Request resolved: https://github.com/pytorch/audio/pull/3595 Reviewed By: nateanl Differential Revision: D48957446 Pulled By: mthrok fbshipit-source-id: ebb256461dd3032025fd27d0455ce980888f7778
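For readers unfamiliar with the mechanism, the following is a minimal sketch of a module-level `__getattr__` (PEP 562) that keeps deprecated names working while warning the caller. It is not the torchaudio implementation: `make_legacy_module`, `legacy_backend`, and `new_load` are hypothetical names, and the sketch covers attribute access only, not making submodules importable.

```python
import types
import warnings


def make_legacy_module(name, replacements):
    """Create a module that resolves deprecated names lazily via __getattr__.

    `name` and `replacements` are hypothetical; `replacements` maps a
    deprecated attribute name to the object that now provides it.
    """
    module = types.ModuleType(name)

    def __getattr__(attr):
        # Only called when normal attribute lookup on the module fails.
        if attr in replacements:
            warnings.warn(
                f"{name}.{attr} is deprecated and may be removed in a future release.",
                DeprecationWarning,
                stacklevel=2,
            )
            return replacements[attr]
        # Raising AttributeError keeps hasattr() and getattr() with a
        # default behaving as they do for ordinary modules.
        raise AttributeError(f"module {name!r} has no attribute {attr!r}")

    # PEP 562: a function named __getattr__ in the module namespace is used
    # as the fallback for missing attributes.
    module.__getattr__ = __getattr__
    return module


def new_load(path):
    """Hypothetical stand-in for the relocated implementation."""
    return f"loaded {path}"


legacy = make_legacy_module("legacy_backend", {"load": new_load})
```

Accessing `legacy.load` returns `new_load` and emits a `DeprecationWarning`, while an unknown name still raises `AttributeError`. Forgetting that last step is a common way for such a shim to break `hasattr` checks.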
-
- 04 Sep, 2023 4 commits
-
-
moto authored
Summary: This PR removes the legacy backend switch mechanism. The implementation itself is still available. Merge after v2.1 release Pull Request resolved: https://github.com/pytorch/audio/pull/3559 Reviewed By: nateanl Differential Revision: D48353764 Pulled By: mthrok fbshipit-source-id: 4d3924dbe6f334ecebe2b12fcd4591c61c4aa656
-
hwangjeff authored
Summary: Fixes decoder calls and related code in Device ASR/AVSR tutorials to account for changes to RNN-T decoder introduced in https://github.com/pytorch/audio/issues/3295. Pull Request resolved: https://github.com/pytorch/audio/pull/3572 Reviewed By: mthrok Differential Revision: D48629428 Pulled By: hwangjeff fbshipit-source-id: 63ede307fb4412aa28f88972d56dca8405607b7a
-
moto authored
Summary: Add incremental decoding support to CTC decoder. Resolves https://github.com/pytorch/audio/issues/3574 Pull Request resolved: https://github.com/pytorch/audio/pull/3594 Reviewed By: nateanl Differential Revision: D48940584 Pulled By: mthrok fbshipit-source-id: 31871614008cf197cf3900f7183ec6cff34d2905
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3593 Reviewed By: nateanl Differential Revision: D48933041 Pulled By: mthrok fbshipit-source-id: cd05d3cf5006206ba441fdc05548bcd922ce0598
-
- 02 Sep, 2023 1 commit
-
-
Omkar Salpekar authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3587 Reviewed By: huydhn Differential Revision: D48847224 Pulled By: osalpekar fbshipit-source-id: 15ccae56f0e5e8e366b5feb66fdb42bb5cba97e1
-
- 01 Sep, 2023 1 commit
-
-
Omkar Salpekar authored
Summary: First Pass for automating release-only changes. Tested this in audio to ensure it works. Pull Request resolved: https://github.com/pytorch/audio/pull/3590 Reviewed By: huydhn Differential Revision: D48921925 Pulled By: osalpekar fbshipit-source-id: 2920e4d984ffb0c1957dae6fb6bab10344e65ff7
-
- 30 Aug, 2023 2 commits
-
-
atalman authored
Summary: This reverts commit c5939616. Unblock 2.1.0 rc Pull Request resolved: https://github.com/pytorch/audio/pull/3586 Reviewed By: osalpekar Differential Revision: D48842032 Pulled By: atalman fbshipit-source-id: bbdf9e45c9aa5fde00f315a2ff491ed050bc1707
-
Omkar Salpekar authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3585 Reviewed By: atalman Differential Revision: D48832146 Pulled By: osalpekar fbshipit-source-id: c36dd79a0c0f407d490a488902207c8066c01383
-
- 29 Aug, 2023 2 commits
-
-
Omkar Salpekar authored
Summary: We will use a separate token for uploading test binaries (instead of reusing the nightly token). This PR adds that token to the caller workflow. Pull Request resolved: https://github.com/pytorch/audio/pull/3582 Reviewed By: atalman Differential Revision: D48803009 Pulled By: osalpekar fbshipit-source-id: c2af57f6946da51a7b56c975614e60f243e3f6fb
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3577 Reviewed By: atalman Differential Revision: D48763580 Pulled By: mthrok fbshipit-source-id: 6ab155a5dd4cf11b2a58f26ced369107f0a2f08f
-
- 23 Aug, 2023 1 commit
-
-
pbialecki authored
Summary: Follow-up of: https://github.com/pytorch/builder/pull/1485 CC atalman Pull Request resolved: https://github.com/pytorch/audio/pull/3563 Reviewed By: kit1980 Differential Revision: D48610200 Pulled By: atalman fbshipit-source-id: 61c9981da5a343a3cbce97b0a77ab91f37560087
-
- 21 Aug, 2023 2 commits
-
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3570 Reviewed By: huangruizhe Differential Revision: D48518568 Pulled By: mthrok fbshipit-source-id: 0fdfb8b3988789c7ded0fb336824034bedf6a394
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3569 Reviewed By: huangruizhe Differential Revision: D48508244 Pulled By: mthrok fbshipit-source-id: 6e14267e2dbdf08ea3c25a1dab480cb0e908e0c3
-
- 20 Aug, 2023 3 commits
-
-
moto authored
Summary: It turned out that FFmpeg 5 installed via conda reports a video frame rate of -1. FFmpeg 4 and 6 are fine. This is a regression either in FFmpeg or in the underlying decoding library. Make the reference value adaptive. Pull Request resolved: https://github.com/pytorch/audio/pull/3568 Reviewed By: huangruizhe Differential Revision: D48499621 Pulled By: mthrok fbshipit-source-id: fb64187bcf0dc57b753cb6c05f04d436238f5c51
-
moto authored
Summary: It seems that the default Python version was updated to 3.11. libcst does not have a binary release for 3.11, so the CI attempts to build it from source, but this fails because building libcst requires a Rust compiler. This commit pins the Python version of the style check job to 3.10 so that the issue with the Rust compiler is avoided. Pull Request resolved: https://github.com/pytorch/audio/pull/3564 Reviewed By: huangruizhe Differential Revision: D48499560 Pulled By: mthrok fbshipit-source-id: 53ab77268d8143f4946d92e8cd1f96aea55e7b72
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3566 Reviewed By: huangruizhe Differential Revision: D48499338 Pulled By: mthrok fbshipit-source-id: 7f837e1a1f8116d7d82411607c91628b729077d8
-
- 19 Aug, 2023 1 commit
-
-
Juan Villamizar authored
Summary: Added HIPIFY code and small changes for ROCm. Targeting RNN-T loss. Pull Request resolved: https://github.com/pytorch/audio/pull/2485 Reviewed By: huangruizhe Differential Revision: D43537864 Pulled By: mthrok fbshipit-source-id: 4bdb1f291dc51a12232ccd072b97ae94ae20cc0c
-
- 18 Aug, 2023 1 commit
-
-
moto authored
Summary: Remove mention of backend and quick usage. Those are explained in the documentation in detail. Pull Request resolved: https://github.com/pytorch/audio/pull/3567 Reviewed By: huangruizhe Differential Revision: D48471832 Pulled By: mthrok fbshipit-source-id: 467efc1f11f66534c33cf4751de27b08176c31bf
-
- 15 Aug, 2023 2 commits
-
-
Andrey Talman authored
Summary: Use pytorch/manylinuxaarch64-builder:cpu-aarch64 Introduced in https://github.com/pytorch/builder/pull/1472 Pull Request resolved: https://github.com/pytorch/audio/pull/3560 Reviewed By: mthrok Differential Revision: D48366572 Pulled By: atalman fbshipit-source-id: 6de15f81abb09c737e6a1271226259483141e8f4
-
moto authored
Summary: In https://github.com/pytorch/audio/pull/3460, we switched the build process for the FFmpeg extension. Since it is complicated to install FFmpeg in some environments, pre-built binaries and their headers are downloaded at build time and used as scaffolding for the torchaudio build. Even though we did not change any code or the FFmpeg version, it turned out that this causes a segmentation fault on Ubuntu when using the system Python and FFmpeg 4.4 installed via aptitude. While investigating the issue, I swapped the pre-built FFmpeg scaffolding with FFmpeg 4.4 from aptitude, and the segmentation fault did not happen. This indicates that it is a binary compatibility issue. Before https://github.com/pytorch/audio/issues/3460, each binary build job was building FFmpeg 4.1.8 using the same compiler used to build torchaudio, but after https://github.com/pytorch/audio/issues/3460 the environments used to build FFmpeg 4.1.8 and torchaudio are different. My hypothesis is that this difference is causing some ABI incompatibility when linking against FFmpeg 4.4. (Also, I don't remember well, but I read somewhere that 4.4 has a different ABI.) Experiments showed that upgrading the pre-built FFmpeg scaffolding to 4.4 resolves this, so this commit upgrades the pre-built FFmpeg 4 to 4.4. The potential (yet unconfirmed) downside is that torchaudio will no longer work with 4.1, 4.2, and 4.3. Since FFmpeg 4.4 is what Ubuntu 20.04 and 22.04 support by default, and Google Colab is also on 20.04, I think it is more important to support 4.4. Therefore we drop support for 4.1-4.3 from the normal build (and official distributions). Those who wish to use 4.1-4.3 can build torchaudio from source by linking to a specific FFmpeg. Pull Request resolved: https://github.com/pytorch/audio/pull/3561 Reviewed By: hwangjeff Differential Revision: D48340201 Pulled By: mthrok fbshipit-source-id: 7ece82910f290c7cf83f58311c4cf6a384e8795e
-
- 14 Aug, 2023 5 commits
-
-
moto authored
Summary: Move the actual I/O implementation to `_backend` submodule so that the existing `backend` submodule contains only what's related to legacy backend utilities. Pull Request resolved: https://github.com/pytorch/audio/pull/3549 Reviewed By: huangruizhe Differential Revision: D48253550 Pulled By: mthrok fbshipit-source-id: c23f1664458c723f63e134c7974b3f7cf17a1e98
-
moto authored
Summary: * Merge backend doc into torchaudio toplevel doc * Update backend, dispatcher, installation doc Pull Request resolved: https://github.com/pytorch/audio/pull/3555 Reviewed By: huangruizhe Differential Revision: D48326812 Pulled By: mthrok fbshipit-source-id: cc0d7326eacfebd341323b5d613ca1777255748b
-
moto authored
Summary: Update the Ubuntu image so that CI is triggered. There is an issue with FFmpeg 4, so CI does not succeed. This will be handled separately. Pull Request resolved: https://github.com/pytorch/audio/pull/3502 Reviewed By: huangruizhe Differential Revision: D48327431 Pulled By: mthrok fbshipit-source-id: 5ea639f3e20c3aaf460e6030f6cb1ad2daa00172
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3557 Reviewed By: huangruizhe Differential Revision: D48326462 Pulled By: mthrok fbshipit-source-id: c37ae38e28e4514ea284613636604a725829346d
-
Jeff Hwang authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3558 In the event that `use_tmp_hub_dir` isn't specified as an option, pytest shouldn't fail. To resolve such failures, this PR modifies function `temp_hub_dir` to fall back on a default value of `False` for `use_tmp_hub_dir`. Reviewed By: mthrok Differential Revision: D48318947 fbshipit-source-id: 5dd692f9202ef37ec3e2c9ea39896156f928d693
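The fallback described above can be sketched as follows. This is an illustration under stated assumptions rather than the code from the PR: `FakeConfig` is a hypothetical stand-in that mimics the `getoption(name, default=...)` call of a pytest config object, and `should_use_tmp_hub_dir` is a hypothetical helper name.

```python
class FakeConfig:
    """Hypothetical stand-in for a pytest config object (illustration only)."""

    def __init__(self, options):
        self._options = dict(options)

    def getoption(self, name, default=None):
        # Return the registered option value, or the default if the option
        # was never declared.
        return self._options.get(name, default)


def should_use_tmp_hub_dir(config):
    """Return the `use_tmp_hub_dir` option, falling back to False."""
    # Passing default=False means a missing option no longer raises.
    return bool(config.getoption("use_tmp_hub_dir", default=False))
```

With this shape, a test session that never registers the option behaves as if `use_tmp_hub_dir` were `False`.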
-
- 11 Aug, 2023 3 commits
-
-
moto authored
Summary: `torchaudio.info` returns `AudioMetaData`. It should be exposed as a public API, without referring to the `backend` submodule. Pull Request resolved: https://github.com/pytorch/audio/pull/3556 Reviewed By: huangruizhe Differential Revision: D48267349 Pulled By: mthrok fbshipit-source-id: 6ccc0c32bf62fbdcb71495fc7d8d4cc29891538a
-
Jeff Hwang authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3551 Restores VGGish pipeline test to be a function rather than class. Reviewed By: mthrok Differential Revision: D48236197 fbshipit-source-id: 25ac19d87a7a0964a9c3f7552037cd6c21dc38a9
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3554 Reviewed By: huangruizhe Differential Revision: D48240906 Pulled By: mthrok fbshipit-source-id: 1936757646f8ebba74e8b65e2ffe2a8b74fdfeeb
-
- 10 Aug, 2023 4 commits
-
-
moto authored
Summary: * Move Backend implementations to separate files Pull Request resolved: https://github.com/pytorch/audio/pull/3547 Reviewed By: hwangjeff Differential Revision: D48233538 Pulled By: mthrok fbshipit-source-id: bcc63fc07a5dfcd48929f0a2fb64bfcb3282eb92
-
Jeff Hwang authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/3545 Adds function for computing the Fréchet distance between two multivariate normal distributions. Reviewed By: mthrok Differential Revision: D48126102 fbshipit-source-id: e4e122b831e1e752037c03f5baa9451e81ef1697
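As background, the squared Fréchet (2-Wasserstein) distance between two Gaussians is commonly written as |mu_x - mu_y|^2 + Tr(S_x + S_y - 2 (S_x S_y)^(1/2)). The sketch below covers only the special case of diagonal covariances, where the matrix square root reduces to an elementwise one. It is not the function added by the PR: `frechet_distance_diag` is a hypothetical name, and the log does not state whether the added function returns this exact quantity.

```python
import math


def frechet_distance_diag(mu_x, var_x, mu_y, var_y):
    """Squared Fréchet distance between two Gaussians with diagonal covariance.

    mu_* are sequences of means; var_* are sequences of per-dimension
    variances (the diagonals of the covariance matrices).
    """
    # Squared Euclidean distance between the mean vectors.
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_x, mu_y))
    # Trace term: for diagonal matrices, sqrt(S_x S_y) is elementwise.
    cov_term = sum(
        vx + vy - 2.0 * math.sqrt(vx * vy) for vx, vy in zip(var_x, var_y)
    )
    return mean_term + cov_term
```

Identical distributions give a distance of zero. For full covariance matrices a matrix square root is required, which this sketch deliberately avoids.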
-
Mike Schneider authored
Summary: # Changes * Adding workflow for building aarch64 wheels. Pull Request resolved: https://github.com/pytorch/audio/pull/3553 Reviewed By: hwangjeff, osalpekar Differential Revision: D48239384 Pulled By: atalman fbshipit-source-id: dfa00edb3fee0acaf2b83fb420eaf12bddc6980e
-
moto authored
Summary: The backend dispatcher is implemented in `torchaudio._backend`, while the legacy backend is implemented in `torchaudio.backend`. The initialization happens in `torchaudio._backend`. This commit moves it to `torchaudio.__init__`, so that `backend` and `_backend` are more independent. Pull Request resolved: https://github.com/pytorch/audio/pull/3548 Reviewed By: huangruizhe Differential Revision: D48219244 Pulled By: mthrok fbshipit-source-id: e694cb232794f90902a60ee51c7bf11b7f0548a0
-