Commits · 2ba36b479a8faf0f01d29c5c2991f8bebf3a4efc · OpenDAS / Torchaudio

01 Jun, 2023 1 commit

[BC-breaking] Remove file-like object support from sox_io backend (#3035) · bc54ac8a

moto authored Jun 01, 2023

Summary:
This commit removes file-like obejct support so that we can remove custom patch

The motivation and plan is outlined in https://github.com/pytorch/audio/issues/2950.

Pull Request resolved: https://github.com/pytorch/audio/pull/3035

Reviewed By: hwangjeff

Differential Revision: D44695647

Pulled By: mthrok

fbshipit-source-id: 13af0234e288c041bc7b490e1f967f85ce7eb8ec

bc54ac8a

16 May, 2023 1 commit

Remove obsolete third party dependencies of CTC decoder (#3339) · e4c1d70b

moto authored May 16, 2023

Summary:
TorchAudio has migrated CTC decoder to flashlight-text, and code related CTC decoder was removed in https://github.com/pytorch/audio/issues/3236.

This commit cleans up the residual, removes the third party libraries used for CTC decoder, and mention to environment variable for CTC decoder.

Pull Request resolved: https://github.com/pytorch/audio/pull/3339

Reviewed By: nateanl

Differential Revision: D45920878

Pulled By: mthrok

fbshipit-source-id: 8d93e64138697781570e5b0b1c9f86e1a7923a89

e4c1d70b

28 Apr, 2023 1 commit

Add cuctc decoder (#3096) · 0a1801ed

Yuekai Zhang authored Apr 28, 2023

Summary:
This PR implements a CUDA based ctc prefix beam search decoder.

Attach serveral benchmark results using V100 below:
|decoder type| model |datasets       | decoding time (secs)| beam size | batch size | model unit | subsampling times | vocab size |
|--------------|---------|------|-----------------|------------|-------------|------------|-----------------------|------------|
| cuctc |  conformer nemo    |dev clean        |7.68s | 8           |  32       | bpe         |    4  | 1000|
| cuctc |  conformer nemo   |dev clean  (sort by length)      |1.6s | 8           |  32       | bpe         |    4  | 1000|
| cuctc |  wav2vec2.0 torchaudio |dev clean                                |22s | 10           |  1       | char         |    2  | 29|
| cuctc |   conformer espnet   |aishell1 test                             | 5s | 10           |  24       | char         |    4  | 4233|

Note:
1.  The design is to parallel computation through batch and vocab axis, for loop the frames axis. So it's more friendly with smaller sequence lengths, larger vocab size comparing with CPU implementations.
2. WER is the same as CPU implementations. However, it can't decode with LM now.

Resolves: https://github.com/pytorch/audio/issues/2957.

Pull Request resolved: https://github.com/pytorch/audio/pull/3096

Reviewed By: nateanl

Differential Revision: D44709397

Pulled By: mthrok

fbshipit-source-id: 3078c54a2b44dc00eb4a81b4c657487eeff8c155

0a1801ed

05 Apr, 2023 1 commit

Remove source for flashlight-text bundle (#3236) · 5053aa7f

moto authored Apr 05, 2023

Summary:
Following https://github.com/pytorch/audio/pull/3232, static build of flashlight-text has been disabled and removed from nightly build.

This commit removes the related source/build from torchaudio code base.

Pull Request resolved: https://github.com/pytorch/audio/pull/3236

Reviewed By: jacobkahn

Differential Revision: D44712539

Pulled By: mthrok

fbshipit-source-id: a201c89b5046f224526309cd4e17a5105e58a949

5053aa7f

29 Dec, 2022 1 commit

Refactor CMake modules (#2930) · 7b5317b3

moto authored Dec 29, 2022

Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2930

Reviewed By: carolineechen, nateanl

Differential Revision: D42280966

Pulled By: mthrok

fbshipit-source-id: f9d5f1dc7c1a62d932fb2020aafb63734f2bf405

7b5317b3

29 Jul, 2022 1 commit

Enable CTC decoder in Windows (#2587) · 67cb420d

moto authored Jul 29, 2022

Summary:
This commit enables CTC decoder on Windows.

The functionality seems to work fine.
The tests are passing, the decoding tutorial runs fine.

The only difference to the Linux/macOS version is that
loading model in XZ compression format is not supported.

![289961785_399620772041679_7768117002438616376_n](https://user-images.githubusercontent.com/855818/181420923-cfbd8402-20de-4e63-b9e4-e39f9aa9fc50.png)

Pull Request resolved: https://github.com/pytorch/audio/pull/2587

Reviewed By: carolineechen, nateanl

Differential Revision: D38276490

Pulled By: mthrok

fbshipit-source-id: f2203b2235c5bbb0220fe560aaaf0e1d5530347a

67cb420d

28 Jul, 2022 1 commit

Migrate CTC decoder code (#2580) · 39b6343d

moto authored Jul 28, 2022

Summary:
This commit gets rid of our copy of CTC decoder code and
replace it with upstream Flashlight-Text repo.

Pull Request resolved: https://github.com/pytorch/audio/pull/2580

Reviewed By: carolineechen

Differential Revision: D38244906

Pulled By: mthrok

fbshipit-source-id: d274240fc67675552d19ff35e9a363b9b9048721

39b6343d

19 Jul, 2022 1 commit

Remove boost (#2552) · ee631d6b

moto authored Jul 19, 2022

Summary:
After reviewing the code for KenLM it turned out that we can build it without boost.

Pull Request resolved: https://github.com/pytorch/audio/pull/2552

Reviewed By: xiaohui-zhang

Differential Revision: D37949699

Pulled By: mthrok

fbshipit-source-id: 4a4ffae4220d0b764b53f52b93040670d91a84a3

ee631d6b

15 Jun, 2022 1 commit

Update config.guess to the latest (#2479) · 575478ec

moto authored Jun 14, 2022

Summary:
closes https://github.com/pytorch/audio/issues/2420

Pull Request resolved: https://github.com/pytorch/audio/pull/2479

Reviewed By: carolineechen

Differential Revision: D37142717

Pulled By: mthrok

fbshipit-source-id: c3d4cc1435a74dfa6992112590c988c2903511a8

575478ec

02 Jun, 2022 1 commit

Remove mad (#2428) · d2ecba98

moto authored Jun 02, 2022

Summary:
Remove the code related to libmad, which had been disabled in https://github.com/pytorch/audio/issues/2354

In https://github.com/pytorch/audio/issues/2419, we mp3 decoding to ffmpeg. But CI tests were still using libmad.
This commit completely removes libmad from torchaudio.

This is BC-breaking change as `apply_sox_effects_file` function cannot handle MP3, and it cannot fallback to ffmpeg.
The workaround for this is to use `torchaudio.load` then `apply_sox_effects_tensor`.

Pull Request resolved: https://github.com/pytorch/audio/pull/2428

Reviewed By: carolineechen

Differential Revision: D36851805

Pulled By: mthrok

fbshipit-source-id: f98795c59a1ac61cef511f2bbeac37f7c3c69d55

d2ecba98

28 Apr, 2022 1 commit

Add BUILD_MAD option and default to OFF (#2354) · a71e3a40

moto authored Apr 28, 2022

Summary:
libmad integration should be enabled only from source-build

Pull Request resolved: https://github.com/pytorch/audio/pull/2354

Reviewed By: nateanl

Differential Revision: D36012035

Pulled By: mthrok

fbshipit-source-id: adeda8cbfd418f96245909cae6862b648a6915a7

a71e3a40

01 Apr, 2022 1 commit

Update GNU config files to support `arm64-apple` system (#2307) · 3ed39e15

moto authored Apr 01, 2022

Summary:
This commit
1. Updates the config.guess and config.sub files and
2. applies them to all the third party libraries that use them.

This resolves the following build failure on M1 mac with newer SDK.

On MacBookPro with M1 chip, with the recent OS update, something
about the development environment has been changed (probably newer
version of XCode) and the build stopeed working with the following
errors from third party dependencies.

```
checking build system type... Invalid configuration ‘arm64-apple-darwin20.0.0': machine ‘arm64-apple' not recognized
```

note: config files are taken from https://www.gnu.org/software/gettext/manual/html_node/config_002eguess.html

Pull Request resolved: https://github.com/pytorch/audio/pull/2307

Reviewed By: nateanl

Differential Revision: D35318273

Pulled By: mthrok

fbshipit-source-id: 746ac51dd1816767aa78b88445f76a29acfd29e8

3ed39e15

30 Mar, 2022 2 commits

Use zlib v1.2.12 with GitHub source (#2300) · 050b2fb4

Zhaoheng Ni authored Mar 30, 2022

Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2300

Reviewed By: xiaohui-zhang

Differential Revision: D35258323

Pulled By: nateanl

fbshipit-source-id: 4b9f86600399ba0f5ec47f1c402968a812aa557d

050b2fb4

Use sourceforge url to fetch zlib (#2297) · 03badcd3

Zhaoheng Ni authored Mar 30, 2022

Summary:
This PR addresses https://github.com/pytorch/audio/issues/2295 by updating `zlib`'s url to the one on sourceforge.net.
`zlib` 1.2.11 source code is removed from the official site. According to https://zlib.net, ```Due to the bug fixes, any installations of 1.2.11 should be replaced with 1.2.12.```
sourceforge preserves the older versions thus is more stable. The PR keep 1.2.11 as currently there is no 1.2.12 on sourceforge.

Pull Request resolved: https://github.com/pytorch/audio/pull/2297

Reviewed By: mthrok

Differential Revision: D35251361

Pulled By: nateanl

fbshipit-source-id: 174c2c2e1c34bef9799bbacfd1e12c8ff13ff15d

03badcd3

22 Mar, 2022 1 commit

Revise the parameterization of third party libraries (#2282) · 7444f568

moto authored Mar 22, 2022

Summary:
Originally, the global property TORCHAUDIO_THIRD_PARTIES was introduced
to handle the optional third party dependencies that can change based on
the build config.

After revising the CMake, it turned out this is not really necessary,
as our torchaudio/csrc/CMakeLists.txt properly branches out for
conditional dependencies. Rather we should leave the global scope untouched.

Pull Request resolved: https://github.com/pytorch/audio/pull/2282

Reviewed By: hwangjeff

Differential Revision: D35059838

Pulled By: mthrok

fbshipit-source-id: ed3557eaa9a669e4466d64893beab5089eca78b8

7444f568

06 Mar, 2022 1 commit

Fix Kaldi submodule integration (#2269) · a92ae368

moto authored Mar 06, 2022

Summary:
When building Kaldi submodule, it requires to run `get_version.sh`, so that version header is available.
It was pointed that the script should run with `bash`, instead of `sh`.

Fixes https://github.com/pytorch/audio/issues/2268

Pull Request resolved: https://github.com/pytorch/audio/pull/2269

Reviewed By: carolineechen

Differential Revision: D34667726

Pulled By: mthrok

fbshipit-source-id: 761b82c54b58af2bfb2836cbe18c9708f853f1e1

a92ae368

15 Feb, 2022 1 commit

Improve ffmpeg library discovery (#2204) · 963905e4

moto authored Feb 15, 2022

Summary:
This commit fixes the issue with ffmpeg discovery at build time.
The original implementation had issues like.

1. Wrong usage of FindFFMPEG, which caused mixture of ffmpeg libraries from system directory and user directory.
2. The optional `FFMPEG_ROOT` variable was not set within cmake.

The issue 1 is problematic when a user does not have a permission to
modify the environment. For example, an old version of ffmpeg, which is
installed in a directory managed by the system (such as `/usr/local/lib`),
then there is no way to specify a path in which user installs a supported version
of ffmpeg.

This commit changes the behavior by first searching the library
in `FFMPEG_ROOT` environment variables, then
resorting to the original behavior of searching the custom paths with
system default path.

Also this commirt removes support for `libavresample`, which is deprecated in
ffmpeg 4 and removed in ffmpeg 5.

Pull Request resolved: https://github.com/pytorch/audio/pull/2204

Reviewed By: carolineechen

Differential Revision: D34225769

Pulled By: mthrok

fbshipit-source-id: 95b0bfaaef31e2e69e6df29f789010f48a48210b

963905e4

05 Jan, 2022 1 commit

Update ffmpeg discovery logic (#2124) · d8a65450

moto authored Jan 05, 2022

Summary:
Update ffmpeg discovery logic

Previously the build process used pkg-config to locate
an installation of ffmpeg, which does not work well Windows/CentOS.

This commit update the discovery process to use the custom
FindFFMPEG.cmake adopted from Kitware/VTK repository with addition of
conda environment.

 The custom discovery logic can support Windows and CentOS.

Pull Request resolved: https://github.com/pytorch/audio/pull/2124

Reviewed By: carolineechen

Differential Revision: D33429564

Pulled By: mthrok

fbshipit-source-id: 6cb50c1d8c58f51e0f3f3af5c5b541aa3a699bba

d8a65450

04 Jan, 2022 1 commit

[CI] Install tools from conda instead of brew (#1873) · df0175e8

moto authored Jan 04, 2022

Summary:
Currently, macOS CI jobs install `pkg-config` and `wget` with `brew`.
This is problematic as brew takes a long time with auto-update, and disabling the auto-update is not an ideal solution.
Conda also distributes these packages, so switching to conda.

Example issues with brew installation.
https://app.circleci.com/pipelines/github/pytorch/audio/7825/workflows/53965bcf-6ddf-4e42-ad52-83fd1bbab717

This commit removes the use of `brew` by
1. Replacing the use of `wget` with `curl` (pre-installed in most distro)
2. Install `pkg-condig` from conda.
Note: All the macOS jobs, including binary build jobs, uses conda. Using `pkg-config` from Conda makes it easy to discover the packages installed from conda. (like `ffmpeg` in https://github.com/pytorch/audio/issues/2122)
3. Add `pkg-config` to conda build-time dependency
4. Make sure that the availability of `pkg-config` is explicitly checked when `sox` is being configured. (otherwise, it will fail at somewhere in the middle of build process with somewhat unintuitve error message)

Pull Request resolved: https://github.com/pytorch/audio/pull/1873

Reviewed By: carolineechen, nateanl

Differential Revision: D33404975

Pulled By: mthrok

fbshipit-source-id: ae512d3a3a422ebfe3b46c492bed44deecc36e72

df0175e8

30 Dec, 2021 3 commits

Build ffmpeg-features in Linux/macOS unittests (#2114) · 9f14fa63

moto authored Dec 30, 2021

Summary:
Preparation to land Python front-end of ffmpeg-related features.

- Set BUILD_FFMPEG=1 in Linux/macOS unit test jobs
- Install ffmpeg and pkg-config from conda-forge
- Add note about Windows build process
- Temporarily avoid `av_err2str`

Pull Request resolved: https://github.com/pytorch/audio/pull/2114

Reviewed By: hwangjeff

Differential Revision: D33371346

Pulled By: mthrok

fbshipit-source-id: b0e16a35959a49a2166109068f3e0cbbb836e888

9f14fa63

Enforce lint checks and fix/mute lint errors (#2116) · 8ed14782

Joao Gomes authored Dec 30, 2021

Summary:
cc mthrok

Pull Request resolved: https://github.com/pytorch/audio/pull/2116

Reviewed By: mthrok

Differential Revision: D33368453

Pulled By: jdsgomes

fbshipit-source-id: 09cf3fe5ed6f771c2f16505633c0e59b0c27453c

8ed14782

Add a switch to build ffmpeg binding (#2048) · ece03edc

moto authored Dec 30, 2021

Summary:
This PR adds `BUILD_FFMPEG` switch to torchaudio build process so that features related to ffmpeg are built.
The flag is false by default, so no CI jobs or development flow are affected.

This is because handling the dependencies around ffmpeg is a bit tricky.
Currently, the CMake file uses `pkg-config` to find an ffmpeg installation in the system.
This works fine for both conda-based installation and system-managed installation (like `apt`).

In subsequent PRs, I will find a solution that works for local development and binary distributions.

Pull Request resolved: https://github.com/pytorch/audio/pull/2048

Reviewed By: hwangjeff, nateanl

Differential Revision: D33367260

Pulled By: mthrok

fbshipit-source-id: 94517acecb62bd6d4e96d4b7cbc3ab3c2a25706c

ece03edc

20 Dec, 2021 2 commits

Standardize the location of third-party source code (#2086) · 2476dd2d

moto authored Dec 20, 2021

Summary:
Previously sox-related third-party source code was archived at
`third_party/sox/archives`.
Recently KenLM-related third-party source code was added and
they are archived at `third_party/archives`.

This PR changes the sox archive location to `third_party/archives`,
so that all the archvies are cached at the same location.

Pull Request resolved: https://github.com/pytorch/audio/pull/2086

Reviewed By: carolineechen

Differential Revision: D33236927

Pulled By: mthrok

fbshipit-source-id: 2f2aa5f4b386fefb46d7c98f7179c04995219f3c

2476dd2d

Remove unnecessary sources from KenLM build (#2085) · db5ac7de

moto authored Dec 20, 2021

Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2085

Reviewed By: carolineechen

Differential Revision: D33235225

Pulled By: mthrok

fbshipit-source-id: 47fe9ec4c93a26322b3a362202ddd3c4654c3f8c

db5ac7de

18 Dec, 2021 1 commit

Add FL Decoder / KenLM integration to build process (#2078) · 246dd52a

moto authored Dec 18, 2021

Summary:
After all the C++ code from https://github.com/pytorch/audio/issues/2072 are added, this commit will enable decoder/KenLM integration in the build process.

Pull Request resolved: https://github.com/pytorch/audio/pull/2078

Reviewed By: carolineechen

Differential Revision: D33198183

Pulled By: mthrok

fbshipit-source-id: 9d7fa76151d06fbbac3785183c7c2ff9862d3128

246dd52a

17 Dec, 2021 1 commit

Add static build of KenLM (#2076) · adc559a8

moto authored Dec 17, 2021

Summary:
Add KenLM and its dependencies required for static build (`zlib`, `bzip2`, `lzma` and `boost-thread`).

The KenLM and its dependencies are build but since no corresponding code on torchaudio side is changed, the resulting torchaudio extension module is not changed. (therefore, as long as build process passes on CI this PR should be good to go.)

Pull Request resolved: https://github.com/pytorch/audio/pull/2076

Reviewed By: carolineechen

Differential Revision: D33189980

Pulled By: mthrok

fbshipit-source-id: 6096113128b939f3cf70990c99aacc4aaa954584

adc559a8

31 Aug, 2021 1 commit

Fix CUDA build logic for _torchaudio.so (#1737) · e3c082b7

Nikita Shulga authored Aug 31, 2021

It's wrong to depend on `${TORCH_LIBRARIES}` as it pulls in explicit
`libcuda.so.1` dependency, which violates the assumption that GPU
accelerated libraries should be loadable with no NVIDIA drivers installed

Instead, make it depend on `torch` target, which includes all necessary
Torch C++ API dependences

e3c082b7

18 Aug, 2021 1 commit

Guard Kaldi's version generation (#1715) · df9d0b47

moto authored Aug 18, 2021

When building torchaudio from source, `get_version.sh` from kaldi is executed everytime,
which results in kaldi-bindings to be always rebuilt.

This commit add "if" guard to the part so that they are not always executed.

df9d0b47

28 Jun, 2021 1 commit

Update config.[sub|guess] for lame and libmad (#1613) · 9b5a2704

Nikita Shulga authored Jun 28, 2021

Needed to allow building on M1

Downloaded from https://git.savannah.gnu.org/gitweb/?p=config.git;a=blob_plain;f=config.guess;hb=HEAD and https://git.savannah.gnu.org/gitweb/?p=config.git;a=blob_plain;f=config.sub;hb=HEAD

9b5a2704

25 May, 2021 1 commit

Update config.guess for lame and libmad (#1484) · 838e1e0a

moto authored May 25, 2021

Replacing the config.guess with a newer version to support newer hardware, such as Nvidia Jetson.

Obtained from: https://git.savannah.gnu.org/gitweb/?p=config.git;a=blob_plain;f=config.guess;hb=HEAD

See: https://www.gnu.org/software/gettext/manual/html_node/config_002eguess.html

838e1e0a

30 Apr, 2021 1 commit

Replace existing prototype RNNT Loss (#1479) · 0c263a93

Caroline Chen authored Apr 30, 2021

Replace the prototype RNNT implementation (using warp-transducer) with one without external library dependencies

0c263a93

19 Apr, 2021 1 commit

Explicitly disable wavpack when building SoX (#1462) · 7355d9fd

moto authored Apr 19, 2021

`wavpack` is a format not supported/tested in torchaudio.
Leaving the option blank can cause the issue like #1461 in untested environment.

7355d9fd

02 Mar, 2021 1 commit
- Make sox selective (#1338) · ecfed4d9
  Caroline Chen authored Mar 02, 2021
  
  ecfed4d9
23 Feb, 2021 1 commit

Fix fileobj I/O un-deterministic behavior (#1297) · 71486acf

moto authored Feb 23, 2021

* Fix fileobj I/O undeterministic behavior

Ever since the file-like object support was added in #1158, the test
was occasionally failing in CI. This PR fixes this.

71486acf

09 Feb, 2021 1 commit
- Add Kaldi Pitch feature (#1243) · 7ee1c46b
  moto authored Feb 09, 2021
  
  7ee1c46b
04 Feb, 2021 1 commit
- Switch to cmake for build (#1187) · 2c8aad97
  moto authored Feb 04, 2021
```
* Switch to cmake for build
* Hide symbols
```
  2c8aad97
21 Jan, 2021 1 commit
- Clean up sox/CMakeLists.txt and its build log (#1190) · 90e753c9
  moto authored Jan 21, 2021
  
  90e753c9
09 Jan, 2021 2 commits
- Clean up libsox source and build directory (#1161) · 7a36c557
  moto authored Jan 08, 2021
  
  7a36c557
- Clean up transducer build (#1159) · 9690e8e1
  moto authored Jan 08, 2021
  
  9690e8e1
05 Jan, 2021 1 commit
- Add RNN Transducer Loss for CPU (#1137) · 6b07bcf8
  Vincent QB authored Jan 05, 2021
  
  6b07bcf8