1. 25 Jan, 2024 1 commit
  2. 03 Jan, 2024 1 commit
  3. 09 Nov, 2023 1 commit
  4. 31 Oct, 2023 1 commit
  5. 26 Oct, 2023 5 commits
  6. 24 Oct, 2023 2 commits
  7. 13 Oct, 2023 1 commit
  8. 11 Oct, 2023 1 commit
  9. 09 Oct, 2023 1 commit
  10. 19 Sep, 2023 1 commit
  11. 04 Sep, 2023 2 commits
  12. 20 Aug, 2023 1 commit
  13. 15 Aug, 2023 1 commit
    • moto's avatar
      [BC-breaking] Update pre-built ffmpeg4 to 4.4.4 (#3561) · bf07ea6b
      moto authored
      Summary:
      In https://github.com/pytorch/audio/pull/3460, we switched the build process for FFmpeg extension.
      Since it is complicated to install FFmpeg in some environments, at build time, pre-built binaries and its headers
      are downloaded and used as a scaffolding for torchaudio build.
      
      Now even though we did not change any code or FFmpeg version, it turned out that this causes segmentation
      fault on Ubuntu when using system Python and FFmpeg 4.4 installed via aptitude.
      While investigating the issue, I swapped the said pre-built FFmpeg scaffolding with FFmpeg 4.4 from aptitude,
      and the segmentation fault did not happen. This indicates that it is binary compatibility issue.
      
      Before https://github.com/pytorch/audio/issues/3460, each binary build job was building FFmpeg 4.1.8 using the same compiler used to build torchaudio,
      but after https://github.com/pytorch/audio/issues/3460 the environments to build FFmpeg 4.1.8 and torchaudio are different. My hypothesis is that
      this difference is causing some ABI incompatibility when linking against FFmpeg 4.4. (Also, I don't remember well,
      but I read somewhere that 4.4 has a different ABI)
      
      Through experiments, it turned out upgrading the pre-built FFmpeg scaffolding to 4.4 resolves this.
      So this commit upgrade the pre-built FFmpeg 4 to 4.4.
      The potential (yet unconfirmed) downside is that torchaudio will no longer work with 4.1, 4.2, and 4.3.
      Since FFmpeg 4.4 is what Ubuntu 20.04 and 22.04 support by default, and Google Colab is also on 20.04,
      I think it is more important to support 4.4.
      
      Therefore we drop the support for 4.1-4.3 from normal build (and official distributions). Those who wish to
      use 4.1-4.3 can build torchaudio from source by linking to specific FFmpeg.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3561
      
      Reviewed By: hwangjeff
      
      Differential Revision: D48340201
      
      Pulled By: mthrok
      
      fbshipit-source-id: 7ece82910f290c7cf83f58311c4cf6a384e8795e
      bf07ea6b
  14. 14 Aug, 2023 1 commit
  15. 11 Aug, 2023 1 commit
    • moto's avatar
      Expose AudioMetadata (#3556) · 9467fc44
      moto authored
      Summary:
      `torchaudio.info` returns `AudioMetaData`. It should be exposed as public API, without referring `backend` submodule.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3556
      
      Reviewed By: huangruizhe
      
      Differential Revision: D48267349
      
      Pulled By: mthrok
      
      fbshipit-source-id: 6ccc0c32bf62fbdcb71495fc7d8d4cc29891538a
      9467fc44
  16. 10 Aug, 2023 1 commit
  17. 07 Aug, 2023 2 commits
    • moto's avatar
      Add MMS FA Bundle (#3521) · 5e211d66
      moto authored
      Summary:
      Port the MMS FA model from tutorial to the library with post-processing module.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3521
      
      Reviewed By: huangruizhe
      
      Differential Revision: D48038285
      
      Pulled By: mthrok
      
      fbshipit-source-id: 571cf0fceaaab4790983be2719f1a85805b814f5
      5e211d66
    • moto's avatar
      Add merge_tokens / TokenSpan (#3535) · 30668afb
      moto authored
      Summary:
      This commit adds `merge_tokens` function which removes repeated tokens from CTC token sequences returned from `forced_align`.
      
      Resolving repeated tokens is a necessary step and almost universal, thus it makes sense to have such helper function in torchaudio.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3535
      
      Reviewed By: huangruizhe
      
      Differential Revision: D48111202
      
      Pulled By: mthrok
      
      fbshipit-source-id: 25354bfa210aa5c03f8c1d3e201f253ca3761b24
      30668afb
  18. 03 Aug, 2023 1 commit
  19. 01 Aug, 2023 2 commits
  20. 31 Jul, 2023 1 commit
  21. 28 Jul, 2023 3 commits
  22. 27 Jul, 2023 1 commit
    • moto's avatar
      Replace libsox with stub library (#3497) · 8588fba1
      moto authored
      Summary:
      This commit updates the way libsox is integrated to torchaudio
      
      1. We stop statically linking libsox, so torchaudio will not ship libsox.
      2. We link libsox dynamically. Users are expected to install libsox by themselves.
      3. We use stab library to build torchaudio.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3497
      
      Differential Revision: D47803706
      
      Pulled By: mthrok
      
      fbshipit-source-id: 31b05495d81069186fa52d67beea360cc7e817a8
      8588fba1
  23. 25 Jul, 2023 2 commits
  24. 18 Jul, 2023 1 commit
  25. 15 Jul, 2023 1 commit
  26. 12 Jul, 2023 1 commit
    • moto's avatar
      Support multiple FFmpeg versions (#3464) · 786066b4
      moto authored
      Summary:
      This commit introduces support for multiple FFmpeg versions for OSS binary distributions.
      
      Currently torchaudio only works with FFmpeg 4. This is inconvenient from installing to runtime linking.
      This commit allows to pick FFmpeg 4, 5 or 6 at runtime, instead of just looking for v4.
      
      The way it works is that we compile the FFmpeg extension three times with different FFmpeg and ship them.
      At runtime, we look for libavutil of specific version and when one is found, load the corresponding FFmpeg extension.
      The order of preference is 6, 5, then 4.
      
      To make the build process simple and reproducible, we use pre-built binaries of FFmpeg during the build.
      They are LGPL and downloaded from S3 at build time, instead of building every time.
      
      The use of pre-built binaries as scaffolding limits the system that can build torchaudio, so it also introduces
      single FFmpeg version support mode. setting FFMPEG_ROOT during the build will change the way binaries are built
      so that it will only support one specific version of FFmpeg.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/3464
      
      Differential Revision: D47300223
      
      Pulled By: mthrok
      
      fbshipit-source-id: 560c7968315e4c8922afa11a4693f648c0356d04
      786066b4
  27. 11 Jul, 2023 1 commit
  28. 28 Jun, 2023 1 commit
  29. 21 Jun, 2023 1 commit