1. 17 Nov, 2022 1 commit
  2. 16 Nov, 2022 2 commits
  3. 15 Nov, 2022 3 commits
  4. 14 Nov, 2022 2 commits
  5. 13 Nov, 2022 1 commit
  6. 11 Nov, 2022 1 commit
    • Add nova workflow for MacOS and Linux (#2800) · eabf1a13
      DanilBaibak authored
      Summary:
      Added the missing build workflows for MacOS and Linux:
      
      - [x] Linux conda
      - [x] MacOS conda
      
      This does not change the existing builds/uploads in CircleCI, and should not break any existing jobs/workflows. This is just to add back workflows for the MacOS and Linux conda builds with Nova.
      
      We will create a workflow (most likely in test-infra) that compares the binaries to ensure parity before we start uploading with Nova.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2800
      
      Reviewed By: osalpekar
      
      Differential Revision: D41181467
      
      Pulled By: DanilBaibak
      
      fbshipit-source-id: a5c5d4dcfdd778b4045203f6016c20fb42daa01b
  7. 10 Nov, 2022 5 commits
  8. 09 Nov, 2022 2 commits
  9. 08 Nov, 2022 2 commits
    • Enable log probs input for rnnt loss (#2798) · ca478823
      Caroline Chen authored
      Summary:
      Add `fused_log_softmax` argument (default/current behavior = True) to rnnt loss.
      
      When setting it to `False`, call `log_softmax` on the logits before passing them to the rnnt loss function.
      
      The following should produce the same output:
      ```
      rnnt_loss(logits, targets, logit_lengths, target_lengths, fused_log_softmax=True)
      ```
      
      ```
      log_probs = torch.nn.functional.log_softmax(logits, dim=-1)
      rnnt_loss(log_probs, targets, logit_lengths, target_lengths, fused_log_softmax=False)
      ```
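
To make the `fused_log_softmax=False` path concrete, here is a dependency-free sketch of what a log-softmax over the last dimension computes for a single vector of logits. The helper name `log_softmax` and the sample values are illustrative assumptions; this is not torchaudio's or PyTorch's implementation.

```python
import math


def log_softmax(row):
    """Numerically stable log-softmax over one vector of logits."""
    peak = max(row)  # subtract the max so exp() cannot overflow
    log_norm = peak + math.log(sum(math.exp(v - peak) for v in row))
    return [v - log_norm for v in row]


# Made-up logits for a single (time step, target step) cell.
logits = [2.0, 0.5, -1.0]
log_probs = log_softmax(logits)

# Exponentiating the log-probabilities yields a distribution that sums to 1.
total = sum(math.exp(v) for v in log_probs)
```

With `fused_log_softmax=False`, the caller is responsible for this normalization step before the loss is computed.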
      
      Testing: unit tests pass, and the conformer rnnt recipe produces the same results.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2798
      
      Reviewed By: xiaohui-zhang
      
      Differential Revision: D41083523
      
      Pulled By: carolineechen
      
      fbshipit-source-id: e15442ceed1f461bbf06b724aa0561ff8827ad61
    • Add convolution transforms (#2811) · 2d99fee2
      hwangjeff authored
      Summary:
      Adds `torch.nn.Module`-based implementations for convolution and FFT convolution.
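
As a rough illustration of why a direct convolution and an FFT-based convolution are interchangeable, the sketch below computes a full 1-D convolution two ways in plain Python. The function names are hypothetical, and a naive O(n²) DFT stands in for a real FFT to keep the example self-contained; it is not the code added by this PR.

```python
import cmath


def convolve_direct(x, y):
    """Full linear convolution by summing shifted products."""
    out = [0.0] * (len(x) + len(y) - 1)
    for i, xi in enumerate(x):
        for j, yj in enumerate(y):
            out[i + j] += xi * yj
    return out


def dft(values, inverse=False):
    """Naive discrete Fourier transform (stand-in for an FFT)."""
    n = len(values)
    sign = 1 if inverse else -1
    result = []
    for k in range(n):
        acc = sum(
            v * cmath.exp(sign * 2j * cmath.pi * k * t / n)
            for t, v in enumerate(values)
        )
        result.append(acc / n if inverse else acc)
    return result


def convolve_via_dft(x, y):
    """Zero-pad both inputs, multiply their spectra, transform back."""
    n = len(x) + len(y) - 1
    fx = dft(list(x) + [0.0] * (n - len(x)))
    fy = dft(list(y) + [0.0] * (n - len(y)))
    product = [a * b for a, b in zip(fx, fy)]
    return [c.real for c in dft(product, inverse=True)]


x = [1.0, 2.0, 3.0]
y = [0.0, 1.0, 0.5]
direct = convolve_direct(x, y)
via_dft = convolve_via_dft(x, y)
```

Both results agree up to floating-point error; the frequency-domain route tends to pay off only when the inputs are long.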
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2811
      
      Reviewed By: carolineechen
      
      Differential Revision: D40881937
      
      Pulled By: hwangjeff
      
      fbshipit-source-id: bfe8969e6178ad4f58981efd4b2720ac006be8de
  10. 04 Nov, 2022 1 commit
  11. 03 Nov, 2022 1 commit
  12. 02 Nov, 2022 5 commits
  13. 01 Nov, 2022 1 commit
    • Fix convolve mode docstring (#2809) · 6318c81f
      hwangjeff authored
      Summary:
      Argument `mode` in `convolve` and `fftconvolve` is expected to be a string, but the docstrings incorrectly say bool. This PR fixes the docstrings accordingly.
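
A minimal sketch of how a string-valued `mode` might select the output region of a convolution. The mode names `"full"`, `"same"` and `"valid"` and the centered cropping follow the common NumPy/SciPy convention, which is assumed here rather than taken from the PR; the function is a hypothetical stand-in, not torchaudio's `convolve`.

```python
def convolve(x, y, mode="full"):
    """1-D convolution whose output size is chosen by a string mode."""
    full = [0.0] * (len(x) + len(y) - 1)
    for i, xi in enumerate(x):
        for j, yj in enumerate(y):
            full[i + j] += xi * yj

    if mode == "full":
        return full
    if mode == "same":
        # Same length as the first input, centered in the full result.
        length = len(x)
    elif mode == "valid":
        # Only positions where the inputs overlap completely.
        length = max(len(x), len(y)) - min(len(x), len(y)) + 1
    else:
        raise ValueError(f"unrecognized mode: {mode!r}")

    start = (len(full) - length) // 2
    return full[start:start + length]


x = [1.0, 2.0, 3.0]
y = [0.0, 1.0, 0.5]
full = convolve(x, y, mode="full")
same = convolve(x, y, mode="same")
valid = convolve(x, y, mode="valid")
```

In this sketch a boolean such as `mode=True` matches none of the recognized strings and raises `ValueError`.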
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2809
      
      Reviewed By: nateanl
      
      Differential Revision: D40854464
      
      Pulled By: hwangjeff
      
      fbshipit-source-id: 75b339ba34715723c93b91e7d48be2ed28bee115
  14. 31 Oct, 2022 1 commit
  15. 29 Oct, 2022 1 commit
  16. 28 Oct, 2022 2 commits
  17. 27 Oct, 2022 1 commit
  18. 26 Oct, 2022 2 commits
    • Deprecate 'onesided' init param for MelSpectrogram (#2797) · 546e699a
      hwangjeff authored
      Summary:
      Initializer parameter `onesided` isn't relevant to `MelSpectrogram`: it should always be `True`. In fact, the module already assumes `onesided == True` in the filterbank it generates and fails in its forward pass when `onesided == False`. Accordingly, this PR makes param `onesided` optional and adds a deprecation warning that fires when the param is provided.
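
The pattern described above (make the parameter optional, warn only when a caller passes it) can be sketched as follows. The class `MelSpectrogramSketch`, its defaults, the warning text and the `DeprecationWarning` category are all illustrative assumptions, not the actual torchaudio code.

```python
import warnings
from typing import Optional


class MelSpectrogramSketch:
    """Hypothetical stand-in showing an optional, deprecated init parameter."""

    def __init__(self, n_fft: int = 400, onesided: Optional[bool] = None):
        if onesided is not None:
            # Warn only when the caller explicitly supplies the argument.
            warnings.warn(
                "Argument 'onesided' is deprecated and has no effect; "
                "a one-sided spectrum is always used.",
                DeprecationWarning,
                stacklevel=2,
            )
        self.n_fft = n_fft
        # A one-sided spectrum has n_fft // 2 + 1 frequency bins.
        self.n_freqs = n_fft // 2 + 1


with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    MelSpectrogramSketch()               # parameter omitted: no warning
    MelSpectrogramSketch(onesided=True)  # parameter given: one warning
```

Using `None` as the default lets the constructor tell "not provided" apart from an explicit `True`.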
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2797
      
      Reviewed By: carolineechen, xiaohui-zhang
      
      Differential Revision: D40731238
      
      Pulled By: hwangjeff
      
      fbshipit-source-id: 6eea8eb9d4a85a805162e03ad91682a1946f92cd
    • Refactor StreamProcessor interface (#2791) · 9e1999ae
      moto authored
      Summary:
      StreamProcessor is constructed on top of an AVStream object and attaches streams defined by client code.
      
      This commit refactors the constructor and the `add_stream` method signature so that `add_stream`'s signature is centered around the parameters required for filter construction.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2791
      
      Reviewed By: xiaohui-zhang
      
      Differential Revision: D40667979
      
      Pulled By: mthrok
      
      fbshipit-source-id: 42220832f09a7895ede3cddf969d57feeb4ef7ec
  19. 25 Oct, 2022 1 commit
    • Fix issue with the missing video frame in StreamWriter (#2789) · 17a2b93b
      moto authored
      Summary:
      Addresses https://github.com/pytorch/audio/issues/2790.
      
      Previously, AVPacket objects had duration==0.
      
      The `av_interleaved_write_frame` function inferred the duration of each packet by
      comparing it against the next one, but it could not infer the duration of
      the last packet, as there is no subsequent frame, and therefore omitted it from the final data.
      
      This commit fixes it by explicitly setting packet duration = 1 (one frame)
      only for video. (An audio AVPacket contains multiple samples, so it is handled differently.
      Tests were added to ensure correctness for audio.)
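
The failure mode described above can be pictured with a toy model in plain Python. It is a deliberately simplified sketch of the behavior reported in this commit message, not FFmpeg's actual logic; the function `written_frames` and its parameters are hypothetical.

```python
def written_frames(pts_list, explicit_duration=None):
    """Toy model: a packet is kept only if its duration is known (> 0)."""
    kept = []
    for idx, pts in enumerate(pts_list):
        if explicit_duration is not None:
            # The fix: every video packet carries its own duration.
            duration = explicit_duration
        elif idx + 1 < len(pts_list):
            # Infer the duration from the next packet's timestamp.
            duration = pts_list[idx + 1] - pts
        else:
            # Last packet: nothing to compare against, so it stays 0.
            duration = 0
        if duration > 0:
            kept.append(pts)
    return kept


pts = [0, 1, 2, 3]  # presentation timestamps of four video frames
before = written_frames(pts)
after = written_frames(pts, explicit_duration=1)
```

In this model the final frame is dropped when durations are left to inference and kept once each packet is given a duration of one frame.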
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2789
      
      Reviewed By: xiaohui-zhang
      
      Differential Revision: D40627439
      
      Pulled By: mthrok
      
      fbshipit-source-id: 4d0d827bff518c017b115445e03bdf0bf1e68320
  20. 21 Oct, 2022 1 commit
  21. 20 Oct, 2022 1 commit
  22. 19 Oct, 2022 3 commits
    • Bump version to 0.14 (#2779) · e77b8f90
      atalman authored
      Summary:
      Bump version to 0.14
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2779
      
      Reviewed By: carolineechen
      
      Differential Revision: D40523034
      
      Pulled By: atalman
      
      fbshipit-source-id: 325e6ffcac4763a7d83ba600c2c3d9eadae03c31
    • Add iemocap variants (#2778) · 34255386
      Caroline Chen authored
      Summary:
      Add the ability to load only improvised or only scripted utterances.
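
One way such a selection could work is sketched below, assuming utterance IDs carry an `impro` or `script` marker in their names. The function `filter_utterances`, its `utterance_type` parameter and the sample IDs are illustrative assumptions, not the dataset class changed in this PR.

```python
def filter_utterances(utterance_ids, utterance_type=None):
    """Keep all utterances, or only the improvised or scripted ones."""
    if utterance_type is None:
        return list(utterance_ids)

    # Assumed naming convention: the session type is embedded in the ID.
    markers = {"improvised": "impro", "scripted": "script"}
    if utterance_type not in markers:
        raise ValueError(f"unsupported utterance_type: {utterance_type!r}")

    marker = markers[utterance_type]
    return [uid for uid in utterance_ids if marker in uid]


# Made-up IDs in the assumed naming style.
ids = [
    "Ses01F_impro01_F000",
    "Ses01F_script01_1_M003",
    "Ses02M_impro05_M010",
]
improvised = filter_utterances(ids, "improvised")
scripted = filter_utterances(ids, "scripted")
everything = filter_utterances(ids)
```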
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2778
      
      Reviewed By: nateanl
      
      Differential Revision: D40511865
      
      Pulled By: carolineechen
      
      fbshipit-source-id: e1fe3908ac2aa306ad30c242ddd25762b2268539
    • [Nova] Clean commit for Enabling Nova Linux Wheels Workflows (#2719) · 9135b544
      Omkar Salpekar authored
      Summary:
      Creating this fresh PR since we're reverting the older commit that removed build configs from the CircleCI file. This does not change the existing builds/uploads in CircleCI and should not break any existing jobs/workflows. It only adds back workflows that build the Linux Wheels with Nova, upload them to GH artifacts (NOT to the actual nightly channels), and ensure that they produce the same binaries as CircleCI. TO CLARIFY: this does not upload anything to nightly channels, so this PR has no effect on any existing jobs or distributed binaries.
      
      We will create a workflow (most likely in test-infra) that compares the binaries to ensure parity before we start uploading with Nova.
      
      Pull Request resolved: https://github.com/pytorch/audio/pull/2719
      
      Reviewed By: hwangjeff, weiwangmeta
      
      Differential Revision: D39866440
      
      Pulled By: osalpekar
      
      fbshipit-source-id: 9ebf0402214fcd97cc519801276d85d336617410