- 25 Jul, 2022 2 commits
-
-
Sean Kim authored
Summary: Previous Issue: --use-tmp-hub-dir expected the temp directories used to store large file to be deleted after each test case, but pytest erases directories after 3 full test sessions. This commit fixes by manually deleting a new subdirectory created in each test case. https://github.com/pytorch/audio/pull/2565#discussion_r929007101 Pull Request resolved: https://github.com/pytorch/audio/pull/2569 Reviewed By: nateanl Differential Revision: D38117848 Pulled By: skim0514 fbshipit-source-id: 3767cb8df1238fd6218f6aaa58d5d583cea72699
-
moto authored
Summary: This commit fix build_docs job timeout by pinning `resampy=0.2.2`. For some mysterious reason, `resampy=0.3.1` causes slowdown of unrelated code. https://github.com/bmcfee/resampy/issues/106 Pull Request resolved: https://github.com/pytorch/audio/pull/2543 Reviewed By: carolineechen Differential Revision: D38115003 Pulled By: mthrok fbshipit-source-id: 67cd1c73dd4adb3091e0b88aaf5c31de0dd4b87e
-
- 22 Jul, 2022 2 commits
-
-
Sean Kim authored
Summary: Don't allow users to input incorrect dimensions Pull Request resolved: https://github.com/pytorch/audio/pull/2563 Reviewed By: carolineechen Differential Revision: D38074360 Pulled By: skim0514 fbshipit-source-id: 7bcae515706eb358ca6f68c50c7c0ccace1c3f95
-
Zhaoheng Ni authored
Summary: - Add documentation page for `SourceSeparationBundle` and `CONVTASNET_BASE_LIBRI2MIX`. - Add citation of Libri2Mix dataset in the bundle documentation. - url in integration test should use slash instead of `os.path.join` as it will fail on Windows. Change it to f-string. Pull Request resolved: https://github.com/pytorch/audio/pull/2559 Reviewed By: carolineechen Differential Revision: D38036116 Pulled By: nateanl fbshipit-source-id: 736732805191113955badfec3955e2e24e8f4836
-
- 21 Jul, 2022 4 commits
-
-
Sean Kim authored
Summary: Added back device in case of tensor creation Pull Request resolved: https://github.com/pytorch/audio/pull/2561 Reviewed By: mthrok Differential Revision: D38035351 Pulled By: skim0514 fbshipit-source-id: bdea07cbb34d0aa487187cded1a5636da6623d96
-
Jumon Nozaki authored
Summary: Fix the fallback function of load fileobj function in sox_io backend. The typo in the fallback function prevents showing the intended error message. Pull Request resolved: https://github.com/pytorch/audio/pull/2560 Reviewed By: carolineechen, nateanl Differential Revision: D38035077 Pulled By: mthrok fbshipit-source-id: 53c91c0569c7e7bba611aed6ea748dbd2f323221
-
Eli Uriegas authored
Summary: Updates the runner to the latest apple silicon machines we have that also run on macOS 12.4 Similar to https://github.com/pytorch/vision/pull/6290 Signed-off-by:
Eli Uriegas <eliuriegas@fb.com> Pull Request resolved: https://github.com/pytorch/audio/pull/2556 Reviewed By: atalman, mthrok Differential Revision: D37999959 Pulled By: seemethere fbshipit-source-id: 01d2ff01e48dcc0c4e33ed81758886fa19642aa3
-
Zhaoheng Ni authored
Summary: - Add SourceSeparationBundle class for source separation pipeline - Add `CONVTASNET_BASE_LIBRI2MIX` that is trained on Libri2Mix dataset. - Add integration test with example mixture audio and expected scale-invariant signal-to-distortion ratio (Si-SDR) score. The test computes the Si-SDR score with permutation-invariant training (PIT) criterion for all permutations of sources and use the highest value as the final output. The test verifies if the score is equal to or larger than the expected value. Pull Request resolved: https://github.com/pytorch/audio/pull/2440 Reviewed By: mthrok Differential Revision: D37997646 Pulled By: nateanl fbshipit-source-id: c951bcbbe8b7ed9553cb8793d6dc1ef90d5a29fe
-
- 20 Jul, 2022 1 commit
-
-
Sean Kim authored
Summary: Modification from pull request https://github.com/pytorch/audio/issues/2415 to improve resample. Benchmarked for a 89% time reduction, tested in comparison to original resample method. Pull Request resolved: https://github.com/pytorch/audio/pull/2553 Reviewed By: carolineechen Differential Revision: D37997533 Pulled By: skim0514 fbshipit-source-id: ef4b719450ac26794db6ea01f9882509f4fda5cf
-
- 19 Jul, 2022 3 commits
-
-
John Lu authored
Summary: `std::runtime_error` does not preserve the C++ stack trace, so it is unclear to users what went wrong internally. PyTorch's `TORCH_CHECK` macro allows to print C++ stack trace when `TORCH_SHOW_CPP_STACKTRACES` environment variable is set to 1. Pull Request resolved: https://github.com/pytorch/audio/pull/2551 Improve assertion for TorchAudio ffmpeg directory Reviewed By: mthrok Differential Revision: D37915732 fbshipit-source-id: 9f597eb00cadd0dc6a1bbf8f7d5c8092804ef685
-
moto authored
Summary: After reviewing the code for KenLM it turned out that we can build it without boost. Pull Request resolved: https://github.com/pytorch/audio/pull/2552 Reviewed By: xiaohui-zhang Differential Revision: D37949699 Pulled By: mthrok fbshipit-source-id: 4a4ffae4220d0b764b53f52b93040670d91a84a3
-
Sean Kim authored
Summary: Factory functions have been added to HDemucs class and test the implementation within the testing files. Pull Request resolved: https://github.com/pytorch/audio/pull/2547 Reviewed By: carolineechen Differential Revision: D37948600 Pulled By: skim0514 fbshipit-source-id: 7ac4e4a71519450cfbbc24ff7d7e70521f676040
-
- 18 Jul, 2022 1 commit
-
-
John Lu authored
Summary: `std::runtime_error` does not preserve the C++ stack trace, so it is unclear to users what went wrong internally. PyTorch's `TORCH_CHECK` macro allows to print C++ stack trace when `TORCH_SHOW_CPP_STACKTRACES` environment variable is set to 1. Pull Request resolved: https://github.com/pytorch/audio/pull/2550 Improves assertion for TorchAudio ffmpeg directory Reviewed By: mthrok Differential Revision: D37914953 fbshipit-source-id: 7704c41bb88b0616ae2e73961a5496bc0d95cf13
-
- 15 Jul, 2022 1 commit
-
-
moto authored
Summary: Recent CircleCI migration https://github.com/pytorch/audio/pull/2529 silently bumped the minimum supported macOS version to 11. PyTorch still supports 10.9 and the ecosystem still uses 10.9. Issue: https://github.com/pytorch/audio/issues/2536 This commit sets MACOSX_DEPLOYMENT_TARGET=10.9, so that binary distribution are compatible on macOS=10.9. Pull Request resolved: https://github.com/pytorch/audio/pull/2546 Reviewed By: atalman Differential Revision: D37854586 Pulled By: mthrok fbshipit-source-id: a43986ae4de9ef51a4261e0f9fe58e88b4b72148
-
- 12 Jul, 2022 6 commits
-
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2313 Reviewed By: carolineechen, nateanl Differential Revision: D37799552 Pulled By: mthrok fbshipit-source-id: 12e27fccb7098f3142e9ca0b748c71325cd324ee
-
Sean Kim authored
Summary: Small edit to docstring for kernel Pull Request resolved: https://github.com/pytorch/audio/pull/2542 Reviewed By: carolineechen Differential Revision: D37797937 Pulled By: skim0514 fbshipit-source-id: 4bdd1e3ddb49cbdf2bd5367edb03cf9603d4ec6e
-
moto authored
Summary: FFmpeg's API provide multiple ways to initialize decoder. This PR simplifies the initialization by delegating the HW device context management to FFmpeg's native code. Pull Request resolved: https://github.com/pytorch/audio/pull/2534 Reviewed By: hwangjeff Differential Revision: D37734573 Pulled By: mthrok fbshipit-source-id: e61736b4d4d2ca6e94d8965abd93b4e9a68e7351
-
Sean Kim authored
Summary: Draft PR with initial model implementation with minor changes from previous implementation Pull Request resolved: https://github.com/pytorch/audio/pull/2506 Reviewed By: nateanl Differential Revision: D37762671 Pulled By: skim0514 fbshipit-source-id: b7dc0a6ef725d6ae6d76c23c882623f7d339977c
-
moto authored
Summary: Python dictionary is bound to different types in TorchBind and PyBind. StreamReader has methods that receive and return dictionary. This commit cleans up the treatment of dictionary and consolidate helper functions. * The core implementation and TorchBind all uses `c10::Dict`. * PyBind version uses `std::map` and converts it to `c10::Dict`. * The helper functions to convert `std::map` <-> `c10::Dict` are consolidated in pybind directory. * The wrapper methods are implemented in `pybind` dir. Pull Request resolved: https://github.com/pytorch/audio/pull/2533 Reviewed By: hwangjeff Differential Revision: D37731866 Pulled By: mthrok fbshipit-source-id: 5a5cf1372668f7d3aacc0bb461bc69fa07212f3f
-
Zhaoheng Ni authored
Summary: The docstring of `apply_beamforming` has warning when building the documentation page. Fix it in this PR. Pull Request resolved: https://github.com/pytorch/audio/pull/2540 Reviewed By: mthrok Differential Revision: D37763745 Pulled By: nateanl fbshipit-source-id: 0e9f1e098865af032b00ac56d918cb9d2ffc5024
-
- 11 Jul, 2022 1 commit
-
-
Jeff Hwang authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2535 Modifies LibriSpeech Conformer RNN-T example recipe to make the Lightning module and datamodule more generic and reusable. Reviewed By: mthrok Differential Revision: D36731576 fbshipit-source-id: 4643e86fac78f3c2bacc15f5d385bc7b10f410a2
-
- 08 Jul, 2022 1 commit
-
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2531 Reviewed By: carolineechen Differential Revision: D37698120 Pulled By: mthrok fbshipit-source-id: d0fd6445d69758cd803a485cd17836d1936aa1ee
-
- 07 Jul, 2022 5 commits
-
-
moto authored
Summary: Preparation to add save features with ffmpeg. Pull Request resolved: https://github.com/pytorch/audio/pull/2530 Reviewed By: carolineechen Differential Revision: D37698147 Pulled By: mthrok fbshipit-source-id: feb5cbb6349a2b6b7faf44b629c574fdae47ecab
-
moto authored
Summary: CircleCI is removing Xcode 12.4.0 image on August, and there was a planned burnout on July 6th. [[detail](https://discuss.circleci.com/t/xcode-image-deprecation/44294?mkt_tok=NDg1LVpNSC02MjYAAAGFbbxbX7nSPCzN0MCKN078pw0VLJ-TMdICr8_gouRNYBM8C55RL8NDKLXA_9CQGPqnhJE5lsSFdetLRF-nH7iBLzoPGBfYpf2vuJ-XkW_C4__4)] https://app.circleci.com/pipelines/github/pytorch/audio/11566/workflows/da167296-a84f-4dfe-b1b9-60d67e7a3d1c/jobs/771638 This commit updates Xcode image to 12.5 Pull Request resolved: https://github.com/pytorch/audio/pull/2529 Reviewed By: atalman Differential Revision: D37688122 Pulled By: mthrok fbshipit-source-id: 1095edbf0d920c4dc772555915bce93875b74671
-
moto authored
Summary: This commit add support for `"yuv444p"` type as output format of StreamReader. Pull Request resolved: https://github.com/pytorch/audio/pull/2516 Reviewed By: hwangjeff Differential Revision: D37659715 Pulled By: mthrok fbshipit-source-id: eae9b5590d8f138a6ebf3808c08adfe068f11a2b
-
moto authored
Summary: This commits move helper functions/definitions around so that better locality of logics are achieved. ## Detail `ffmpeg.[h|cpp]` implements classes that convert FFmpeg structures into RAII semantics. Initially it these classes included the construction logic in their constructors, but such logics were extracted to factory functions in https://github.com/pytorch/audio/issues/2373. Now the reason why the factory functions stayed in `ffmpeg.[h|cpp]` was because the logic for the initialization and clean-up of AVDictionary class was only available in `ffmpeg.cpp`. Now AVDictionary class handling is properly defined in https://github.com/pytorch/audio/issues/2507, the factory functions, which are not that reusable better stay with the implementation that use them. This makes `ffmpeg.h` lean and clean, makes it easier to see what can be reused. Pull Request resolved: https://github.com/pytorch/audio/pull/2512 Reviewed By: hwangjeff Differential Revision: D37477592 Pulled By: mthrok fbshipit-source-id: 8c1b5059ea5f44649cc0eb1f82d1a92877ef186e
-
moto authored
Summary: Following the formatter changes heppened in fbcode, this commit update the linter config. Pull Request resolved: https://github.com/pytorch/audio/pull/2389 Reviewed By: hwangjeff Differential Revision: D37659649 Pulled By: mthrok fbshipit-source-id: 1c52ff93f0b10cb2e7303d2ad13b2d65ffccfcb0
-
- 06 Jul, 2022 1 commit
-
-
Caroline Chen authored
Summary: fluent dataset test currently fails on windows, due to new line generation in csv writer in testing and incorrect path parsing in dataset impl. Pull Request resolved: https://github.com/pytorch/audio/pull/2510 Reviewed By: carolineechen Differential Revision: D37573203 Pulled By: mthrok fbshipit-source-id: 4868bc649690c7e596b002686c6128ce735d3564
-
- 29 Jun, 2022 1 commit
-
-
moto authored
Summary: The build doc job is failing these days due to the fact that CUDA 11.6 requires different handling. Pull Request resolved: https://github.com/pytorch/audio/pull/2520 Reviewed By: xiaohui-zhang Differential Revision: D37527088 Pulled By: mthrok fbshipit-source-id: 34c23bdbf70ba9fb8e315c7036cff01b3ddf4c91
-
- 28 Jun, 2022 3 commits
-
-
hwangjeff authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2513 Reviewed By: mthrok Differential Revision: D37491994 Pulled By: hwangjeff fbshipit-source-id: 2c164bcec39342fd94abf4cc148d96dc9844699e
-
moto authored
Summary: FilterGraph is necessary for StreamWriter when saving video as Tensor array format cannot express commonot video formats like yub420. The current implementation of FilterGraph is specific to StreamReader, as it takes AVCodecParameters object. Not individual parameters. This PR refactor FilterGraph interface so that it can be constructed from more primitive information. Pull Request resolved: https://github.com/pytorch/audio/pull/2508 Reviewed By: hwangjeff Differential Revision: D37466033 Pulled By: mthrok fbshipit-source-id: 8414e985da7579c2dfe260b4dccd2afe113bb573
-
moto authored
Summary: Small clean up in ffmpeg binding code. 1. Make `get_option_dict` and `clean_up_dict` public utility 2. Merge the exception into `clean_up_dict` 3. Get rid of custom string join function and use `c10::Join`. Pull Request resolved: https://github.com/pytorch/audio/pull/2507 Reviewed By: hwangjeff Differential Revision: D37466022 Pulled By: mthrok fbshipit-source-id: 44b769ac6ff1ab20e6d6ae086cd1447deacb5969
-
- 27 Jun, 2022 5 commits
-
-
moto authored
Summary: Pull Request resolved: https://github.com/pytorch/audio/pull/2511 Reviewed By: nateanl Differential Revision: D37461021 Pulled By: mthrok fbshipit-source-id: 6f894c02bbefc5afda0f9584d26ad785f7c71ee4
-
Zhaoheng Ni authored
Summary: In https://github.com/pytorch/audio/issues/2283, torchaudio's downloading function is updated to reduce code duplication. The links in `EMFORMER_RNNT_BASE_LIBRISPEECH` are updated, but the ones in prototype pipelines are not. This PR addresses it by updating the download links of `EMFORMER_RNNT_BASE_MUSTC` and `EMFORMER_RNNT_BASE_TEDLIUM3` in prototype. Corresponding integration tests are added as well. Pull Request resolved: https://github.com/pytorch/audio/pull/2444 Reviewed By: mthrok Differential Revision: D37389178 Pulled By: nateanl fbshipit-source-id: 46598dd71c95be47d1e1b54cef89ea51d280e17a
-
moto authored
Summary: Follow-up of https://github.com/pytorch/audio/issues/2464. Add utility function to fetch the versions of FFmpeg. Pull Request resolved: https://github.com/pytorch/audio/pull/2467 Reviewed By: carolineechen Differential Revision: D37028006 Pulled By: mthrok fbshipit-source-id: 72adce1e6b43985760ce55b715b0e59af5244fdb
-
Andrey Talman authored
Summary: Fix for the cuda 11.6 and usage of cudatoolkit Pull Request resolved: https://github.com/pytorch/audio/pull/2501 Reviewed By: mthrok Differential Revision: D37388598 Pulled By: atalman fbshipit-source-id: 41add7ad6fbb3d156cc1270625dc085c62f7a531
-
Zhaoheng Ni authored
Summary: This PR adds two dataset classes of VoxCeleb1 corpus. - `VoxCeleb1Identification` Each data sample contains the waveform, sample rate, speaker id, and the file id. - `VoxCeleb1Verification` Each data sample contains a pair of waveforms, sample rate, the label indicating if they are from the same speaker, and the file ids. Pull Request resolved: https://github.com/pytorch/audio/pull/2349 Reviewed By: carolineechen Differential Revision: D35927921 Pulled By: nateanl fbshipit-source-id: 3e07ddd329178777698841565053eb59befe6449
-
- 24 Jun, 2022 1 commit
-
-
moto authored
Summary: The source build is still saying its 0.12. Pull Request resolved: https://github.com/pytorch/audio/pull/2509 Reviewed By: carolineechen Differential Revision: D37427703 Pulled By: mthrok fbshipit-source-id: a6e455ba7c583af7b1a2a355ca45a9e5ab5fe30d
-
- 23 Jun, 2022 1 commit
-
-
Summary: Meta: **If you take no action, this diff will be automatically accepted on 2022-06-23.** (To remove yourself from auto-accept diffs and just let them all land, add yourself to [this Butterfly rule](https://www.internalfb.com/butterfly/rule/904302247110220)) Produced by `tools/arcanist/lint/codemods/black-fbsource`. #nocancel Rules run: - CodemodTransformerSimpleShell Config Oncall: [lint](https://our.intern.facebook.com/intern/oncall3/?shortname=lint) CodemodConfig: [CodemodConfigFBSourceBlackLinter](https://www.internalfb.com/code/www/flib/intern/codemod_service/config/fbsource_arc_f/CodemodConfigFBSourceBlackLinter.php) ConfigType: php Sandcastle URL: https://www.internalfb.com/intern/sandcastle/job/13510799586951394/ This diff was automatically created with CodemodService. To learn more about CodemodService, check out the [CodemodService wiki](https://fburl.com/CodemodService). _____ ## Questions / Comments / Feedback? **[Click here to give feedback about this diff](https://www.internalfb.com/codemod_service/feedback?sandcastle_job_id=13510799586951394).** * Returning back to author or abandoning this diff will only cause the diff to be regenerated in the future. * Do **NOT** post in the CodemodService Feedback group about this specific diff. drop-conflicts Reviewed By: adamjernst Differential Revision: D37375235 fbshipit-source-id: 3d7eb39e5c0539a78d1412f37562dec90b0fc759
-
- 21 Jun, 2022 1 commit
-
-
Sean Kim authored
Summary: Create dataset handler and tests for new dataset. Manually tested and unit tested to test validity. Pre-commit ran for style checks. Pull Request resolved: https://github.com/pytorch/audio/pull/2484 Reviewed By: carolineechen, nateanl Differential Revision: D37250556 Pulled By: skim0514 fbshipit-source-id: d2c8d73d22fd9d7282026265676f3eab1e178d51
-