1. 05 Nov, 2020 2 commits
  2. 03 Nov, 2020 2 commits
  3. 02 Nov, 2020 1 commit
    • moto's avatar
      Sync fbcode (#996) · 758f6c2a
      moto authored
      fbshipit-source-id: 4fb853c391900d3070b936e5a3e4609eb78a780d
      
      * 20200428 pytorch/audio import
      
      Summary: [10:30:47: cpuhrsch@devvm3140 pytorch]$ ./fb_build/import_audio.sh
      
      Reviewed By: vincentqb
      
      Differential Revision: D21282421
      
      fbshipit-source-id: 9bde1455ca6a19defbf33dbbfc5f0d49a8e4dc6a
      
      * Import torchaudio 20200528
      
      Summary: Import Up to #664
      
      Reviewed By: cpuhrsch
      
      Differential Revision: D21728204
      
      fbshipit-source-id: 648dd622087fa762194ca5f89a310500e777263d
      
      * Remove unnecessary config file from torchaudio
      
      Summary: Turned out .use_external_sox is not necessary for building torchaudio in fbcode.
      
      Reviewed By: vincentqb
      
      Differential Revision: D21792939
      
      fbshipit-source-id: c0fb5173c6533e67114f50ddc8e9425bd129574f
      
      * Import torchaudio 20200605
      
      Summary: import torchaudio 0.5.0 in fbcode using import_audio.sh:
      
      Reviewed By: vincentqb
      
      Differential Revision: D21884426
      
      fbshipit-source-id: b6f2cc308e597caef2dd767c315b167c09fb0d4c
      
      * Change parameterized testing system to be compatible with unittest
      
      Summary: The previous implementation of parameterized testing worked by modifying test.common_utils inplace.  This doesn't work in general because unittest's contract with test modules is such that it must be able to load the module and run the test itself.  Because the previous implementation needed to load the module and modify it, it is incompatible.
      
      Reviewed By: mthrok
      
      Differential Revision: D21964676
      
      fbshipit-source-id: 9bb71e8c3f9fab074239b22306f3bbddb0f3975b
      
      * Import torchaudio 20200618 #718
      
      Summary: Import torchaudio up to #719
      
      Reviewed By: zhangguanheng66
      
      Differential Revision: D22119491
      
      fbshipit-source-id: e14842278a32c9373179fc132e8111a0ffe66d93
      
      * Import torchaudio 20200714 #782 (#784)
      
      Summary:
      Pull Request resolved: https://github.com/pytorch/audio/pull/784
      
       - Import torchaudio.
       - Change test util module name from test_case_utils to case_utils
      
      Reviewed By: cpuhrsch
      
      Differential Revision: D22261638
      
      fbshipit-source-id: eb4df500c1d7db0a60baa100dd22795a63851438
      
      * remediation of S205607
      
      fbshipit-source-id: 5113fe0c527595e4227ff827253b7414abbdf7ac
      
      * remediation of S205607
      
      fbshipit-source-id: 798decc90db4f13770e97cdce3c0df7d5421b2a3
      
      * Import torchaudio 20200723
      
      Summary: Import torchaudio 20200723 #814
      
      Reviewed By: fmassa
      
      Differential Revision: D22666393
      
      fbshipit-source-id: 50df07b5c158fe4e95ada7ea54381b2e26f6aecd
      
      * Support custom exception message (#41907)
      
      Summary:
      Raise and assert used to have a hard-coded error message "Exception". User provided error message was ignored. This PR adds support to represent user's error message in TorchScript.
      
      This breaks backward compatibility because now we actually need to script the user's error message, which can potentially contain unscriptable expressions. Such programs can break when scripting, but saved models can still continue to work.
      
      Increased an op count in test_mobile_optimizer.py because now we need aten::format to form the actual exception message.
      
      This is built upon an WIP PR:  https://github.com/pytorch/pytorch/pull/34112 by driazati
      
      Pull Request resolved: https://github.com/pytorch/pytorch/pull/41907
      
      
      
      Reviewed By: ngimel
      
      Differential Revision: D22778301
      
      Pulled By: gmagogsfm
      
      fbshipit-source-id: 2b94f0db4ae9fe70c4cd03f4048e519ea96323ad
      
      * Import torchaudio 20200804
      
      Summary: Up to #804
      
      Reviewed By: vincentqb
      
      Differential Revision: D22947671
      
      fbshipit-source-id: d1a005cec2f1a00913c41eda380b9f4b993ef779
      
      * Remove .python3 markers
      
      Reviewed By: ashwinp-fb
      
      Differential Revision: D22955630
      
      fbshipit-source-id: f00ef17a905e4c7cd9196c8924db39f9cdfe8cfa
      
      * Import torchaudio 20200821
      
      Reviewed By: cpuhrsch
      
      Differential Revision: D23273584
      
      fbshipit-source-id: 2fe7effa11b7f7cdf0cee1da6b1cac5556e9f55b
      
      * Import torchaudio 20200922
      
      Summary: Up to #914
      
      Reviewed By: vincentqb, cpuhrsch
      
      Differential Revision: D23846718
      
      fbshipit-source-id: 9feb4e58563b900965467bd9ff66c979211c50df
      
      * replace max-sentences with batch-size for dependencies
      
      Summary: this fixes some regressions introduced by D24121305. fairseq configuration is changing from command line to dataclasses (via hydra eventually) which no longer supports option aliases. one such alias is --max-sentences / --batch-size, and D24121305 removed --max-sentences as --batch-size is more appropriate (fairseq is not just an nlp framework dealing with sentences). unfortunately it seems some existing flows broke and this diff attempts to fix this
      
      Differential Revision: D24142488
      
      fbshipit-source-id: 075180ea10a9d706a3f8d64b978d66dfd83c3d2b
      Co-authored-by: default avatarVincent Quenneville-Belair <vincentqb@gmail.com>
      Co-authored-by: default avatarcpuhrsch <cpuhrsch@fb.com>
      Co-authored-by: default avatarJi Chen <jimchen90@fb.com>
      Co-authored-by: default avatarBen Mehne <bmehne@fb.com>
      Co-authored-by: default avatarStanislau Hlebik <stash@fb.com>
      Co-authored-by: default avatarYanan Cao <gmagogsfm@gmail.com>
      Co-authored-by: default avatarAndres Suarez <asuarez@fb.com>
      Co-authored-by: default avatarAlexei Baevski <abaevski@fb.com>
      758f6c2a
  4. 30 Oct, 2020 1 commit
  5. 27 Oct, 2020 3 commits
  6. 26 Oct, 2020 4 commits
  7. 22 Oct, 2020 1 commit
  8. 19 Oct, 2020 2 commits
  9. 16 Oct, 2020 2 commits
  10. 15 Oct, 2020 1 commit
  11. 14 Oct, 2020 1 commit
  12. 13 Oct, 2020 3 commits
  13. 12 Oct, 2020 1 commit
  14. 09 Oct, 2020 4 commits
  15. 06 Oct, 2020 2 commits
  16. 02 Oct, 2020 1 commit
  17. 01 Oct, 2020 2 commits
  18. 29 Sep, 2020 1 commit
  19. 28 Sep, 2020 2 commits
    • moto's avatar
      Add "soundfile" backend compatible to "sox_io" (#922) · 2c723e8c
      moto authored
      As a part of the "sox" backend sunset plan (#903), we add a "soundfile" backend that is compatible with the "sox_io" backend. No new public backend name is added. We provide a switch to change the interface/behavior of "soundfile" backend.
      
      This commit contains;
       - The implementation of the new "soundfile" backend.
       - The flag to switch the behavior of "soundfile" backend. (`torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE`)
       - Test for the new backend and switching mechanism.
      
      The default behavior of "soundfile" backend is not changed. The users who want to opt-in the new "soundfile" interface can do so by `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE = False` before changing the backend to "soundfile".
      
      In 0.8.0 release, the "soundfile" backend will use this interface by default, and users can still use the legacy one with `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE = True`. In 0.9.0, the legacy interface is removed and `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE` flag will be eventually removed.
      2c723e8c
    • moto's avatar
      Add ConvTasNet model (#920) · 8e370559
      moto authored
      8e370559
  20. 24 Sep, 2020 1 commit
    • Vincent QB's avatar
      Example pipeline with wav2letter (#632) · 9c274228
      Vincent QB authored
      * example pipeline, initial commit.
      
      * removing notebook conversion artifacts.
      
      * remove extra comments. lint.
      
      * addressing some feedback.
      
      * main function.
      
      * defining args in function.
      
      * refactor.
      
      * lint.
      
      * checkpoint.
      
      * clean version to start with.
      
      * adding more parameters.
      
      * lint.
      
      * cleaning full version.
      
      * check for not None.
      
      * cleaning.
      
      * back -l 160
      
      * black.
      
      * fix runtime error.
      
      * removing some print statements.
      
      * add help to command line. add progress bar option.
      
      * grouping librispeech-specific transform in subclass.
      
      * typo.
      
      * fix concatenation.
      
      * typo.
      
      * black. tqdm.
      
      * missing transpose.
      
      * renaming variables.
      
      * sum cer and wer
      
      * clip norm.
      
      * second signal handler removed.
      
      * cosmetic.
      
      * default to no checkpoint.
      
      * remove non_blocking.
      
      * adadelta works better than sgd.
      
      * anomaly detection.
      
      * moving dataset to separate file.
      
      * lint.
      
      * move to separate module: languagemodel, decoder, metric.
      
      * flush=True.
      
      * renaming decoder.
      
      * CTC Decoders.
      
      * flush=True.
      
      * pass length for viterbi decoder.
      
      * progress bar. relative path.
      
      * generalize transition matrix to n-gram. progress bar.
      
      * choice of decoder.
      
      * collate func.
      
      * remove signal handling.
      
      * adding distributed.
      
      * lint.
      
      * normalize w/r to length of dataset, and w/r to total number characters.
      
      * relative cer/wer.
      
      * clip grad parameter. momentum back but not yet used.
      
      * Switch to SGD.
      
      * choice of optimizer.
      
      * scheduler.
      
      * move to utils file.
      
      * metric log, and utils file.
      
      * rename metric_logger.
      
      * stderr and stdout. simpler metric logger.
      
      * replace by logging.
      
      * adding time measurement in metric logger.
      
      * fix duplicate name. remove tqdm. keep track of epoch instead and iteration instead.
      
      * rename main file. and add readme.
      
      * refactor distributed.
      
      * swap example and output in readme.
      
      * remove time from logger.
      
      * check non-empty tensor input.
      
      * typo in variable name and log update.
      
      * typo.
      
      * compute cer/wer in training too.
      
      * typo.
      
      * add back slurm signal capture to resubmit job.
      
      * update levinstein distance.
      
      * adding tests for levenstein distance.
      
      * record error rate during iteration.
      
      * metric logger using setitem.
      
      * moving signal break to end of loop and return loss so far.
      
      * typo.
      
      * add citation.
      
      * change default to best run.
      
      * adding other experiment with decoders.
      
      * remove other decoders than greedy.
      
      * Revert "remove other decoders than greedy."
      
      This reverts commit fb114372e89e317bf48d0b1f846c60bca8efe1ac.
      
      * changing name of folfder.
      
      * remove other decoders, and unused dataset class.
      
      * rename functions to align with other pipeline.
      
      * pick which parts to train with.
      
      * adding specaugment to validation. note that caching prevents randomization from happening in validation.
      
      * updating readme.
      
      * typo in metric logging.
      
      * Revert "typo in metric logging."
      
      This reverts commit acac245eec250f61d2039a67933d3c01f1975ce9.
      
      * Revert "Revert "typo in metric logging.""
      
      This reverts commit 2c80d9691ed401044da734c40df3715dba92d0db.
      
      * update metric logger.
      
      * simplify metric logger implementation.
      
      * use json dumps instead.
      
      * group metric together.
      
      * move function.
      
      * lint.
      
      * quick summary of files in folder.
      
      * pass clip_grad explictly.
      
      * typo in default dataset name.
      
      * option to disable logger.
      
      * ergonomics for distributed.
      
      * reminder about signal handler.
      
      * minor refactor of main in main.
      
      * replace by not_main_rank.
      
      * raising error if parameter not supported.
      
      * move model before invoking DDP.
      
      * changing log level. using python 2 style string for logging.
      
      * dynamic augmentations.
      
      * update metric log.
      
      batch cer/wer metric. correct typo in time. adding other dimensions in metric.
      
      * save learning rate even if function not available.
      
      * add type option to model.
      
      * add adamw.
      
      * reduce lr on validation step or training step.
      
      * specify hop-length and win-length.
      
      * normalize option.
      
      * rename parameter.
      
      * add dropout and tweak to number of channels.
      
      * copy model in pipeline folder for experimentation.
      
      * fix scheduler stepping.
      
      * fix input_type and num_features.
      
      * waveform mode changes shape more.
      
      * adding best character error rate with current implementation of model with mfcc.
      
      * comment update.
      
      * remove signal. remove custom wav2letter model.
      
      * remove comment.
      
      * simpler import with pandas.
      9c274228
  21. 23 Sep, 2020 1 commit
  22. 21 Sep, 2020 1 commit
  23. 18 Sep, 2020 1 commit