- 22 Oct, 2020 1 commit
-
-
Vasilis Vryniotis authored
-
- 19 Oct, 2020 2 commits
-
-
Brian Johnson authored
Adds introductory context and links to the PyTorch Libraries to audio docs.
-
moto authored
-
- 16 Oct, 2020 2 commits
- 15 Oct, 2020 1 commit
-
-
moto authored
-
- 14 Oct, 2020 1 commit
-
-
moto authored
-
- 13 Oct, 2020 3 commits
-
-
moto authored
-
moto authored
-
lawlict authored
Co-authored-by:linqj3 <linqj3@lenovo.com>
-
- 12 Oct, 2020 1 commit
-
-
moto authored
-
- 09 Oct, 2020 4 commits
-
-
Vincent QB authored
-
moto authored
-
moto authored
-
Vincent QB authored
and add test on other backend.
-
- 06 Oct, 2020 2 commits
-
-
moto authored
-
peterjc123 authored
-
- 02 Oct, 2020 1 commit
-
-
moto authored
-
- 01 Oct, 2020 2 commits
-
-
Vincent QB authored
* remove * in import of models. * only importing WaveRNN in tests.
-
moto authored
-
- 29 Sep, 2020 1 commit
-
-
cpuhrsch authored
* Suggested changes to speed up phaser * Checkpoint * Checkpoint * Checkpoint * Checkpoint * removing todo items Co-authored-by:Vincent QB <vincentqb@users.noreply.github.com>
-
- 28 Sep, 2020 2 commits
-
-
moto authored
As a part of the "sox" backend sunset plan (#903), we add a "soundfile" backend that is compatible with the "sox_io" backend. No new public backend name is added. We provide a switch to change the interface/behavior of "soundfile" backend. This commit contains; - The implementation of the new "soundfile" backend. - The flag to switch the behavior of "soundfile" backend. (`torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE`) - Test for the new backend and switching mechanism. The default behavior of "soundfile" backend is not changed. The users who want to opt-in the new "soundfile" interface can do so by `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE = False` before changing the backend to "soundfile". In 0.8.0 release, the "soundfile" backend will use this interface by default, and users can still use the legacy one with `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE = True`. In 0.9.0, the legacy interface is removed and `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE` flag will be eventually removed.
-
moto authored
-
- 24 Sep, 2020 1 commit
-
-
Vincent QB authored
* example pipeline, initial commit. * removing notebook conversion artifacts. * remove extra comments. lint. * addressing some feedback. * main function. * defining args in function. * refactor. * lint. * checkpoint. * clean version to start with. * adding more parameters. * lint. * cleaning full version. * check for not None. * cleaning. * back -l 160 * black. * fix runtime error. * removing some print statements. * add help to command line. add progress bar option. * grouping librispeech-specific transform in subclass. * typo. * fix concatenation. * typo. * black. tqdm. * missing transpose. * renaming variables. * sum cer and wer * clip norm. * second signal handler removed. * cosmetic. * default to no checkpoint. * remove non_blocking. * adadelta works better than sgd. * anomaly detection. * moving dataset to separate file. * lint. * move to separate module: languagemodel, decoder, metric. * flush=True. * renaming decoder. * CTC Decoders. * flush=True. * pass length for viterbi decoder. * progress bar. relative path. * generalize transition matrix to n-gram. progress bar. * choice of decoder. * collate func. * remove signal handling. * adding distributed. * lint. * normalize w/r to length of dataset, and w/r to total number characters. * relative cer/wer. * clip grad parameter. momentum back but not yet used. * Switch to SGD. * choice of optimizer. * scheduler. * move to utils file. * metric log, and utils file. * rename metric_logger. * stderr and stdout. simpler metric logger. * replace by logging. * adding time measurement in metric logger. * fix duplicate name. remove tqdm. keep track of epoch instead and iteration instead. * rename main file. and add readme. * refactor distributed. * swap example and output in readme. * remove time from logger. * check non-empty tensor input. * typo in variable name and log update. * typo. * compute cer/wer in training too. * typo. * add back slurm signal capture to resubmit job. * update levinstein distance. * adding tests for levenstein distance. * record error rate during iteration. * metric logger using setitem. * moving signal break to end of loop and return loss so far. * typo. * add citation. * change default to best run. * adding other experiment with decoders. * remove other decoders than greedy. * Revert "remove other decoders than greedy." This reverts commit fb114372e89e317bf48d0b1f846c60bca8efe1ac. * changing name of folfder. * remove other decoders, and unused dataset class. * rename functions to align with other pipeline. * pick which parts to train with. * adding specaugment to validation. note that caching prevents randomization from happening in validation. * updating readme. * typo in metric logging. * Revert "typo in metric logging." This reverts commit acac245eec250f61d2039a67933d3c01f1975ce9. * Revert "Revert "typo in metric logging."" This reverts commit 2c80d9691ed401044da734c40df3715dba92d0db. * update metric logger. * simplify metric logger implementation. * use json dumps instead. * group metric together. * move function. * lint. * quick summary of files in folder. * pass clip_grad explictly. * typo in default dataset name. * option to disable logger. * ergonomics for distributed. * reminder about signal handler. * minor refactor of main in main. * replace by not_main_rank. * raising error if parameter not supported. * move model before invoking DDP. * changing log level. using python 2 style string for logging. * dynamic augmentations. * update metric log. batch cer/wer metric. correct typo in time. adding other dimensions in metric. * save learning rate even if function not available. * add type option to model. * add adamw. * reduce lr on validation step or training step. * specify hop-length and win-length. * normalize option. * rename parameter. * add dropout and tweak to number of channels. * copy model in pipeline folder for experimentation. * fix scheduler stepping. * fix input_type and num_features. * waveform mode changes shape more. * adding best character error rate with current implementation of model with mfcc. * comment update. * remove signal. remove custom wav2letter model. * remove comment. * simpler import with pandas.
-
- 23 Sep, 2020 1 commit
-
- 21 Sep, 2020 1 commit
-
-
toddstep authored
* Warn if create_fb_matrix produces a column whose weights are all zeros See also https://github.com/librosa/librosa/issues/478
-
- 18 Sep, 2020 1 commit
-
-
Tim Loderhose authored
-
- 15 Sep, 2020 4 commits
-
-
Eli Uriegas authored
Default to using the `cpu` directory since the base one is not reliable Signed-off-by:Eli Uriegas <eliuriegas@fb.com>
-
moto authored
-
moto authored
* Add deprecation warning to sox backend Refer to https://github.com/pytorch/audio/issues/903
-
Jaime Ferrando Huertas authored
-
- 11 Sep, 2020 1 commit
-
-
sdarkhovsky authored
* updated the build_generator call to include the models argument * fixed RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same
-
- 03 Sep, 2020 1 commit
-
-
Eli Uriegas authored
Upstream is on 9.4.1 and we were experiencing issues when conda-build was attempting to check for overlinking and failed out due to some files not existing in xcode 9.0 Signed-off-by:Eli Uriegas <eliuriegas@fb.com>
-
- 28 Aug, 2020 1 commit
-
-
moto authored
When running smoke test, proper way to install conda package is to install from workspace, instead of installing from archive. Otherwise the dependencies are not properly installed https://github.com/conda/conda/issues/1884 For this, we pass the whole `conda-bld` directory from conda package build job to upload job then smoke test job. `torchaudio` does not have a mandatory third-party python package dependency, so this was not an issue. See also https://github.com/pytorch/text/pull/803
-
- 24 Aug, 2020 1 commit
-
-
moto authored
-
- 23 Aug, 2020 1 commit
-
-
Tejasvi S Tomar authored
* Fix incorrect extension parsing in sox_io_backend.save * Add tests for compression=None
-
- 20 Aug, 2020 1 commit
-
-
JianwuXu authored
* Tweak docstring, audio_ext, load method signature and constructor of VCTK_092 * Add test for VCTK_092 dataset.
-
- 19 Aug, 2020 1 commit
-
-
Abhishek Dubey authored
* Added version 0.92 of VCTK dataset Signed-off-by:Abhishek Dubey <abhi.dubey011999@gmail.com>
-
- 12 Aug, 2020 2 commits