Commits · 04733c05900ed8fd71981b2a47b3793642e6471b · OpenDAS / Torchaudio

22 Oct, 2020 1 commit
- Fixing clang-format version to what PyTorch uses. (#981) · 04733c05
  Vasilis Vryniotis authored Oct 22, 2020
  
  04733c05
19 Oct, 2020 2 commits
- Update index.rst (#968) · ba1698ba
  Brian Johnson authored Oct 19, 2020
```
Adds introductory context and links to the PyTorch Libraries to audio docs.
```
  ba1698ba
- Cherry-pick 'Fix binary smoke test (#964)' (#965) · dc9f1fca
  moto authored Oct 19, 2020
  
  dc9f1fca
16 Oct, 2020 2 commits
- Cherry-pick 'Use PyTorch RC for unittest (#953)' (#963) · b8402baa
  moto authored Oct 16, 2020
  
  b8402baa
- Update package script to use UPLOAD_CHANNEL env (#962) · ba00419a
  moto authored Oct 16, 2020
  
  ba00419a
15 Oct, 2020 1 commit
- Upload to "test" channel on release branch (#952) (#954) · 6a0053e9
  moto authored Oct 15, 2020
  
  6a0053e9
14 Oct, 2020 1 commit
- Run GPU test on release/* branch (#958) · 8e8c5277
  moto authored Oct 14, 2020
  
  8e8c5277
13 Oct, 2020 3 commits
- Add Conv-TasNet training script to example (#896) · 4e97213b
  moto authored Oct 13, 2020
  
  4e97213b
- Make VCTK_092 return regular type for the consistency (#949) · 2c07658b
  moto authored Oct 13, 2020
  
  2c07658b
- Improve the speed of kaldi.fbank with fused operator (#947) · c92392fc
  lawlict authored Oct 14, 2020
```
Co-authored-by: linqj3 <linqj3@lenovo.com>
```
  c92392fc
12 Oct, 2020 1 commit
- Add wsj0-mix dataset to source separation example (#895) · 2d879132
  moto authored Oct 12, 2020
  
  2d879132
09 Oct, 2020 4 commits
- lint. (#945) · ba7b7a2f
  Vincent QB authored Oct 09, 2020
  
  ba7b7a2f
- [doc] Update librosa link (#940) · 61cb8f26
  moto authored Oct 09, 2020
  
  61cb8f26
- [doc] Update backend docstring/documentation (#935) · e17c2634
  moto authored Oct 09, 2020
  
  e17c2634
- fix tedlium load_audio (#934) · 4f0acc58
  Vincent QB authored Oct 09, 2020
```
and add test on other backend.
```
  4f0acc58
06 Oct, 2020 2 commits
- Add metrics to source separation example(#894) · 725f8b06
  moto authored Oct 06, 2020
  
  725f8b06
- Fix Windows unit tests (#937) · 9871219d
  peterjc123 authored Oct 07, 2020
  
  9871219d
02 Oct, 2020 1 commit
- Update docstrings/documentations of all the datasets (#931) · e3d1d746
  moto authored Oct 02, 2020
  
  e3d1d746
01 Oct, 2020 2 commits
- remove * in import of models (#932) · 963224f5
  Vincent QB authored Oct 01, 2020
```
* remove * in import of models.

* only importing WaveRNN in tests.
```
  963224f5
- Update model documentation (#933) · 1df9e201
  moto authored Oct 01, 2020
  
  1df9e201
29 Sep, 2020 1 commit

cpuhrsch authored Sep 29, 2020



* Suggested changes to speed up phaser

* Checkpoint

* Checkpoint

* Checkpoint

* Checkpoint

* removing todo items
Co-authored-by: Vincent QB <vincentqb@users.noreply.github.com>

3250d3df

28 Sep, 2020 2 commits

Add "soundfile" backend compatible to "sox_io" (#922) · 2c723e8c

moto authored Sep 28, 2020

As a part of the "sox" backend sunset plan (#903), we add a "soundfile" backend that is compatible with the "sox_io" backend. No new public backend name is added. We provide a switch to change the interface/behavior of "soundfile" backend.

This commit contains;
 - The implementation of the new "soundfile" backend.
 - The flag to switch the behavior of "soundfile" backend. (`torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE`)
 - Test for the new backend and switching mechanism.

The default behavior of "soundfile" backend is not changed. The users who want to opt-in the new "soundfile" interface can do so by `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE = False` before changing the backend to "soundfile".

In 0.8.0 release, the "soundfile" backend will use this interface by default, and users can still use the legacy one with `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE = True`. In 0.9.0, the legacy interface is removed and `torchaudio.USE_SOUNDFILE_LEGACY_INTERFACE` flag will be eventually removed.

2c723e8c

Add ConvTasNet model (#920) · 8e370559
moto authored Sep 28, 2020

8e370559

24 Sep, 2020 1 commit

Example pipeline with wav2letter (#632) · 9c274228

Vincent QB authored Sep 24, 2020

* example pipeline, initial commit.

* removing notebook conversion artifacts.

* remove extra comments. lint.

* addressing some feedback.

* main function.

* defining args in function.

* refactor.

* lint.

* checkpoint.

* clean version to start with.

* adding more parameters.

* lint.

* cleaning full version.

* check for not None.

* cleaning.

* back -l 160

* black.

* fix runtime error.

* removing some print statements.

* add help to command line. add progress bar option.

* grouping librispeech-specific transform in subclass.

* typo.

* fix concatenation.

* typo.

* black. tqdm.

* missing transpose.

* renaming variables.

* sum cer and wer

* clip norm.

* second signal handler removed.

* cosmetic.

* default to no checkpoint.

* remove non_blocking.

* adadelta works better than sgd.

* anomaly detection.

* moving dataset to separate file.

* lint.

* move to separate module: languagemodel, decoder, metric.

* flush=True.

* renaming decoder.

* CTC Decoders.

* flush=True.

* pass length for viterbi decoder.

* progress bar. relative path.

* generalize transition matrix to n-gram. progress bar.

* choice of decoder.

* collate func.

* remove signal handling.

* adding distributed.

* lint.

* normalize w/r to length of dataset, and w/r to total number characters.

* relative cer/wer.

* clip grad parameter. momentum back but not yet used.

* Switch to SGD.

* choice of optimizer.

* scheduler.

* move to utils file.

* metric log, and utils file.

* rename metric_logger.

* stderr and stdout. simpler metric logger.

* replace by logging.

* adding time measurement in metric logger.

* fix duplicate name. remove tqdm. keep track of epoch instead and iteration instead.

* rename main file. and add readme.

* refactor distributed.

* swap example and output in readme.

* remove time from logger.

* check non-empty tensor input.

* typo in variable name and log update.

* typo.

* compute cer/wer in training too.

* typo.

* add back slurm signal capture to resubmit job.

* update levinstein distance.

* adding tests for levenstein distance.

* record error rate during iteration.

* metric logger using setitem.

* moving signal break to end of loop and return loss so far.

* typo.

* add citation.

* change default to best run.

* adding other experiment with decoders.

* remove other decoders than greedy.

* Revert "remove other decoders than greedy."

This reverts commit fb114372e89e317bf48d0b1f846c60bca8efe1ac.

* changing name of folfder.

* remove other decoders, and unused dataset class.

* rename functions to align with other pipeline.

* pick which parts to train with.

* adding specaugment to validation. note that caching prevents randomization from happening in validation.

* updating readme.

* typo in metric logging.

* Revert "typo in metric logging."

This reverts commit acac245eec250f61d2039a67933d3c01f1975ce9.

* Revert "Revert "typo in metric logging.""

This reverts commit 2c80d9691ed401044da734c40df3715dba92d0db.

* update metric logger.

* simplify metric logger implementation.

* use json dumps instead.

* group metric together.

* move function.

* lint.

* quick summary of files in folder.

* pass clip_grad explictly.

* typo in default dataset name.

* option to disable logger.

* ergonomics for distributed.

* reminder about signal handler.

* minor refactor of main in main.

* replace by not_main_rank.

* raising error if parameter not supported.

* move model before invoking DDP.

* changing log level. using python 2 style string for logging.

* dynamic augmentations.

* update metric log.

batch cer/wer metric. correct typo in time. adding other dimensions in metric.

* save learning rate even if function not available.

* add type option to model.

* add adamw.

* reduce lr on validation step or training step.

* specify hop-length and win-length.

* normalize option.

* rename parameter.

* add dropout and tweak to number of channels.

* copy model in pipeline folder for experimentation.

* fix scheduler stepping.

* fix input_type and num_features.

* waveform mode changes shape more.

* adding best character error rate with current implementation of model with mfcc.

* comment update.

* remove signal. remove custom wav2letter model.

* remove comment.

* simpler import with pandas.

9c274228

23 Sep, 2020 1 commit
- Revert "Run tests in parallel with pytest-xdist (#807)" (#915) · 95d9f2d2
  moto authored Sep 23, 2020
```
This reverts commit 1ecbc249.

Reason: The macOS CI jobs are stuck with xdist recently.
```
  95d9f2d2
21 Sep, 2020 1 commit

Issue warning when a Mel filter is all zero (#914) · 77df44e3

toddstep authored Sep 21, 2020

* Warn if create_fb_matrix produces a column whose weights are all zeros

See also https://github.com/librosa/librosa/issues/478

77df44e3

18 Sep, 2020 1 commit
- Add pathlib.Path support to sox_io backend (#907) · e7161acf
  Tim Loderhose authored Sep 18, 2020
  
  e7161acf
15 Sep, 2020 4 commits
- packaging: Use cpu directory by default (#909) · 4cdd8cad
  Eli Uriegas authored Sep 15, 2020
```
Default to using the `cpu` directory since the base one is not reliable
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>
```
  4cdd8cad
- Add deprecation warnings to load_wav functions (#905) · 5418d937
  moto authored Sep 15, 2020
  
  5418d937
- Add deprecation warning to sox backend (#904) · 92b027b0
  moto authored Sep 15, 2020
```
* Add deprecation warning to sox backend

Refer to https://github.com/pytorch/audio/issues/903
```
  92b027b0
- Add tedlium dataset (#882) · 914a846d
  Jaime Ferrando Huertas authored Sep 15, 2020
  
  914a846d
11 Sep, 2020 1 commit

Fix interactive asr (#900) · b6a61c3f

sdarkhovsky authored Sep 11, 2020

* updated the build_generator call to include the models argument

* fixed RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same

b6a61c3f

03 Sep, 2020 1 commit

.circleci: Bump xcode workers to 9.4.1 (#898) · c388ec2b

Eli Uriegas authored Sep 02, 2020



Upstream is on 9.4.1 and we were experiencing issues when conda-build
was attempting to check for overlinking and failed out due to some files
not existing in xcode 9.0
Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

c388ec2b

28 Aug, 2020 1 commit

[CI] Install conda package from workspace (#881) · ef01e245

moto authored Aug 28, 2020

When running smoke test, proper way to install conda package is to
install from workspace, instead of installing from archive.
Otherwise the dependencies are not properly installed

https://github.com/conda/conda/issues/1884

For this, we pass the whole `conda-bld` directory from
conda package build job to upload job then smoke test job.

`torchaudio` does not have a mandatory third-party python package dependency,
so this was not an issue.

See also https://github.com/pytorch/text/pull/803

ef01e245

24 Aug, 2020 1 commit
- Remove default compression level test for mp3 (#886) · 52a18a9e
  moto authored Aug 23, 2020
  
  52a18a9e
23 Aug, 2020 1 commit
- Fix incorrect extension parsing in sox_io_backend.save(#885) · 080cd303
  Tejasvi S Tomar authored Aug 23, 2020
```
* Fix incorrect extension parsing in sox_io_backend.save
* Add tests for compression=None
```
  080cd303
20 Aug, 2020 1 commit

Update VCTK_092 interface and add tests (#875) · 2205cc9e

JianwuXu authored Aug 20, 2020

* Tweak docstring, audio_ext, load method signature and constructor of VCTK_092

* Add test for VCTK_092 dataset.

2205cc9e

19 Aug, 2020 1 commit

Add VCTK_092 dataset (#812) · 4bfebd85

Abhishek Dubey authored Aug 19, 2020



* Added version 0.92 of VCTK dataset
Signed-off-by: Abhishek Dubey <abhi.dubey011999@gmail.com>

4bfebd85

12 Aug, 2020 2 commits
- [CI] Fix macOS unittest env setup (#872) · c692fe99
  moto authored Aug 12, 2020
  
  c692fe99
- [CI] Increase resource class to prevent OOM (#873) · 06feab58
  moto authored Aug 12, 2020
  
  06feab58