- 04 Nov, 2021 1 commit
-
-
moto authored
Since #1966, the `git submodule` step is performed automatically.
-
- 03 Nov, 2021 5 commits
-
-
moto authored
Following the plan in #1337, this commit drops support for the pseudo complex type from `F.phase_vocoder` and `T.TimeStretch`.
-
moto authored
Following the plan in #1337, this commit drops support for the pseudo complex type from `F.spectrogram` and `T.Spectrogram`. It also deprecates the `return_complex` argument.
-
moto authored
-
moto authored
This commit changes the build process so that third-party code is fetched automatically when `setup.py` is invoked. This serves two purposes:

1. `build_sdist` now includes the third-party library code, so the source distribution created when a release tag is cut is complete.
2. It makes it possible to do `pip install git+https://github.com/pytorch/audio.git`.

Example:

```
!pip install 'cmake>=3.18' ninja
!pip install --verbose git+https://github.com/pytorch/audio.git@auto-source
```
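The idea of fetching third-party code from within the build script can be sketched as follows. This is a minimal illustration, not the actual `setup.py` from this commit: the helper names (`submodule_command`, `needs_fetch`, `fetch_third_party`) and the "directory is missing or empty" check are assumptions made for the example.

```python
import subprocess
from pathlib import Path


def submodule_command():
    """Return the git command that fetches all submodules recursively."""
    return ["git", "submodule", "update", "--init", "--recursive"]


def needs_fetch(third_party_dir):
    """Return True when the directory is missing or contains no entries.

    Assumption: an unpopulated checkout shows up as a missing or empty
    directory. A real build script may need a finer-grained check.
    """
    path = Path(third_party_dir)
    return not path.is_dir() or not any(path.iterdir())


def fetch_third_party(root, third_party_dir):
    """Fetch submodules only if the third-party directory looks unpopulated.

    Returns True if the fetch was run, False if it was skipped.
    """
    if not needs_fetch(Path(root) / third_party_dir):
        return False
    # check=True raises CalledProcessError if git exits with a failure.
    subprocess.run(submodule_command(), cwd=root, check=True)
    return True
```

A build script could call `fetch_third_party(root, "third_party")` (the directory name is hypothetical) before configuring the native build, so that installs from a plain clone and `sdist` builds see the same sources.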
-
nateanl authored
-
- 02 Nov, 2021 11 commits
-
-
moto authored
Include the list of labels.
-
yangarbiter authored
-
Caroline Chen authored
-
moto authored
-
yangarbiter authored
-
nateanl authored
-
nateanl authored
-
moto authored
-
moto authored
-
moto authored
* Add wav2vec2 ASR German pretrained model from voxpopuli
- 01 Nov, 2021 2 commits
-
-
nateanl authored
-
Harish Kulkarni authored
-
- 30 Oct, 2021 1 commit
-
-
nateanl authored
-
- 29 Oct, 2021 3 commits
-
-
Caroline Chen authored
-
Caroline Chen authored
-
Caroline Chen authored
-
- 28 Oct, 2021 2 commits
-
-
S Harish authored
-
Caroline Chen authored
-
- 27 Oct, 2021 4 commits
- 26 Oct, 2021 2 commits
-
-
moto authored
In Wav2Vec2 ASR pipelines, the `get_model` method performs on-the-fly model surgery to remove unused dimensions common to all Wav2Vec2 models trained with fairseq. In VoxPopuli, there seems to be an extra dimension introduced by some issue in the preprocessing stage. For example, the label `1` shows up in the training datasets for German (1 out of 16M), English (1 / 28M), Spanish (1 / 9.4M), Romanian (1 / 4.7M) and Polish (6 / 5.8M). This change allows the excluded dimensions to be customized for such cases.
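The kind of surgery described here amounts to dropping selected rows from the final output layer. The sketch below uses plain Python lists rather than tensors and is not the code from this commit: the function name `remove_output_dims` and all index values are hypothetical.

```python
def remove_output_dims(weight, bias, excluded):
    """Drop the output classes listed in `excluded` from a linear layer.

    weight: list of rows, one row per output class
    bias:   list with one entry per output class
    Returns the pruned (weight, bias) pair; the inputs are not modified.
    """
    excluded = set(excluded)
    keep = [i for i in range(len(weight)) if i not in excluded]
    return [weight[i] for i in keep], [bias[i] for i in keep]


# Toy layer with 5 output classes and 2 input features.
weight = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1], [3.0, 3.1], [4.0, 4.1]]
bias = [0.0, 1.0, 2.0, 3.0, 4.0]

# Hypothetical default exclusion, and a per-dataset override that
# additionally removes a spurious class.
default_w, default_b = remove_output_dims(weight, bias, excluded=(1, 3))
custom_w, custom_b = remove_output_dims(weight, bias, excluded=(1, 3, 4))
```

Passing the excluded indices as a parameter, rather than hard-coding them, is what lets a dataset with a spurious extra label use a different set from the default.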
-
S Harish authored
-
- 25 Oct, 2021 1 commit
-
-
moto authored
-
- 24 Oct, 2021 1 commit
-
-
moto authored
-
- 22 Oct, 2021 5 commits
- 21 Oct, 2021 2 commits
-
-
Matti Picus authored
-
moto authored
* [BC-breaking] Remove unused dimension from pretrained Wav2Vec2 ASR

  The Wav2Vec2 ASR pretrained weights that originate from fairseq have extra dimensions that have nothing to do with the ASR task: https://github.com/pytorch/fairseq/blob/c5ff181125c7e6126b49a85e5ebdd5f5b6a07914/fairseq/data/dictionary.py#L18-L37 These are masked during the loss computation in https://github.com/pytorch/fairseq/blob/c5ff181125c7e6126b49a85e5ebdd5f5b6a07914/fairseq/criterions/ctc.py#L126-L128 This change removes them.

* Use '-' for blank token representation.
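To illustrate how a `'-'` blank label is typically consumed downstream, here is a minimal greedy CTC decoding sketch. It is a generic illustration, not code from this commit: the function name `greedy_ctc_decode` and the three-symbol label set are made up for the example.

```python
BLANK = "-"


def greedy_ctc_decode(frame_indices, labels, blank=BLANK):
    """Collapse repeated frame predictions, then drop blank labels.

    frame_indices: best class index for each frame
    labels:        sequence mapping class index to label string
    """
    out = []
    prev = None
    for idx in frame_indices:
        # Emit a label only when the class changes and it is not blank.
        if idx != prev and labels[idx] != blank:
            out.append(labels[idx])
        prev = idx
    return "".join(out)


# Hypothetical label set with the blank at index 0.
labels = ("-", "a", "b")
decoded = greedy_ctc_decode([0, 1, 1, 0, 1, 2, 2, 0], labels)
```

The blank between the two runs of `a` is what keeps them from being merged into a single character.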
-