".github/vscode:/vscode.git/clone" did not exist on "c827e94da08bff143b8f364b53fbea998855ea0a"
- 03 Nov, 2021 2 commits
-
-
moto authored
This commit changes the build process so that the third party code are fetched automatically when `setup.py` is invoked. This is to allow 1. `build_sdist` to contain the third party library codes, so that source code distribution created at the time release tag is created is complete. 2. It makes it possible to do `pip install https://github.com/pytorch/audio.git`. Example: ``` !pip install 'cmake>=3.18' ninja !pip install --verbose git+https://github.com/pytorch/audio.git@auto-source ```
-
nateanl authored
-
- 02 Nov, 2021 11 commits
-
-
moto authored
Include the list of labels.
-
yangarbiter authored
-
Caroline Chen authored
-
moto authored
-
yangarbiter authored
-
nateanl authored
-
nateanl authored
-
moto authored
-
moto authored
-
moto authored
* Add wav2vec2 ASR German pretrained model from voxpopuli
- 01 Nov, 2021 2 commits
-
-
nateanl authored
-
Harish Kulkarni authored
-
- 30 Oct, 2021 1 commit
-
-
nateanl authored
-
- 29 Oct, 2021 3 commits
-
-
Caroline Chen authored
-
Caroline Chen authored
-
Caroline Chen authored
-
- 28 Oct, 2021 2 commits
-
-
S Harish authored
-
Caroline Chen authored
-
- 27 Oct, 2021 4 commits
- 26 Oct, 2021 2 commits
-
-
moto authored
In Wav2Vec2 ASR pipelines, the `get_model` method performs on-the-fly model surgery to remove unused dimensions common to all the Wav2Vec2 model trained with fairseq. In VoxPopuli, there seems to be an extra dimensions introduced due to some issue in the preprocessing stage. For example, the label `1` shows up in the training dataset of German (1 out of 16M), English (1 / 28M), Spanish (1 / 9.4M), Romanian (1 / 4.7M) and Polish (6 / 5.8M). This code changes will allow the customization of excluded dimensions for such cases.
-
S Harish authored
-
- 25 Oct, 2021 1 commit
-
-
moto authored
-
- 24 Oct, 2021 1 commit
-
-
moto authored
-
- 22 Oct, 2021 5 commits
- 21 Oct, 2021 6 commits
-
-
Matti Picus authored
-
moto authored
* [BC-breaking] Remove unused dimension from pretrained Wav2Vec2 ASR The Wav2Vec2 ASR pretrained weights originated from fairseq have extra dimension that have nothing to do with the ASR task. https://github.com/pytorch/fairseq/blob/c5ff181125c7e6126b49a85e5ebdd5f5b6a07914/fairseq/data/dictionary.py#L18-L37 which is masked during the loss computation as https://github.com/pytorch/fairseq/blob/c5ff181125c7e6126b49a85e5ebdd5f5b6a07914/fairseq/criterions/ctc.py#L126-L128 This change removes it. * Use '-' for blank token representation.
-
moto authored
-
Caroline Chen authored
-
nateanl authored
-
Nikita Shulga authored
This is follow up after https://github.com/pytorch/builder/pull/823 that gets rids of `feature` and migrate it to `run_constrained`
-