Commits · 1852d3e16db2942fb2105bd5afd67a3149e7286e · OpenDAS / Torchaudio

04 Nov, 2021 1 commit
- Update CONTRIBUTING.md (#1975) · 1852d3e1
  moto authored Nov 03, 2021
```
After #1966, `git submodule` is performed automatically.
```
  1852d3e1
03 Nov, 2021 5 commits

[BC-Breaking] Drop pseudo complex support from phase_vocoder / TimeStretch (#1957) · d3e146fd
moto authored Nov 03, 2021
```
Following the plan #1337, this commit drops the support for pseudo complex type from `F.phase_vocoder` and `T.TimeStretch`.
```
d3e146fd

[BC-Breaking] Drop pseudo complex support from spectrogram (#1958) · 5ec6ada6

moto authored Nov 03, 2021

Following the plan #1337, this commit drops the support for pseudo complex type from 
`F.spectrogram` and `T.Spectrogram`.

It also deprecates the use of `return_complex` argument.

5ec6ada6

Add wav2vec2 ASR English pretrained model from voxpopuli (#1956) · f2eec77b
moto authored Nov 03, 2021

f2eec77b

Fetch third party sources automatically (#1966) · af336d66

moto authored Nov 03, 2021

This commit changes the build process so that the third party code are fetched automatically when `setup.py` is invoked.

This is to allow
1. `build_sdist` to contain the third party library codes, so that source code distribution created at the time release tag is created is complete.
2. It makes it possible to do `pip install https://github.com/pytorch/audio.git`.
    Example:
    ```
    !pip install 'cmake>=3.18' ninja
    !pip install --verbose git+https://github.com/pytorch/audio.git@auto-source
    ```

af336d66

Support multi-node training for source separation pipeline (#1968) · ad899151
nateanl authored Nov 03, 2021

ad899151

02 Nov, 2021 11 commits
- Revert "Update PR label notifier (#1964)" (#1965) · 15ab315c
  moto authored Nov 02, 2021
```
This reverts commit 0822fc05.
```
  15ab315c
- Update PR label notifier (#1964) · 0822fc05
  moto authored Nov 02, 2021
```
Include the list of labels.
```
  0822fc05
- Add citation information in the documentation (#1962) · 8a93717c
  yangarbiter authored Nov 02, 2021
  
  8a93717c
- Rename bug fix label (#1961) · 8c78273b
  Caroline Chen authored Nov 02, 2021
  
  8c78273b
- Run integration tests on CI (#1939) · 5594eae6
  moto authored Nov 02, 2021
  
  5594eae6
- Add citation information (#1947) · 3c021f1f
  yangarbiter authored Nov 02, 2021
  
  3c021f1f
- Refactor collecting-PR script for release note (#1951) · 42e67972
  nateanl authored Nov 02, 2021
  
  42e67972
- Fix bandit failure (#1960) · 6206d301
  nateanl authored Nov 02, 2021
  
  6206d301
- Add wav2vec2 ASR Italian pretrained model from voxpopuli (#1954) · 5c8541b7
  moto authored Nov 02, 2021
  
  5c8541b7
- Update requirements.txt for doc build (#1955) · 108e93af
  moto authored Nov 02, 2021
  
  108e93af
- Add wav2vec2 ASR German pretrained model from voxpopuli (#1953) · e15431b7
  moto authored Nov 01, 2021
```
* Add wav2vec2 ASR German pretrained model from voxpopuli
```
  e15431b7
01 Nov, 2021 2 commits
- Add melkwargs setting for MFCC in HuBERT pipeline (#1949) · 184466a9
  nateanl authored Nov 01, 2021
  
  184466a9
- Updated compatibility matrix to include LTS version (#1896) · 0c48eddf
  Harish Kulkarni authored Nov 01, 2021
  
  0c48eddf
30 Oct, 2021 1 commit
- Add preprocessing scripts for HuBERT model training (#1911) · 4fa77623
  nateanl authored Oct 30, 2021
  
  4fa77623
29 Oct, 2021 3 commits
- Fix PR labeling requirement (#1946) · 207d8119
  Caroline Chen authored Oct 29, 2021
  
  207d8119
- Improve backend and transforms docs (#1944) · 0f8014f5
  Caroline Chen authored Oct 29, 2021
  
  0f8014f5
- Add script to collect PRs between commits (#1943) · f2dff4d4
  Caroline Chen authored Oct 29, 2021
  
  f2dff4d4
28 Oct, 2021 2 commits
- Remove F.complex_norm and T.ComplexNorm (#1942) · ab50909d
  S Harish authored Oct 28, 2021
  
  ab50909d
- Notify merger if PR is incorrectly labeled (#1937) · 7e5f8021
  Caroline Chen authored Oct 28, 2021
  
  7e5f8021
27 Oct, 2021 4 commits
- Tweak wav2vec2 checkpoint conversion tool (#1938) · 3ca81107
  moto authored Oct 27, 2021
  
  3ca81107
- Update smoke test docker image (#1905) · 18685a51
  moto authored Oct 27, 2021
  
  18685a51
- Remove deprecated F.angle (#1935) · 1d3dcdbd
  S Harish authored Oct 27, 2021
  
  1d3dcdbd
- Add wav2vec2 ASR Spanish pretrained model from voxpopuli (#1924) · 3a599315
  moto authored Oct 26, 2021
  
  3a599315
26 Oct, 2021 2 commits

Allow the customization of axis exclusion for ASR head (#1932) · 56f3b927

moto authored Oct 26, 2021

In Wav2Vec2 ASR pipelines, the `get_model` method performs on-the-fly model
surgery to remove unused dimensions common to all the Wav2Vec2 model trained
with fairseq.

In VoxPopuli, there seems to be an extra dimensions introduced due to
some issue in the preprocessing stage.

For example, the label `1` shows up in the training dataset of German (1 out of 16M),
English (1 / 28M), Spanish (1 / 9.4M), Romanian (1 / 4.7M) and Polish (6 / 5.8M).

This code changes will allow the customization of excluded dimensions for such cases.

56f3b927

Remove deprecated `F.magphase` (#1934) · d35ea80e
S Harish authored Oct 26, 2021

d35ea80e

25 Oct, 2021 1 commit
- Add pretrained French ASR from voxpopuli (#1919) · cbf267c3
  moto authored Oct 25, 2021
  
  cbf267c3
24 Oct, 2021 1 commit
- Add anaconda stats to README (#1910) · 9355e20e
  moto authored Oct 23, 2021
  
  9355e20e
22 Oct, 2021 5 commits
- Refactor wav2vec2 pipeline util (#1925) · 47ca3aa9
  moto authored Oct 22, 2021
  
  47ca3aa9
- Refactor integration test (#1922) · 19d8f1c2
  moto authored Oct 22, 2021
```
- Make the test support other languages
- Fetch tetst asset on-the-fly
```
  19d8f1c2
- Add tool to convert voxpopuli model (#1923) · 716aa416
  moto authored Oct 22, 2021
  
  716aa416
- Remove MACOSX_DEPLOYMENT_TARGET (#1880) · a7161298
  moto authored Oct 21, 2021
```
It seems that this is no longer necessary for recent macOS .
```
  a7161298
- Add more example to TTS doc (#1917) · fe1ca374
  moto authored Oct 21, 2021
  
  fe1ca374
21 Oct, 2021 2 commits

fix formatting CIRCLECI_TAG when building docs (#1915) · 31dbb754
Matti Picus authored Oct 22, 2021

31dbb754

[BC-breaking] Remove unused dimension from pretrained Wav2Vec2 ASR (#1914) · ec4837dc

moto authored Oct 21, 2021

* [BC-breaking] Remove unused dimension from pretrained Wav2Vec2 ASR

The Wav2Vec2 ASR pretrained weights originated from fairseq have
extra dimension that have nothing to do with the ASR task.

https://github.com/pytorch/fairseq/blob/c5ff181125c7e6126b49a85e5ebdd5f5b6a07914/fairseq/data/dictionary.py#L18-L37

which is masked during the loss computation as

https://github.com/pytorch/fairseq/blob/c5ff181125c7e6126b49a85e5ebdd5f5b6a07914/fairseq/criterions/ctc.py#L126-L128

This change removes it.

* Use '-' for blank token representation.

ec4837dc