Commit 0092aa3c authored by Zhaoheng Ni, committed by Facebook GitHub Bot

Fix hubert fine-tuning recipe bugs (#2588)

Summary:
- The optimizer in the fine-tuning recipe should also be `AdamW`, see https://github.com/pytorch/audio/pull/2412. A minimal sketch of the corrected setup appears after this list.
- Fix the import of `DistributedBatchSampler` in the HuBERT dataset package so it is re-exported with the other dataset utilities.
- Fix the `dataset_path` argument name in the fine-tuning module.
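
A minimal sketch of the corrected optimizer setup, using toy stand-ins for the recipe's modules (the real recipe builds these inside its fine-tuning LightningModule; `betas` here is a placeholder value, not necessarily what the recipe uses):

```
import torch

# Toy stand-ins for the recipe's modules (assumptions for illustration only).
encoder = torch.nn.Linear(768, 768)  # stands in for model.wav2vec2.encoder
aux = torch.nn.Linear(768, 29)       # stands in for the auxiliary CTC head

learning_rate = 5e-5   # matches the sample command in the README
betas = (0.9, 0.98)    # placeholder; the recipe passes in its own betas

# The fix: AdamW (decoupled weight decay) instead of Adam, over the
# auxiliary head and the encoder parameters only.
optimizer = torch.optim.AdamW(
    list(aux.parameters()) + list(encoder.parameters()),
    lr=learning_rate,
    betas=betas,
)
```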

Pull Request resolved: https://github.com/pytorch/audio/pull/2588

Reviewed By: carolineechen

Differential Revision: D38243423

Pulled By: nateanl

fbshipit-source-id: badc88ce9eddfd71270201a65ae89433fae2733f
parent d84ce3b2
@@ -50,7 +50,7 @@ Sample SLURM command for fine-tuning on `10h` subset of `LibriLightLimited` data
 ```
 srun --gpus-per-node=1 -N 1 --ntasks-per-node=1 --cpus-per-task=10 \
 python finetune.py --dataset-path /root/datasets/ --exp-dir ./exp_finetune \
---checkpoint /exp_iter2/checkpoints_librispeech_hubert_pretrain_base/epoch=361-step=399999.ckpt \
+--checkpoint ./exp_iter2/checkpoints_librispeech_hubert_pretrain_base/epoch=361-step=399999.ckpt \
 --gpus 1 --debug --warmup-updates 2000 --hold-updates 8000 --decay-updates 10000 --max-updates 20000 --learning-rate 5e-5
 ```
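
The `--warmup-updates`, `--hold-updates`, and `--decay-updates` flags above describe a tri-stage learning-rate schedule. A rough sketch of the multiplier such a schedule applies, where the piecewise-linear shape and the initial/final scale factors are assumptions rather than the recipe's actual scheduler:

```
def tri_stage_lr_factor(step, warmup, hold, decay, init_scale=0.01, final_scale=0.05):
    """Learning-rate multiplier: linear warmup, constant hold, linear decay."""
    if step < warmup:
        # Ramp linearly from init_scale up to 1.0 over the warmup updates.
        return init_scale + (1.0 - init_scale) * step / warmup
    if step < warmup + hold:
        # Hold at the peak learning rate.
        return 1.0
    if step < warmup + hold + decay:
        # Decay linearly from 1.0 down to final_scale.
        progress = (step - warmup - hold) / decay
        return 1.0 - (1.0 - final_scale) * progress
    return final_scale

# With the flags from the sample command: 2000 warmup, 8000 hold, 10000 decay updates.
print(tri_stage_lr_factor(1000, 2000, 8000, 10000))   # mid-warmup, about 0.5
print(tri_stage_lr_factor(15000, 2000, 8000, 10000))  # mid-decay, about 0.5
```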
@@ -4,6 +4,7 @@ from .hubert_dataset import (
     BucketizeBatchSampler,
     CollateFnHubert,
     CollateFnLibriLightLimited,
+    DistributedBatchSampler,
     HuBERTDataSet,
 )
@@ -14,5 +15,6 @@ __all__ = [
     "BucketizeBatchSampler",
     "CollateFnHubert",
     "CollateFnLibriLightLimited",
+    "DistributedBatchSampler",
     "HuBERTDataSet",
 ]
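
With `DistributedBatchSampler` re-exported alongside the other dataset utilities, the fine-tuning module can import it from the dataset package. For context, a generic illustration of what such a sampler does, sharding a batch sampler's output across data-parallel ranks; this is a minimal stand-in, not the repository's `DistributedBatchSampler` implementation:

```
class ShardedBatchSampler:
    """Give each rank every num_replicas-th batch produced by a base batch sampler."""

    def __init__(self, batch_sampler, num_replicas, rank):
        self.batch_sampler = batch_sampler
        self.num_replicas = num_replicas
        self.rank = rank

    def __iter__(self):
        for i, batch in enumerate(self.batch_sampler):
            if i % self.num_replicas == self.rank:
                yield batch

# Tiny demonstration with a plain list of index batches standing in for
# the output of BucketizeBatchSampler.
batches = [[0, 1], [2, 3], [4, 5], [6, 7]]
print(list(ShardedBatchSampler(batches, num_replicas=2, rank=0)))  # [[0, 1], [4, 5]]
print(list(ShardedBatchSampler(batches, num_replicas=2, rank=1)))  # [[2, 3], [6, 7]]
```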
@@ -75,7 +75,7 @@ def run_train(args):
         mask_channel_length=args.mask_channel_length,
         aux_num_out=args.aux_num_out,
         checkpoint=args.checkpoint,
-        dataset_paths=args.dataset_path,
+        dataset_path=args.dataset_path,
         seconds_per_batch=args.seconds_per_batch,
         subset=args.subset,
         learning_rate=args.learning_rate,
@@ -273,7 +273,7 @@ class HuBERTFineTuneModule(LightningModule):
         for p in self.model.wav2vec2.feature_extractor.parameters():
             p.requires_grad = False
         self.loss_fn = torch.nn.CTCLoss(blank=0, reduction="sum", zero_infinity=True)
-        self.optimizer = torch.optim.Adam(
+        self.optimizer = torch.optim.AdamW(
             list(self.aux.parameters()) + list(self.model.wav2vec2.encoder.parameters()),
             lr=learning_rate,
             betas=betas,