Commit d0ebcec4 authored by Myle Ott, committed by Facebook Github Bot

Print model and number of trained params

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/469

Differential Revision: D13802945

Pulled By: myleott

fbshipit-source-id: b6976506a8336b96ee40505c4a7638541cc99c95
parent 38f1dee9
@@ -44,8 +44,12 @@ def main(args):
     # Build model and criterion
     model = task.build_model(args)
     criterion = task.build_criterion(args)
+    print(model)
     print('| model {}, criterion {}'.format(args.arch, criterion.__class__.__name__))
-    print('| num. model params: {}'.format(sum(p.numel() for p in model.parameters())))
+    print('| num. model params: {} (num. trained: {})'.format(
+        sum(p.numel() for p in model.parameters()),
+        sum(p.numel() for p in model.parameters() if p.requires_grad),
+    ))
 
     # Make a dummy batch to (i) warm the caching allocator and (ii) as a
     # placeholder DistributedDataParallel when there's an uneven number of
...
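For context, here is a minimal standalone sketch (not part of the commit; the nn.Linear toy model and the frozen bias are illustrative assumptions) of the counting idiom the diff introduces, distinguishing total parameters from those that will receive gradients:

    import torch.nn as nn

    # Hypothetical toy model, used only to make the two counts differ.
    model = nn.Linear(10, 2)
    model.bias.requires_grad = False  # freeze the bias for illustration

    # Same idiom as the diff: total params vs. params with requires_grad set.
    num_total = sum(p.numel() for p in model.parameters())
    num_trained = sum(p.numel() for p in model.parameters() if p.requires_grad)
    print('| num. model params: {} (num. trained: {})'.format(num_total, num_trained))
    # prints: | num. model params: 22 (num. trained: 20)

Reporting both counts makes it obvious at startup when parts of a model are frozen, e.g. during fine-tuning.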