OpenDAS / Megatron-LM
Commit 3dadd16d
Authored Jun 07, 2021 by Deepak Narayanan
Update T5 scripts
Parent: 83c4d95a
Showing 3 changed files with 9 additions and 6 deletions:
examples/pretrain_t5.sh (+3 -2)
examples/pretrain_t5_distributed.sh (+3 -2)
examples/pretrain_t5_distributed_with_mp.sh (+3 -2)
examples/pretrain_t5.sh
@@ -15,7 +15,7 @@ python pretrain_t5.py \
        --encoder-seq-length 512 \
        --decoder-seq-length 128 \
        --micro-batch-size 16 \
-       --global-batch-size 2048 \
+       --global-batch-size 16 \
        --max-position-embeddings 512 \
        --train-iters 1000000 \
        --lr-decay-iters 1000000 \
@@ -35,4 +35,5 @@ python pretrain_t5.py \
        --save-interval 10000 \
        --eval-interval 1000 \
        --eval-iters 10 \
-       --fp16
+       --fp16 \
+       --vocab-extra-ids 100
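The last hunk appends `--vocab-extra-ids 100` after `--fp16`, which is why `--fp16` also gains a trailing backslash: without it the shell would end the command at `--fp16` and try to run the next line separately. A minimal sketch of that behaviour follows; `show_args` is a hypothetical stand-in for the training script and is not part of the repository.

```shell
# Hypothetical stand-in for the training script: prints one argument per line.
show_args() {
    for arg in "$@"; do
        printf '%s\n' "$arg"
    done
}

# The backslash after --fp16 continues the command onto the next line,
# so --vocab-extra-ids and 100 arrive as arguments of the same command.
show_args --eval-iters 10 \
          --fp16 \
          --vocab-extra-ids 100
```

All five tokens reach the same command. Dropping the backslash after `--fp16` would leave `--vocab-extra-ids 100` to be interpreted as a command of its own.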
examples/pretrain_t5_distributed.sh
@@ -24,7 +24,7 @@ python -m torch.distributed.launch $DISTRIBUTED_ARGS \
        --encoder-seq-length 512 \
        --decoder-seq-length 128 \
        --micro-batch-size 16 \
-       --global-batch-size 2048 \
+       --global-batch-size 128 \
        --max-position-embeddings 512 \
        --train-iters 1000000 \
        --lr-decay-iters 1000000 \
@@ -44,4 +44,5 @@ python -m torch.distributed.launch $DISTRIBUTED_ARGS \
        --save-interval 10000 \
        --eval-interval 1000 \
        --eval-iters 10 \
-       --fp16
+       --fp16 \
+       --vocab-extra-ids 100
examples/pretrain_t5_distributed_with_mp.sh
@@ -24,7 +24,7 @@ python -m torch.distributed.launch $DISTRIBUTED_ARGS \
        --encoder-seq-length 512 \
        --decoder-seq-length 128 \
        --micro-batch-size 16 \
-       --global-batch-size 2048 \
+       --global-batch-size 128 \
        --seq-length 512 \
        --max-position-embeddings 512 \
        --train-iters 1000000 \
@@ -45,4 +45,5 @@ python -m torch.distributed.launch $DISTRIBUTED_ARGS \
        --save-interval 10000 \
        --eval-interval 1000 \
        --eval-iters 10 \
-       --fp16
+       --fp16 \
+       --vocab-extra-ids 100
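All three scripts keep `--micro-batch-size 16` and lower `--global-batch-size` from 2048 to 16 or 128. As far as I understand Megatron-LM, the number of micro-batches accumulated per iteration is the global batch size divided by (micro-batch size x data-parallel size), and that division must be exact. The sketch below works through the arithmetic. The helper name `num_microbatches` and the data-parallel sizes are assumptions for illustration, since the diff does not show GPU counts or model-parallel settings.

```shell
# Sketch: micro-batches (gradient-accumulation steps) per training iteration.
# Usage: num_microbatches GLOBAL_BATCH MICRO_BATCH DATA_PARALLEL_SIZE
num_microbatches() {
    per_step=$(( $2 * $3 ))            # samples consumed by one micro-batch step
    if [ $(( $1 % per_step )) -ne 0 ]; then
        echo "global batch size $1 is not a multiple of $per_step" >&2
        return 1
    fi
    echo $(( $1 / per_step ))
}

# Data-parallel sizes below are assumed, not taken from the diff.
num_microbatches 2048 16 1   # old value, single process       -> 128
num_microbatches 16 16 1     # new value, single process       -> 1
num_microbatches 128 16 8    # new value, 8 data-parallel ranks -> 1
num_microbatches 128 16 4    # new value, 4 data-parallel ranks -> 2
```

Under these assumed sizes the new values need one or two micro-batches per iteration, where 2048 on a single process would need 128.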