Commit 4a35d50a authored by Jared Casper's avatar Jared Casper
Browse files

Merge branch 'patch-1' of https://github.com/stas00/Megatron-LM into github-pr

parents e1318f0e 3f755374
......@@ -25,7 +25,6 @@ python -m torch.distributed.launch $DISTRIBUTED_ARGS \
--decoder-seq-length 128 \
--micro-batch-size 16 \
--global-batch-size 128 \
--seq-length 512 \
--max-position-embeddings 512 \
--train-iters 1000000 \
--lr-decay-iters 1000000 \
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment