Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
Megatron-LM
Commits
7d462a77
Commit
7d462a77
authored
Dec 04, 2024
by
wxj
Browse files
Update llama_pretraining.sh
parent
b44f3138
Pipeline
#2038
passed with stage
Changes
1
Pipelines
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
22 additions
and
4 deletions
+22
-4
llama_pretraining.sh
llama_pretraining.sh
+22
-4
No files found.
llama_pretraining.sh
View file @
7d462a77
...
@@ -98,11 +98,29 @@ EVAL_AND_LOGGING_ARGS=(
...
@@ -98,11 +98,29 @@ EVAL_AND_LOGGING_ARGS=(
--tensorboard-dir
$TENSORBOARD_LOGS_PATH
--tensorboard-dir
$TENSORBOARD_LOGS_PATH
)
)
NNODES
=
1
NODE_RANK
=
0
MASTER_ADDR
=
localhost
while
[
$#
-gt
0
]
do
case
$1
in
--NNODES
)
NNODES
=
$2
;
shift
;;
--NODE_RANK
)
NODE_RANK
=
$2
;
shift
;;
--MASTER_ADDR
)
MASTER_ADDR
=
$2
;
shift
;;
(
*
)
break
;;
esac
shift
done
DISTRIBUTED_ARGS
=(
DISTRIBUTED_ARGS
=(
--nproc_per_node
4
--nproc_per_node
2
--nnodes
1
--nnodes
$NNODES
--node_rank
0
--node_rank
$NODE_RANK
--master_addr
localhost
--master_addr
$MASTER_ADDR
--master_port
29500
--master_port
29500
)
)
export
HIP_VISIBLE_DEVICES
=
0,1,2,3
#4,5,6,7
export
HIP_VISIBLE_DEVICES
=
0,1,2,3
#4,5,6,7
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment