OpenDAS / ColossalAI
Commit e00cedd1 (unverified)
Authored Jan 04, 2023 by HELSON; committed by GitHub on Jan 04, 2023
[example] update gemini benchmark bash (#2306)
Parent: 9b765e7a

Showing 3 changed files with 28 additions and 24 deletions

examples/language/gpt/benchmark_gemini.sh  +19 -21
examples/language/gpt/run_gemini.sh  +8 -1
examples/language/gpt/train_gpt_demo.py  +1 -2

examples/language/gpt/benchmark_gemini.sh @ e00cedd1
-for MODEL_TYPE in "gpt2_medium"
-do
-for BATCH_SIZE in 16
-do
-for GPUNUM in 1 2 4 8
-do
-for TPDEGREE in 1 2 4 8
-do
-if [ ${TPDEGREE} -gt ${GPUNUM} ]
-then
-continue
-fi
-echo "****************** Begin ***************************"
-echo "* benchmrking MODEL_TYPE ${MODEL_TYPE} BS ${BATCH_SIZE} BS ${BS} GPUNUM ${GPUNUM} TPDEGREE ${TPDEGREE}"
-MODEL_TYPE=${MODEL_TYPE} BATCH_SIZE=${BATCH_SIZE} GPUNUM=${GPUNUM} TPDEGREE=${TPDEGREE} bash ./run_gemini.sh
-echo "****************** Finished ***************************"
-echo ""
-echo ""
-done
-done
-done
+for MODEL_TYPE in "gpt2_medium"; do
+  for BATCH_SIZE in 16; do
+    for GPUNUM in 1 2 4 8; do
+      for TPDEGREE in 1 2 4 8; do
+        if [ ${TPDEGREE} -gt ${GPUNUM} ]; then
+          continue
+        fi
+        for PLACEMENT in "cpu" "auto"; do
+          echo "****************** Begin ***************************"
+          echo "* benchmrking MODEL_TYPE ${MODEL_TYPE} BS ${BATCH_SIZE} BS ${BS} GPUNUM ${GPUNUM} TPDEGREE ${TPDEGREE} PLACEMENT ${PLACEMENT}"
+          MODEL_TYPE=${MODEL_TYPE} BATCH_SIZE=${BATCH_SIZE} GPUNUM=${GPUNUM} TPDEGREE=${TPDEGREE} PLACEMENT=${PLACEMENT} \
+          bash ./run_gemini.sh
+          echo "****************** Finished ***************************"
+          echo ""
+          echo ""
+        done
+      done
+    done
+  done
 done
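
For reference, the sweep that the updated benchmark_gemini.sh walks through can be enumerated ahead of time. The following is only a sketch in Python, not part of the commit: the list values are copied from the loops in the diff, while the function name `benchmark_configs` and the dict layout are assumptions made for illustration.

```python
from itertools import product

# Values copied from the loops in benchmark_gemini.sh.
MODEL_TYPES = ["gpt2_medium"]
BATCH_SIZES = [16]
GPU_NUMS = [1, 2, 4, 8]
TP_DEGREES = [1, 2, 4, 8]
PLACEMENTS = ["cpu", "auto"]  # the loop added by this commit


def benchmark_configs():
    """Yield one dict per run_gemini.sh invocation, in the script's loop order."""
    # product() varies the last argument fastest, like the innermost shell loop.
    for model, bs, gpus, tp, placement in product(
        MODEL_TYPES, BATCH_SIZES, GPU_NUMS, TP_DEGREES, PLACEMENTS
    ):
        # Mirrors `if [ ${TPDEGREE} -gt ${GPUNUM} ]; then continue; fi`.
        if tp > gpus:
            continue
        yield {
            "MODEL_TYPE": model,
            "BATCH_SIZE": bs,
            "GPUNUM": gpus,
            "TPDEGREE": tp,
            "PLACEMENT": placement,
        }


configs = list(benchmark_configs())
print(len(configs))  # 20
print(configs[0])
```

Ten (GPUNUM, TPDEGREE) pairs survive the `TPDEGREE <= GPUNUM` filter, so adding the two placement policies doubles the sweep from 10 to 20 runs.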

examples/language/gpt/run_gemini.sh @ e00cedd1
...
@@ -10,4 +10,11 @@ export BATCH_SIZE=${BATCH_SIZE:-16}
 export MODEL_TYPE=${MODEL_TYPE:-"gpt2_medium"}
 mkdir -p gemini_logs
-torchrun --standalone --nproc_per_node=${GPUNUM} train_gpt_demo.py --tp_degree=${TPDEGREE} --model_type=${MODEL_TYPE} --batch_size=${BATCH_SIZE} --placement ${PLACEMENT} --shardinit ${USE_SHARD_INIT} --distplan ${DISTPAN} 2>&1 | tee ./gemini_logs/${MODEL_TYPE}_${DISTPAN}_gpu_${GPUNUM}_bs_${BATCH_SIZE}_tp_${TPDEGREE}.log
+torchrun --standalone --nproc_per_node=${GPUNUM} train_gpt_demo.py \
+--tp_degree=${TPDEGREE} \
+--model_type=${MODEL_TYPE} \
+--batch_size=${BATCH_SIZE} \
+--placement=${PLACEMENT} \
+--shardinit=${USE_SHARD_INIT} \
+--distplan=${DISTPAN} \
+2>&1 | tee ./gemini_logs/${MODEL_TYPE}_${DISTPAN}_gpu_${GPUNUM}_bs_${BATCH_SIZE}_tp_${TPDEGREE}_${PLACEMENT}.log
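
The new `_${PLACEMENT}` suffix matters because `tee` without `-a` truncates its target: with the old name, a "cpu" run and an "auto" run of the same configuration would write to the same file. A small Python sketch of the two naming schemes shows this. The helper names `old_log_path` and `new_log_path` are hypothetical, and "colossalai" is only an assumed placeholder for `DISTPAN`, whose actual values are not visible in this diff.

```python
def old_log_path(model_type, distplan, gpunum, batch_size, tp_degree):
    """Log path as built before this commit (no placement suffix)."""
    return (
        f"./gemini_logs/{model_type}_{distplan}_gpu_{gpunum}"
        f"_bs_{batch_size}_tp_{tp_degree}.log"
    )


def new_log_path(model_type, distplan, gpunum, batch_size, tp_degree, placement):
    """Log path as built after this commit (placement policy appended)."""
    return (
        f"./gemini_logs/{model_type}_{distplan}_gpu_{gpunum}"
        f"_bs_{batch_size}_tp_{tp_degree}_{placement}.log"
    )


# "colossalai" is an assumed example value for DISTPAN, not taken from the diff.
common = dict(
    model_type="gpt2_medium",
    distplan="colossalai",
    gpunum=4,
    batch_size=16,
    tp_degree=2,
)

# Sets collapse identical paths, so their sizes show how many distinct logs exist.
old_paths = {old_log_path(**common) for _ in ("cpu", "auto")}
new_paths = {new_log_path(**common, placement=p) for p in ("cpu", "auto")}

print(len(old_paths), len(new_paths))  # 1 2
print(sorted(new_paths))
```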

examples/language/gpt/train_gpt_demo.py @ e00cedd1
...
@@ -217,8 +217,7 @@ def build_gemini(model: torch.nn.Module, pg: ProcessGroup, placement_policy: str
 def main():
     # version check
-    # this example is supposed to work for versions less than 0.2.0 but greater than 0.1.9
-    assert version.parse(CAI_VERSION) < version.parse("0.2.0")
+    # this example is supposed to work for versions greater than 0.1.9
     assert version.parse(CAI_VERSION) >= version.parse("0.1.9")
     set_cpu_maximum_parallelism()
...
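
The effect of dropping the upper bound can be illustrated with a stdlib-only sketch. Note the assumptions: `release_tuple` is a simplified stand-in for the `version.parse` call used in the script and handles only plain dotted release numbers (no pre-release or local tags), and `old_check` / `new_check` are hypothetical names for the assertions before and after this commit.

```python
def release_tuple(v: str) -> tuple:
    """Turn a plain dotted version such as '0.1.10' into (0, 1, 10)."""
    return tuple(int(part) for part in v.split("."))


def old_check(v: str) -> bool:
    """Before the commit: 0.1.9 <= version < 0.2.0."""
    return (0, 1, 9) <= release_tuple(v) < (0, 2, 0)


def new_check(v: str) -> bool:
    """After the commit: version >= 0.1.9, no upper bound."""
    return release_tuple(v) >= (0, 1, 9)


for v in ("0.1.8", "0.1.9", "0.1.10", "0.2.0"):
    print(v, old_check(v), new_check(v))
```

Comparing integer tuples rather than raw strings keeps "0.1.10" ordered after "0.1.9". Under these assumptions only "0.2.0" changes outcome: the old check rejects it and the new one accepts it.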