OpenDAS / Megatron-LM
Commit 96eb05f5
Authored Nov 11, 2024 by liangjing
update
Parent: 2b09ea90
Pipeline #1890 passed
Showing 4 changed files with 13 additions and 9 deletions
scripts/llama2_13b.sh  +3 -1
scripts/llama2_70b.sh  +4 -5
scripts/llama2_7b.sh   +3 -1
scripts/qwen2_7b.sh    +3 -2
scripts/llama2_13b.sh
...
@@ -11,7 +11,9 @@ export NCCL_IB_TIMEOUT=22
export CUDA_DEVICE_MAX_CONNECTIONS=1
export NCCL_NET_GDR_LEVEL=SYS
export NCCL_NET_GDR_READ=0
export NVTE_FLASH_ATTN=1
export NVTE_FUSED_ATTN=0
lrank=$OMPI_COMM_WORLD_LOCAL_RANK
RANK=$OMPI_COMM_WORLD_RANK
...
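The two `NVTE_*` exports in this hunk read as a backend selection: flash attention switched on, fused attention switched off. The sketch below only illustrates that reading. `attn_backend` is a hypothetical helper that is not part of the repository, and it assumes that `1` enables a backend and `0` disables it. How the installed Transformer Engine build actually interprets these variables should be checked against its own documentation.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository): summarise which attention
# backend a given NVTE_FLASH_ATTN / NVTE_FUSED_ATTN pair asks for.
# Assumption: "1" enables a backend and "0" disables it.
attn_backend() {
    flash="$1"
    fused="$2"
    if [ "$flash" = "1" ] && [ "$fused" = "0" ]; then
        echo "flash"
    elif [ "$flash" = "0" ] && [ "$fused" = "1" ]; then
        echo "fused"
    elif [ "$flash" = "1" ] && [ "$fused" = "1" ]; then
        echo "either"
    else
        echo "neither"
    fi
}

# The values exported by the scripts in this commit.
export NVTE_FLASH_ATTN=1
export NVTE_FUSED_ATTN=0
echo "requested attention backend: $(attn_backend "$NVTE_FLASH_ATTN" "$NVTE_FUSED_ATTN")"
```

A check like this could run near the top of a launch script to fail fast when the two flags disagree with what the training arguments expect.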
scripts/llama2_70b.sh
...
@@ -6,14 +6,13 @@ export OMP_NUM_THREADS=1
export NCCL_P2P_LEVEL=SYS
export NCCL_ALGO=Ring
export NCCL_NCHANNELS_PER_PEER=16
export NCCL_MIN_NCHANNELS=20
export NCCL_IB_TIMEOUT=22
export CUDA_DEVICE_MAX_CONNECTIONS=1
export NCCL_IB_HCA=xx #based on your environment
export NCCL_NET_GDR_LEVEL=SYS
export NCCL_NET_GDR_READ=0
export NVTE_FLASH_ATTN=1
export NVTE_FUSED_ATTN=0
lrank=$OMPI_COMM_WORLD_LOCAL_RANK
RANK=$OMPI_COMM_WORLD_RANK
...
@@ -49,7 +48,7 @@ TRAINING_ARGS=(
     --clip-grad 1.0
     --bf16
     --use-distributed-optimizer
-    --use-flash-attn-triton
+    --use-flash-attn
     --disable-bias-linear
     --attention-dropout 0
     --hidden-dropout 0
...
scripts/llama2_7b.sh
...
@@ -10,7 +10,9 @@ export NCCL_IB_TIMEOUT=22
export CUDA_DEVICE_MAX_CONNECTIONS=1
export NCCL_NET_GDR_LEVEL=SYS
export NCCL_NET_GDR_READ=0
export NVTE_FLASH_ATTN=1
export NVTE_FUSED_ATTN=0
lrank=$OMPI_COMM_WORLD_LOCAL_RANK
RANK=$OMPI_COMM_WORLD_RANK
...
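Each script takes `lrank` and `RANK` from Open MPI's per-process environment, so both are empty when a script is started without `mpirun`. The sketch below is one possible way to guard against that and is not something the commit does. `resolve_ranks` and `WORLD_SIZE` are hypothetical names, and the single-process defaults are an assumption.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository): derive rank variables from
# Open MPI's environment, falling back to a single-process layout when the
# script is not started through mpirun.
resolve_ranks() {
    lrank="${OMPI_COMM_WORLD_LOCAL_RANK:-0}"   # rank within the node
    RANK="${OMPI_COMM_WORLD_RANK:-0}"          # rank across all nodes
    WORLD_SIZE="${OMPI_COMM_WORLD_SIZE:-1}"    # total number of processes
}

resolve_ranks
echo "global rank ${RANK} of ${WORLD_SIZE}, local rank ${lrank}"
```

The `${VAR:-default}` expansion leaves values set by `mpirun` untouched and only fills in defaults when a variable is unset or empty.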
scripts/qwen2_7b.sh
...
@@ -13,8 +13,9 @@ export CUDA_DEVICE_MAX_CONNECTIONS=1
export NCCL_IB_HCA=xx #based on your environment
export NCCL_NET_GDR_LEVEL=SYS
export NCCL_NET_GDR_READ=0
source /opt/dtk/env.sh
export NVTE_FLASH_ATTN=1
export NVTE_FUSED_ATTN=0
lrank=$OMPI_COMM_WORLD_LOCAL_RANK
...