"megatron/data/gpt_dataset.py" did not exist on "6c3f6c7bb582b4509b28b64c3772e56f11627b7f"
Merge branch 'sequence_parallel' into 'main'

Sequence parallelism + attention checkpoint

See merge request ADLR/megatron-lm!413