Commit ad00894f authored by Ziyue Jiang

polish

parent 9ae9e740
-# Auto-Parallelism with GPT2
+# Pipeline Parallelism Demo with GPT2
 ## Requirements
@@ -33,5 +33,6 @@ For simplicity, the input data is randomly generated here.
 ```bash
 #Run the Pipeline Parallel on GPT with default setting and a dummy dataset.
+#You can change the GPU number or the microbatch number in run.sh.
 bash run.sh
 ```
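
For reference, run.sh might look something like the sketch below. This is a hypothetical reconstruction: the script name `train_gpt_pp.py` and the `--num_microbatches` flag are illustrative assumptions, not the repository's actual interface; only the `torchrun` options are standard PyTorch.

```bash
#!/bin/bash
# Hypothetical sketch of run.sh. GPUNUM and --num_microbatches are
# illustrative assumptions; only the torchrun flags are standard PyTorch.
GPUNUM=${GPUNUM:-2}          # number of GPUs (pipeline stages)
MICROBATCH=${MICROBATCH:-4}  # microbatches per global batch

torchrun --standalone --nproc_per_node="${GPUNUM}" \
    train_gpt_pp.py --num_microbatches "${MICROBATCH}"
```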
import torch

# Randomly generated dummy data: token ids drawn uniformly from the
# vocabulary, paired with an all-ones attention mask.
def get_data(batch_size, seq_len, vocab_size):
    input_ids = torch.randint(0, vocab_size, (batch_size, seq_len), device=torch.cuda.current_device())
    attention_mask = torch.ones_like(input_ids)
    return input_ids, attention_mask

# Achieved TFLOPS per step, assuming ~8 FLOPs per parameter per token
# (forward + backward with activation recomputation); 1e-12 avoids division by zero.
def get_tflops(model_numel, batch_size, seq_len, step_time):
    return model_numel * batch_size * seq_len * 8 / 1e12 / (step_time + 1e-12)
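
As a minimal sketch of how these two helpers fit together, the snippet below times one step on a toy stand-in model. The batch and vocabulary sizes, the `torch.nn.Embedding` stand-in, and the dummy backward pass are all illustrative assumptions (a real run would use the GPT2 model and a full optimizer step), and a CUDA device is required because `get_data` places tensors on the current GPU.

```python
import time

# Hypothetical usage sketch: the Embedding below stands in for GPT2 so the
# example is self-contained; sizes are illustrative assumptions.
BATCH_SIZE, SEQ_LEN, VOCAB_SIZE = 4, 128, 50257

model = torch.nn.Embedding(VOCAB_SIZE, 64).to(torch.cuda.current_device())
model_numel = sum(p.numel() for p in model.parameters())

input_ids, attention_mask = get_data(BATCH_SIZE, SEQ_LEN, VOCAB_SIZE)

start = time.time()
hidden = model(input_ids)   # forward pass of the stand-in model
hidden.sum().backward()     # dummy backward pass
step_time = time.time() - start

print(f"step time {step_time:.4f}s, "
      f"{get_tflops(model_numel, BATCH_SIZE, SEQ_LEN, step_time):.2f} TFLOPS")
```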