Unverified commit e86d9bb2, authored by github-actions[bot] and committed by GitHub

[format] applied code formatting on changed files in pull request 3025 (#3026)


Co-authored-by: github-actions <github-actions@github.com>
parent cd2b0eaa
...
@@ -15,9 +15,9 @@ Use these code to train your reward model.
```shell
# Naive reward model training
python train_reward_model.py --pretrain <your model path> --model <your model type> --strategy naive
# use colossalai_zero2
torchrun --standalone --nproc_per_node=2 train_reward_model.py --pretrain <your model path> --model <your model type> --strategy colossalai_zero2
```
## Train with dummy prompt data (Stage 3)
...