@@ -10,7 +10,7 @@ Since the current pypi peft package(0.2) has some bugs, please install the peft
git clone https://github.com/huggingface/peft
cd peft
pip install .
```
```
# Usage
For SFT training, just call train_peft_sft.py
...
...
@@ -21,4 +21,4 @@ For stage-3 rlhf training, call train_peft_prompts.py.
Its arguments are almost idential to train_prompts.py. The only difference is that I use text files to indicate the prompt and pretrained data file. The models are included in easy_models.py. Currently only bloom models are tested, but technically gpt2/opt/llama should be supported.
# Dataformat
Please refer the formats in test_sft.txt, test_prompts.txt, test_pretrained.txt.
\ No newline at end of file
Please refer the formats in test_sft.txt, test_prompts.txt, test_pretrained.txt.
Easy SFT just accept a text file which can be read line by line. However the datasest will group texts together to max_length so LLM will learn the texts meaning better.
If individual lines are not related, just set is_group_texts to False.