• Camille Zhong's avatar
    [Doc] enhancement on README.md for chat examples (#3646) · 8bccb72c
    Camille Zhong authored
    * Add RoBERTa for RLHF Stage 2 & 3 (test)
    
    RoBERTa for RLHF Stage 2 & 3 (still in testing)
    
    Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
    
    This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.
    
    Add RoBERTa for RLHF stage 2 & 3
    
    1. add roberta folder under model folder
    2. add  roberta option in train_reward_model.py
    3. add some test in testci
    
    Update test_ci.sh
    
    Revert "Update test_ci.sh"
    
    This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.
    
    Add RoBERTa for RLHF Stage 2 & 3 (test)
    
    RoBERTa for RLHF Stage 2 & 3 (still in testing)
    
    Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
    
    This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.
    
    Add RoBERTa for RLHF stage 2 & 3
    
    1. add roberta folder under model folder
    2. add  roberta option in train_reward_model.py
    3. add some test in testci
    
    Update test_ci.sh
    
    Revert "Update test_ci.sh"
    
    This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.
    
    update roberta with coati
    
    chat ci update
    
    Revert "chat ci update"
    
    This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846.
    
    * Update README.md
    
    Update README.md
    
    * update readme
    
    * Update test_ci.sh
    8bccb72c
train_prompts.py 10.4 KB