• Wenhao Chen's avatar
    [chat] use official transformers and fix some issues (#4117) · 3d8d5d0d
    Wenhao Chen authored
    * feat: remove on_learn_epoch fn as not used
    
    * revert: add _on_learn_epoch fn
    
    * feat: remove NaiveStrategy
    
    * test: update train_prompts tests
    
    * fix: remove prepare_llama_tokenizer_and_embedding
    
    * test: add lora arg
    
    * feat: remove roberta support in train_prompts due to runtime errs
    
    * feat: remove deberta & roberta in rm as not used
    
    * test: remove deberta and roberta tests
    
    * feat: remove deberta and roberta models as not used
    
    * fix: remove calls to roberta
    
    * fix: remove prepare_llama_tokenizer_and_embedding
    
    * chore: update transformers version
    
    * docs: update transformers version
    
    * fix: fix actor inference
    
    * fix: fix ci
    
    * feat: change llama pad token to unk
    
    * revert: revert ddp setup_distributed
    
    * fix: change llama pad token to unk
    
    * revert: undo unnecessary changes
    
    * fix: use pip to install transformers
    3d8d5d0d
train_prompts.py 9.89 KB