• lvhan028's avatar
    Refactor the chat template of supported models using factory pattern (#144) · 7b470f07
    lvhan028 authored
    * refactor model.py and support baichuan-7b
    
    * remove model_name
    
    * remove hard session_len
    
    * export tokenizer.py to target dir
    
    * remove model_name from client
    
    * remove model_name
    
    * update
    
    * correct throughput equation
    
    * fix session.response
    
    * update serving.md
    
    * update readme
    
    * update according to review comments
    
    * update
    
    * update
    
    * update
    
    * update
    7b470f07
README.md 5.66 KB