"vscode:/vscode.git/clone" did not exist on "d19e8ad7abac44fb0093d7a0456de353d6f23d80"
  • lvhan028's avatar
    Refactor the chat template of supported models using factory pattern (#144) · 7b470f07
    lvhan028 authored
    * refactor model.py and support baichuan-7b
    
    * remove model_name
    
    * remove hard session_len
    
    * export tokenizer.py to target dir
    
    * remove model_name from client
    
    * remove model_name
    
    * update
    
    * correct throughput equation
    
    * fix session.response
    
    * update serving.md
    
    * update readme
    
    * update according to review comments
    
    * update
    
    * update
    
    * update
    
    * update
    7b470f07
profile_serving.py 6.27 KB