Support codellama (#359)
* tmp * add demo for codellama inference * update * update * update * update codellama.md * export rope_theta * update * update doc * fix client.py * define SamplingParam * rollback 'end' * rotary_emb_base to rotary_embedding_base * change to baichuan2-7b
Showing
| ... | @@ -12,6 +12,6 @@ setuptools | ... | @@ -12,6 +12,6 @@ setuptools |
| shortuuid | shortuuid | ||
| tiktoken | tiktoken | ||
| torch | torch | ||
| transformers | transformers>=4.33.0 | ||
| tritonclient[all] | tritonclient[all] | ||
| uvicorn | uvicorn |
Please register or sign in to comment