1、同步到最新版本;2、增加batch推理接口;3、解决内存泄漏问题;4、修复llama系列流式输出不流畅的问题
Showing
pyfastllm/fastllm/models.py
0 → 100644
pyfastllm/install.sh
0 → 100644
pyfastllm/test_func.sh
0 → 100644
This diff is collapsed.
Please register or sign in to comment