Commit ae0539e7 authored by zhangzbb's avatar zhangzbb
Browse files

Update README.md

parent 5379fc16
# <div align="center"><strong>vLLM</strong></div>
# <div align="center"><strong>deprecated vLLM</strong></div>
## 简介
vLLM是一个快速且易于使用的LLM推理和服务库,使用PageAttention高效管理kv内存,Continuous batching传入请求,支持很多Hugging Face模型,如LLaMA & LLaMA-2、Qwen、Chatglm2 & Chatglm3等。
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment