Commit e375aa36 authored by zhangzbb

[DOCs] Add README.md for using vLLM v0.15.1 on DCU, covering introduction, supported models, installation, PD, and EP usage
parent f90d9d07
# <div align="center"><strong>vLLM</strong></div>
# <div align="center"><strong>DCU vLLM</strong></div>
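## Introduction to vLLM_dcu
vLLM is a fast, easy-to-use library for LLM inference and serving. It is a high-performance serving framework for large language models and multimodal models, designed to deliver low-latency, high-throughput inference in settings ranging from a single GPU to large distributed clusters. Building on the open-source community version, we have adapted it to the DCU platform and added targeted optimizations.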
......