Unverified Commit fb10ffb5 authored by ZichengMa's avatar ZichengMa Committed by GitHub
Browse files

docs: Add LMCache in vLLM README.md (#2462)


Signed-off-by: default avatarZichengMa <zichengma1225@gmail.com>
parent eaf42eb0
......@@ -41,6 +41,7 @@ git checkout $(git describe --tags $(git rev-list --tags --max-count=1))
| [**SLA-Based Planner**](../../../docs/architecture/sla_planner.md) | ✅ | |
| [**Load Based Planner**](../../../docs/architecture/load_planner.md) | 🚧 | WIP |
| [**KVBM**](../../../docs/architecture/kvbm_architecture.md) | 🚧 | WIP |
| [**LMCache**](./LMCache_Integration.md) | ✅ | |
### Large Scale P/D and WideEP Features
......
......@@ -132,7 +132,6 @@ async def init_prefill(runtime: DistributedRuntime, config: Config):
"""
Instantiate and serve
"""
component = runtime.namespace(config.namespace).component(config.component)
await component.create_service()
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment