README.md 241 Bytes
Newer Older
Woosuk Kwon's avatar
Woosuk Kwon committed
1
# CacheFlow
Woosuk Kwon's avatar
Woosuk Kwon committed
2
3
4
5

## Installation

```bash
6
pip install psutil numpy torch transformers
7
pip install flash-attn # This may take up to 10 mins.
Woosuk Kwon's avatar
Woosuk Kwon committed
8
9
10
11
12
13
pip install -e .
```

## Run

```bash
Zhuohan Li's avatar
Zhuohan Li committed
14
15
ray start --head
python server.py [--tensor-parallel-size <N>]
Woosuk Kwon's avatar
Woosuk Kwon committed
16
```